A STUDY ON THE USE OF NORMALIZED L2-METRIC IN CLASSIFICATION TASKS

Context. In machine learning, similarity measures, and distance metrics are pivotal in tasks like classification, clustering, and dimensionality reduction. The effectiveness of traditional metrics, such as Euclidean distance, can be limited when applied to complex datasets. The object of the study i...

Full description

Saved in:
Bibliographic Details
Published inRadìoelektronika, informatika, upravlìnnâ no. 2; pp. 110 - 115
Main Author Kondruk, N. E
Format Journal Article
LanguageEnglish
Published 29.06.2025
Online AccessGet full text
ISSN1607-3274
2313-688X
DOI10.15588/1607-3274-2025-2-9

Cover

Loading…
Abstract Context. In machine learning, similarity measures, and distance metrics are pivotal in tasks like classification, clustering, and dimensionality reduction. The effectiveness of traditional metrics, such as Euclidean distance, can be limited when applied to complex datasets. The object of the study is the processes of data classification and dimensionality reduction in machine learning tasks, in particular, the use of metric methods to assess the similarity between objects.Objective. The study aims to evaluate the feasibility and performance of a normalized L2-metric (Normalized Euclidean Distance, NED) for improving the accuracy of machine learning algorithms, specifically in classification and dimensionality reduction.Method. We prove mathematically that the normalized L2-metric satisfies the properties of boundedness, scale invariance, and monotonicity. It is shown that NED can be interpreted as a measure of dissimilarity of feature vectors. Its integration into k-nearest neighbors and t-SNE algorithms is investigated using a high-dimensional Alzheimer’s disease dataset. The study implemented four models combining different approaches to classification and dimensionality reduction. Model M1 utilized the k-nearest neighbors method with Euclidean distance without dimensionality reduction, serving as a baseline; Model M2 employed the normalized L2-metric in kNN; Model M3 integrated t-SNE for dimensionality reduction followed by kNN based on Euclidean distance; and Model M4 combined t-SNE and the normalized L2-metric for both reduction and classification stages. A hyperparameter optimization prоcedure was implemented for all models, including the number of neighbors, voting type, and the perplexity parameter for t-SNE. Cross-validation was conducted on five folds to evaluate classification quality objectively. Additionally, the impact of data normalization on model accuracy was examined.Results. Models using NED consistently outperformed models based on Euclidean distance, with the highest classification accuracy of 91.4% achieved when it was used in t-SNE and the nearest neighbor method (Model M4). This emphasizes the adaptability of NED to complex data structures and its advantage in preserving key features in high and low-dimensional spaces.Conclusions. The normalized L2-metric shows potential as an effective measure of dissimilarity for machine learning tasks. It improves the performance of algorithms while maintaining scalability and robustness, which indicates its suitability for various applications in high-dimensional data contexts.
AbstractList Context. In machine learning, similarity measures, and distance metrics are pivotal in tasks like classification, clustering, and dimensionality reduction. The effectiveness of traditional metrics, such as Euclidean distance, can be limited when applied to complex datasets. The object of the study is the processes of data classification and dimensionality reduction in machine learning tasks, in particular, the use of metric methods to assess the similarity between objects.Objective. The study aims to evaluate the feasibility and performance of a normalized L2-metric (Normalized Euclidean Distance, NED) for improving the accuracy of machine learning algorithms, specifically in classification and dimensionality reduction.Method. We prove mathematically that the normalized L2-metric satisfies the properties of boundedness, scale invariance, and monotonicity. It is shown that NED can be interpreted as a measure of dissimilarity of feature vectors. Its integration into k-nearest neighbors and t-SNE algorithms is investigated using a high-dimensional Alzheimer’s disease dataset. The study implemented four models combining different approaches to classification and dimensionality reduction. Model M1 utilized the k-nearest neighbors method with Euclidean distance without dimensionality reduction, serving as a baseline; Model M2 employed the normalized L2-metric in kNN; Model M3 integrated t-SNE for dimensionality reduction followed by kNN based on Euclidean distance; and Model M4 combined t-SNE and the normalized L2-metric for both reduction and classification stages. A hyperparameter optimization prоcedure was implemented for all models, including the number of neighbors, voting type, and the perplexity parameter for t-SNE. Cross-validation was conducted on five folds to evaluate classification quality objectively. Additionally, the impact of data normalization on model accuracy was examined.Results. Models using NED consistently outperformed models based on Euclidean distance, with the highest classification accuracy of 91.4% achieved when it was used in t-SNE and the nearest neighbor method (Model M4). This emphasizes the adaptability of NED to complex data structures and its advantage in preserving key features in high and low-dimensional spaces.Conclusions. The normalized L2-metric shows potential as an effective measure of dissimilarity for machine learning tasks. It improves the performance of algorithms while maintaining scalability and robustness, which indicates its suitability for various applications in high-dimensional data contexts.
Author Kondruk, N. E
Author_xml – sequence: 1
  givenname: N. E
  surname: Kondruk
  fullname: Kondruk, N. E
BookMark eNo9kNFOgzAYhRszE3HuCbzpC1Tbv7SUS2SwNTJIVkjUm4Z2JdHoZsAb315R49W5-c5JvnOJFsfTMSB0zegNE0KpWyZpQjgkMQEKggBJz1AEnHEilXpYoOgfuECraXqhlDKhJIuTCN1l2LTd-hE3NW63Be5MgZsS181-l1X6qVjjCsiuaPc6x7rGeZUZo0udZ62eG5m5N1fofOhfp7D6yyXqyqLNt6RqNt9gRTxL4g8SQPZu8GlInQgS-hg8TSkodpABlBPAesmUPKhEBOcEH2IplVNy8KC4D54vEf_d9eNpmsYw2Pfx-a0fPy2j9ucJO4vaWdTOT1iwKf8CoQhL7g
ContentType Journal Article
DBID AAYXX
CITATION
DOI 10.15588/1607-3274-2025-2-9
DatabaseName CrossRef
DatabaseTitle CrossRef
DatabaseTitleList CrossRef
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISSN 2313-688X
EndPage 115
ExternalDocumentID 10_15588_1607_3274_2025_2_9
GroupedDBID 9MQ
AAYXX
ADBBV
ALMA_UNASSIGNED_HOLDINGS
BCNDV
CITATION
GROUPED_DOAJ
ID FETCH-LOGICAL-c174t-e26abfc9e9b5e62a42c090281d6e28b521a6186d875ebb53f4668b86fc283cec3
ISSN 1607-3274
IngestDate Thu Jul 03 08:38:26 EDT 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 2
Language English
License https://creativecommons.org/licenses/by-sa/4.0
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c174t-e26abfc9e9b5e62a42c090281d6e28b521a6186d875ebb53f4668b86fc283cec3
OpenAccessLink https://ric.zp.edu.ua/article/download/332936/321874
PageCount 6
ParticipantIDs crossref_primary_10_15588_1607_3274_2025_2_9
PublicationCentury 2000
PublicationDate 2025-06-29
PublicationDateYYYYMMDD 2025-06-29
PublicationDate_xml – month: 06
  year: 2025
  text: 2025-06-29
  day: 29
PublicationDecade 2020
PublicationTitle Radìoelektronika, informatika, upravlìnnâ
PublicationYear 2025
SSID ssj0001586147
ssib018208917
ssib015895113
ssib044757822
Score 2.2960289
Snippet Context. In machine learning, similarity measures, and distance metrics are pivotal in tasks like classification, clustering, and dimensionality reduction. The...
SourceID crossref
SourceType Index Database
StartPage 110
Title A STUDY ON THE USE OF NORMALIZED L2-METRIC IN CLASSIFICATION TASKS
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV07b9swECbadGmHok1b9A0O2RymFS3S1Ki6NuLElgE_gLSLIFLUksIODLlDf33vKOrRJAiSLnoQwlHQfTh-R92DkCMs8TXIzVeWhYKzUMGVjnTARFQA4dWBUAa3BmaJPF2HZxfiou3f6bJLSn1i_tyaV_I_WoUx0CtmyT5As41QGIBr0C8cQcNwvJeOY0fofvTmiYvcWS9HGMSTzBezeDr5Ofrem3I2G60WE0xK6g2nMZjOsU8d7q3i5fmyy00XoG38bT7cwlJ06XrjXDpq6Yurlv52f7XLfv-qHgX2i-cmRPl8u8l3e9_Xx2c5-D0FLjD2yW88VGZQ4u4lr_rnnFg3BkSwz6RyjYBrVPCO6Qt8eKr1d-JWAy2EwqyDZgLmpucsatej-h_8tWWqCR5EtwXFpCgkRSEpCkl5Gj0mTzi4C7zjWoNdAcABj2xpC9asVx03FWseDuoyhlV2uQLW4vry1O_pC1bhvF9uvnyH1HTYyeoFee7dChpXGHlJHtnNIXnWKTb5inyLqUMLnScU0EIBLXQ-pi1aaIMWOknov2ihDi2vyXo8Wg1Pme-gwQx4miWzXGa6MJGNtLCSZyE3GIcLPoq0XGmgbhn2S8jBabVai34RSqm0koUB1mms6b8hB5vtxr4lNOJ6IINC2CALQ2Fy8EONMoHIsbKxtcU7clx_gvSqKpSS3qGp9w97_AN52uL0Izkod3v7CdhgqT87Vf8F9lFClw
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+STUDY+ON+THE+USE+OF+NORMALIZED+L2-METRIC+IN+CLASSIFICATION+TASKS&rft.jtitle=Rad%C3%ACoelektronika%2C+informatika%2C+upravl%C3%ACnn%C3%A2&rft.au=Kondruk%2C+N.+E&rft.date=2025-06-29&rft.issn=1607-3274&rft.eissn=2313-688X&rft.issue=2&rft.spage=110&rft.epage=115&rft_id=info:doi/10.15588%2F1607-3274-2025-2-9&rft.externalDBID=n%2Fa&rft.externalDocID=10_15588_1607_3274_2025_2_9
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1607-3274&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1607-3274&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1607-3274&client=summon