Comparative Analysis of Cross-Validation Methods on PPMI Dataset
Artificial Intelligence (AI) is significantly impacting the management of neurodegenerative diseases in the realm of radiology and medical imaging. AI plays a pivotal role in identifying biomarkers in rare disorders for prediction and classification. However, a crucial issue remains due to the chall...
Saved in:
Published in | Medical Measurement and Applications (MEMEA), IEEE International Workshop on pp. 1 - 5 |
---|---|
Main Authors | , , , , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
26.06.2024
|
Subjects | |
Online Access | Get full text |
ISSN | 2837-5882 |
DOI | 10.1109/MeMeA60663.2024.10596885 |
Cover
Loading…
Abstract | Artificial Intelligence (AI) is significantly impacting the management of neurodegenerative diseases in the realm of radiology and medical imaging. AI plays a pivotal role in identifying biomarkers in rare disorders for prediction and classification. However, a crucial issue remains due to the challenges posed by limited dataset size. The high dimensionality of the feature space and the restricted cohort exacerbate these challenges. Here, we use a dataset sourced from the Parkinson's Progression Markers Initiative (PPMI). In this study, 100 Parkinson's disease (PD) patients and 73 healthy controls (HC) were included, encompassing 160 features of both imaging and cognitive data. The dataset is partitioned into training (80%) and test sets (20%). In this work, we compare two cross-validation (CV) methods: nested CV, employed for unbiased performance estimation on the training data, and shuffle CV that is a drawback of nested. Moreover, a novel hybrid approach to feature selection was applied on MRI data, aiming to enhance the selection of relevant features. The methodology combines correlation analysis with SHAP (SHapley Additive exPlanations). Subsequently, XGBoost models is deployed for the classification of patients from healthy subjects. Our findings reveal the superiority of nested CV over shuffle CV when validating an independent test set, indicating its robustness. Furthermore, our study underscores the significance of identifying informative features, such as differences in brain regions, to enhance the robustness and accuracy of the model. Overall, our research contributes to the advancement of AI applications in neurodegenerative disease management by addressing challenges associated with small dataset sizes and emphasizing the importance of effective feature selection. |
---|---|
AbstractList | Artificial Intelligence (AI) is significantly impacting the management of neurodegenerative diseases in the realm of radiology and medical imaging. AI plays a pivotal role in identifying biomarkers in rare disorders for prediction and classification. However, a crucial issue remains due to the challenges posed by limited dataset size. The high dimensionality of the feature space and the restricted cohort exacerbate these challenges. Here, we use a dataset sourced from the Parkinson's Progression Markers Initiative (PPMI). In this study, 100 Parkinson's disease (PD) patients and 73 healthy controls (HC) were included, encompassing 160 features of both imaging and cognitive data. The dataset is partitioned into training (80%) and test sets (20%). In this work, we compare two cross-validation (CV) methods: nested CV, employed for unbiased performance estimation on the training data, and shuffle CV that is a drawback of nested. Moreover, a novel hybrid approach to feature selection was applied on MRI data, aiming to enhance the selection of relevant features. The methodology combines correlation analysis with SHAP (SHapley Additive exPlanations). Subsequently, XGBoost models is deployed for the classification of patients from healthy subjects. Our findings reveal the superiority of nested CV over shuffle CV when validating an independent test set, indicating its robustness. Furthermore, our study underscores the significance of identifying informative features, such as differences in brain regions, to enhance the robustness and accuracy of the model. Overall, our research contributes to the advancement of AI applications in neurodegenerative disease management by addressing challenges associated with small dataset sizes and emphasizing the importance of effective feature selection. |
Author | Lagana, Filippo Bianco, Maria Giovanna Oliva, Giuseppe Quattrone, Andrea Calomino, Camilla Pullano, Salvatore A. |
Author_xml | – sequence: 1 givenname: Camilla surname: Calomino fullname: Calomino, Camilla email: camilla.calomino@unicz.it organization: Neuroscience Research Center, Magna Graecia University,Catanzaro,Italy – sequence: 2 givenname: Maria Giovanna surname: Bianco fullname: Bianco, Maria Giovanna email: mg.bianco@unicz.it organization: Neuroscience Research Center, Magna Graecia University,Catanzaro,Italy – sequence: 3 givenname: Giuseppe surname: Oliva fullname: Oliva, Giuseppe email: giuseppe.oliva@unicz.it organization: "Magna Græcia" University,Dept. of Health Sciences,Catanzaro,Italy – sequence: 4 givenname: Filippo surname: Lagana fullname: Lagana, Filippo email: filippo.lagana@unicz.it organization: "Magna Græcia" University,Dept. of Health Sciences,Catanzaro,Italy – sequence: 5 givenname: Salvatore A. surname: Pullano fullname: Pullano, Salvatore A. email: pullano@unicz.it organization: "Magna Græcia" University,Dept. of Health Sciences,Catanzaro,Italy – sequence: 6 givenname: Andrea surname: Quattrone fullname: Quattrone, Andrea email: an.quattrone@unicz.it organization: Institute of neurology, Magna Graecia University,Catanzaro,Italy |
BookMark | eNo1z81KxDAUhuEoCo5j78BFbqD1JGnSZGepfwNTnIW6HU7bE6x0mqEpwty9BXX1LR744L1mF2MYiTEuIBMC3F1NNZUGjFGZBJlnArQz1uozlrjCWaVBQeGcOmcraVWRamvlFUti_AJYSABYs2L3VTgcccK5_yZejjicYh958LyaQozpBw59t2AYeU3zZ-gWG_luV2_4A84Yab5hlx6HSMnfrtn70-Nb9ZJuX583VblNe1GYOc2hEblUWufQCUWq80VroEGQrS7QWlAeOmw9qQZ11-ZoPWkib4yUrXSNWrPb39-eiPbHqT_gdNr_R6sfWmFN3A |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/MeMeA60663.2024.10596885 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering |
EISBN | 9798350307993 |
EISSN | 2837-5882 |
EndPage | 5 |
ExternalDocumentID | 10596885 |
Genre | orig-research |
GroupedDBID | 6IE 6IF 6IK 6IL 6IN AAJGR ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IPLJI M43 OCL RIE RIL RNS |
ID | FETCH-LOGICAL-i176t-40b14235540d13e3df7c60ba02c57a8803f0dacfe3ba5dc4a8fe5eef6622c29b3 |
IEDL.DBID | RIE |
IngestDate | Wed Aug 27 02:36:36 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i176t-40b14235540d13e3df7c60ba02c57a8803f0dacfe3ba5dc4a8fe5eef6622c29b3 |
PageCount | 5 |
ParticipantIDs | ieee_primary_10596885 |
PublicationCentury | 2000 |
PublicationDate | 2024-June-26 |
PublicationDateYYYYMMDD | 2024-06-26 |
PublicationDate_xml | – month: 06 year: 2024 text: 2024-June-26 day: 26 |
PublicationDecade | 2020 |
PublicationTitle | Medical Measurement and Applications (MEMEA), IEEE International Workshop on |
PublicationTitleAbbrev | MEMEA |
PublicationYear | 2024 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0003010086 |
Score | 1.87718 |
Snippet | Artificial Intelligence (AI) is significantly impacting the management of neurodegenerative diseases in the realm of radiology and medical imaging. AI plays a... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 1 |
SubjectTerms | Artificial Intelligence Brain modeling Feature extraction Magnetic resonance imaging Nested Cross Validation Parkinson's disease Radiology Select shuffle test SHAP Training Training data |
Title | Comparative Analysis of Cross-Validation Methods on PPMI Dataset |
URI | https://ieeexplore.ieee.org/document/10596885 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFH64nfTir4m_ycFru7T50famTMcUOnZwsttI0hcQoRNpL_71Ju26qSB4KymBkLR833t53_cAbmJmMk01DzCSIuDcZEFmGXNETjh2HknNuRcK51M5mfOnhVisxeqNFgYRm-IzDP1jc5dfrEztU2VDzwVkmooe9Fzk1oq1NgkV96V6ft5V69BsmGOOd56gMxcHxjzspv9opNLgyHgfpt0K2vKRt7CudGg-f5kz_nuJBzDYSvbIbANGh7CD5RHsfXMbPIbb0dbpm3RmJGRlycgjZfDiGHnbYInkTVtp964ks1n-SO5V5cCuGsB8_PA8mgTrBgrBa5TIysWGPsXjGQUtfLazsImRVCsaG5Eo9-cySwtlLDKtRGG4Si0KRCtlHJs40-wE-uWqxFMgbkRipgzVTHNphE4iZRgWJnEwqFN-BgO_Gcv31iNj2e3D-R_jF7Drz8QXXcXyEvrVR41XDt4rfd0c6xfdLKOW |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3LSsNAFL1oXagbXxXfzsJtYpJ5JLNTqqXVpnTRSndlZnIDIqQi6cavdyZpWhUEd2ECwzAPzpk795wLcBNRI3WgmYeh4B5jRnoyp9QSOW7ZeSg0Y04onA5Fb8Kepny6FKtXWhhErJLP0Hef1Vt-NjcLFyq7dVxAJAnfhC0L_EzWcq1VSMXuVcfQm3ydQN6mmOK9o-jU3gQj5jcd_CilUiFJdw-GzRjqBJI3f1Fq33z-smf89yD3ob0W7ZHRCo4OYAOLQ9j95jd4BHedtdc3aexIyDwnHYeV3ovl5HWJJZJWhaXtv4KMRmmfPKjSwl3Zhkn3cdzpecsSCt5rGIvS3g5dkMdxiiBz8c4sj40ItAoiw2Nlzy7Ng0yZHKlWPDNMJTlyxFyIKDKR1PQYWsW8wBMgtkWgVCbQVDNhuI5DZShmJrZAqBN2Cm03GbP32iVj1szD2R_t17DdG6eD2aA_fD6HHbc-LgUrEhfQKj8WeGnBvtRX1RJ_AaTrpuY |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Medical+Measurement+and+Applications+%28MEMEA%29%2C+IEEE+International+Workshop+on&rft.atitle=Comparative+Analysis+of+Cross-Validation+Methods+on+PPMI+Dataset&rft.au=Calomino%2C+Camilla&rft.au=Bianco%2C+Maria+Giovanna&rft.au=Oliva%2C+Giuseppe&rft.au=Lagana%2C+Filippo&rft.date=2024-06-26&rft.pub=IEEE&rft.eissn=2837-5882&rft.spage=1&rft.epage=5&rft_id=info:doi/10.1109%2FMeMeA60663.2024.10596885&rft.externalDocID=10596885 |