Comparative Analysis of Cross-Validation Methods on PPMI Dataset

Bibliographic Details
Published in Medical Measurement and Applications (MEMEA), IEEE International Workshop on, pp. 1-5
Main Authors Calomino, Camilla, Bianco, Maria Giovanna, Oliva, Giuseppe, Lagana, Filippo, Pullano, Salvatore A., Quattrone, Andrea
Format Conference Proceeding
Language English
Published IEEE 26.06.2024
ISSN 2837-5882
DOI 10.1109/MeMeA60663.2024.10596885

Abstract Artificial Intelligence (AI) is significantly impacting the management of neurodegenerative diseases in radiology and medical imaging. AI plays a pivotal role in identifying biomarkers in rare disorders for prediction and classification. However, limited dataset size remains a crucial issue, and the high dimensionality of the feature space combined with a restricted cohort exacerbates it. Here, we use a dataset sourced from the Parkinson's Progression Markers Initiative (PPMI). The study included 100 Parkinson's disease (PD) patients and 73 healthy controls (HC), with 160 features spanning imaging and cognitive data. The dataset is partitioned into training (80%) and test (20%) sets. In this work, we compare two cross-validation (CV) methods: nested CV, employed for unbiased performance estimation on the training data, and shuffle CV, which is presented as a drawback of nested CV. Moreover, a novel hybrid approach to feature selection was applied to the MRI data, aiming to enhance the selection of relevant features. The methodology combines correlation analysis with SHAP (SHapley Additive exPlanations). Subsequently, XGBoost models are deployed to classify patients versus healthy subjects. Our findings reveal the superiority of nested CV over shuffle CV when validated on an independent test set, indicating its robustness. Furthermore, our study underscores the significance of identifying informative features, such as differences in brain regions, to enhance the robustness and accuracy of the model. Overall, our research contributes to the advancement of AI applications in neurodegenerative disease management by addressing challenges associated with small dataset sizes and emphasizing the importance of effective feature selection.
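The validation scheme summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the data are synthetic (only the 100/73 cohort sizes and the 80/20 split come from the abstract), a one-feature k-nearest-neighbour classifier stands in for XGBoost, and "shuffle CV" is assumed to mean repeated random train/validation splits, which the abstract does not specify. All function names are hypothetical.

```python
import random

def stratified_split(labels, test_frac, rng):
    """Split indices into train/test while preserving class proportions."""
    train, test = [], []
    for cls in sorted(set(labels)):
        idx = [i for i, lab in enumerate(labels) if lab == cls]
        rng.shuffle(idx)
        n_test = round(len(idx) * test_frac)
        test += idx[:n_test]
        train += idx[n_test:]
    return sorted(train), sorted(test)

def k_folds(indices, k, rng):
    """Yield k (train, val) pairs whose validation sets are disjoint."""
    idx = list(indices)
    rng.shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    for i in range(k):
        train = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield train, folds[i]

def shuffle_splits(indices, n_splits, val_frac, rng):
    """Yield independent random (train, val) pairs; validation sets may overlap."""
    idx = list(indices)
    n_val = round(len(idx) * val_frac)
    for _ in range(n_splits):
        rng.shuffle(idx)
        yield idx[n_val:], idx[:n_val]

def accuracy(x, y, train, val, k):
    """Accuracy of a 1-D k-NN classifier (stand-in model, not XGBoost)."""
    hits = 0
    for i in val:
        nearest = sorted(train, key=lambda j: abs(x[j] - x[i]))[:k]
        pred = int(2 * sum(y[j] for j in nearest) > k)
        hits += pred == y[i]
    return hits / len(val)

def nested_cv(x, y, idx, grid, rng, outer_k=5, inner_k=3):
    """Outer folds estimate performance; inner folds pick the hyperparameter."""
    scores = []
    for dev, held_out in k_folds(idx, outer_k, rng):
        inner = list(k_folds(dev, inner_k, rng))  # tuning never sees held_out
        def inner_score(k):
            return sum(accuracy(x, y, tr, va, k) for tr, va in inner) / inner_k
        best_k = max(grid, key=inner_score)
        scores.append(accuracy(x, y, dev, held_out, best_k))
    return sum(scores) / len(scores)

def shuffle_cv(x, y, idx, grid, rng, n_splits=5, val_frac=0.2):
    """Non-nested variant: the same random splits both tune and score."""
    splits = list(shuffle_splits(idx, n_splits, val_frac, rng))
    def score(k):
        return sum(accuracy(x, y, tr, va, k) for tr, va in splits) / n_splits
    best_k = max(grid, key=score)
    return best_k, score(best_k)

rng = random.Random(0)
y = [1] * 100 + [0] * 73  # cohort sizes from the abstract: 100 PD, 73 HC
x = [rng.gauss(1.0 if lab else 0.0, 1.0) for lab in y]  # synthetic feature
train_idx, test_idx = stratified_split(y, 0.2, rng)     # 80% / 20%
grid = (1, 3, 5, 7)  # candidate neighbour counts (odd, so votes cannot tie)

nested_score = nested_cv(x, y, train_idx, grid, rng)
best_k, shuffle_score = shuffle_cv(x, y, train_idx, grid, rng)
test_score = accuracy(x, y, train_idx, test_idx, best_k)
print(f"nested CV: {nested_score:.3f}  shuffle CV: {shuffle_score:.3f}  "
      f"held-out test: {test_score:.3f}")
```

In the nested variant the hyperparameter is chosen without access to the fold used for scoring, which is why its estimate is usually regarded as less optimistic; comparing both estimates with the held-out test score mirrors the comparison the abstract reports, though on toy data the numbers carry no clinical meaning.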
Author Lagana, Filippo
Bianco, Maria Giovanna
Oliva, Giuseppe
Quattrone, Andrea
Calomino, Camilla
Pullano, Salvatore A.
Author_xml – sequence: 1
  givenname: Camilla
  surname: Calomino
  fullname: Calomino, Camilla
  email: camilla.calomino@unicz.it
  organization: Neuroscience Research Center, Magna Graecia University, Catanzaro, Italy
– sequence: 2
  givenname: Maria Giovanna
  surname: Bianco
  fullname: Bianco, Maria Giovanna
  email: mg.bianco@unicz.it
  organization: Neuroscience Research Center, Magna Graecia University, Catanzaro, Italy
– sequence: 3
  givenname: Giuseppe
  surname: Oliva
  fullname: Oliva, Giuseppe
  email: giuseppe.oliva@unicz.it
  organization: "Magna Græcia" University, Dept. of Health Sciences, Catanzaro, Italy
– sequence: 4
  givenname: Filippo
  surname: Lagana
  fullname: Lagana, Filippo
  email: filippo.lagana@unicz.it
  organization: "Magna Græcia" University, Dept. of Health Sciences, Catanzaro, Italy
– sequence: 5
  givenname: Salvatore A.
  surname: Pullano
  fullname: Pullano, Salvatore A.
  email: pullano@unicz.it
  organization: "Magna Græcia" University, Dept. of Health Sciences, Catanzaro, Italy
– sequence: 6
  givenname: Andrea
  surname: Quattrone
  fullname: Quattrone, Andrea
  email: an.quattrone@unicz.it
  organization: Institute of Neurology, Magna Graecia University, Catanzaro, Italy
ContentType Conference Proceeding
DOI 10.1109/MeMeA60663.2024.10596885
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
Discipline Engineering
EISBN 9798350307993
EISSN 2837-5882
EndPage 5
ExternalDocumentID 10596885
Genre orig-research
IsPeerReviewed false
IsScholarly false
Language English
PageCount 5
PublicationDate 2024-June-26
PublicationDateYYYYMMDD 2024-06-26
PublicationDate_xml – month: 06
  year: 2024
  text: 2024-June-26
  day: 26
PublicationDecade 2020
PublicationTitle Medical Measurement and Applications (MEMEA), IEEE International Workshop on
PublicationTitleAbbrev MEMEA
PublicationYear 2024
Publisher IEEE
Publisher_xml – name: IEEE
StartPage 1
SubjectTerms Artificial Intelligence
Brain modeling
Feature extraction
Magnetic resonance imaging
Nested Cross Validation
Parkinson's disease
Radiology
Select shuffle test
SHAP
Training
Training data
Title Comparative Analysis of Cross-Validation Methods on PPMI Dataset
URI https://ieeexplore.ieee.org/document/10596885