Classification and diagnosis model for Alzheimer's disease based on multimodal data fusion
Alzheimer disease (AD) is the most commonly occurring neurodegenerative disease. However, current diagnostics for AD primarily rely on invasive tests, which limit the application of diagnostic procedures in early screening. Speech, as a noninvasive biomarker, is closely associated with AD but has no...
Saved in:
Published in | Medicine (Baltimore) Vol. 103; no. 52; p. e41016 |
---|---|
Main Authors | , , , , , , |
Format | Journal Article |
Language | English |
Published |
Hagerstown, MD
Lippincott Williams & Wilkins
27.12.2024
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Alzheimer disease (AD) is the most commonly occurring neurodegenerative disease. However, current diagnostics for AD primarily rely on invasive tests, which limit the application of diagnostic procedures in early screening. Speech, as a noninvasive biomarker, is closely associated with AD but has not been fully leveraged as a diagnostic tool. This study develops a novel early AD diagnosis method that uses primitive speech and explores its potential application in community screening. Moreover, the study proposes an innovative multimodal method for speech feature fusion that combines acoustic and semantic information to differentiate patients with AD from normal controls. This method uses the ImageBind audio encoder to extract acoustic features and the Embeddings from Language Model to extract semantic features, thereby effectively integrating the features by mid-level fusion. The training set comprises 166 speech recordings, which comprise 87 samples from individuals with AD and 79 samples from healthy control subjects. The ratio of training set to test set is 7:3. Evaluation of the Alzheimer dementia recognition through spontaneous speech only dataset showed that the proposed model achieved a classification accuracy of 0.903 and a recall rate of 1, and it considerably outperformed existing baseline models, thereby confirming the effectiveness of the proposed approach to AD diagnosis. This study applies the multimodal fusion of speech features to an early AD diagnostic procedure and achieves excellent performance. The findings of this study not only provide a new approach to noninvasive AD screening but also open new pathways to the early diagnosis of other neurodegenerative diseases. |
---|---|
Bibliography: | Received: 27 August 2024 / Received in final form: 12 November 2024 / Accepted: 2 December 2024 This study was supported by the "Xinglin Scholars" Research Promotion Project of Chengdu University of Traditional Chinese Medicine (BSH2023025), the 73rd batch of China Postdoctoral Science Foundation (2023M730378), the Postdoctoral Fellowship Program of CPSF (No. GZB20230092), the China Postdoctoral Science Foundation (No. 2023M740383), the Natural Science Foundation of Sichuan Province (No. 24NSFSC1654), the Sichuan Science and Technology Program (2024NSFSC0722). Ethical approval was not necessary for this study as it did not involve human subjects, and all data used were obtained from publicly available sources or preexisting datasets. The authors have no conflicts of interest to disclose. The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request. How to cite this article: Fu Y, Xu L, Zhang Y, Zhang L, Zhang P, Cao L, Jiang T. Classification and diagnosis model for Alzheimer's disease based on multimodal data fusion. Medicine 2024;103:52(e41016). *Correspondence: Tao Jiang, School of Intelligent Medicine, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China (e-mail: jiangtop@cdutcm.edu.cn). ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 1536-5964 0025-7974 1536-5964 |
DOI: | 10.1097/MD.0000000000041016 |