Envelope multi-type transformation ensemble algorithm of Parkinson speech samples
Published in | Applied intelligence (Dordrecht, Netherlands) Vol. 53; no. 12; pp. 15957 - 15978 |
---|---|
Main Authors | , , , , , |
Format | Journal Article |
Language | English |
Published | New York, Springer US, 01.06.2023, Springer Nature B.V |
Summary: | Machine learning-based speech recognition for Parkinson’s disease (PD) has become an active research topic in recent years. However, existing methods focus on feature learning and classifier design and pay little attention to sample optimization. Studies have shown that each subject often contributes multiple corpora (segment samples, or simply samples) with different representational abilities and high redundancy, which makes it necessary to optimize the segment samples within each subject. To address this problem, this paper proposes a multi-type transformation ensemble algorithm for PD speech samples based on a subject envelope (MTEA). The proposed algorithm treats the speech samples (segment samples) within a subject as an envelope and performs a multi-type transformation on the segment samples within the envelope to construct new segment samples. A joint structure consistency mechanism (JSCM) improves the quality of the sample transformation by preserving both the local and the global structure information of the samples. In addition, a sparse weighted fusion mechanism is designed to fuse the results of multiple classifiers. Two representative public datasets and one self-collected dataset are used to evaluate the proposed model. Experimental results show that the proposed method is significantly more effective than the compared algorithms, and the ablation experiments show that the multi-type transformation mechanism is effective. The results suggest that multi-type transformation of PD speech samples can improve the quality of existing segment samples in PD speech data and thereby improve classification accuracy. |
---|---|
ISSN: | 0924-669X 1573-7497 |
DOI: | 10.1007/s10489-022-04345-y |