Envelope multi-type transformation ensemble algorithm of Parkinson speech samples

In recent years, the study of machine learning-based speech recognition methods for Parkinson’s disease has become a hot topic in recent years. However, existing related machine learning methods focus on feature learning and classifier design, with minute attention to sample optimization. Studies ha...

Full description

Saved in:

Bibliographic Details
Published in	Applied intelligence (Dordrecht, Netherlands) Vol. 53; no. 12; pp. 15957 - 15978
Main Authors	Li, Yongming, Liu, Chengyu, Wang, Pin, Zhang, Hehua, Wei, Anhai, Zhang, Yanling
Format	Journal Article
Language	English
Published	New York Springer US 01.06.2023 Springer Nature B.V
Subjects	Ablation Algorithms Artificial Intelligence Classifiers Computer Science Datasets Design optimization Machine learning Machines Manufacturing Mechanical Engineering Parkinson's disease Processes Redundancy Segments Speech Speech recognition Transformations Sample optimization Multi-type transformation Speech data Clustering operator Convolution operator Parkinson’s disease Envelope learning
Online Access	Get full text

Cover

Loading…

More Information
Summary:	In recent years, the study of machine learning-based speech recognition methods for Parkinson’s disease has become a hot topic in recent years. However, existing related machine learning methods focus on feature learning and classifier design, with minute attention to sample optimization. Studies have shown that each subject often includes multiple corpus (segment samples, or samples) with different representational abilities and high redundancy, making it necessary to optimize the segment samples within each subject. To solve this problem, this paper proposes a multi-type transformation ensemble algorithm for PD speech samples based on a subject envelope (MTEA). The proposed algorithm takes the speech samples (segment samples) within a subject as an envelope and performs a multi-type transformation on the segment samples within the envelope to construct new segment samples. The quality of the sample transformation is improved by preserving the local structure and global structure information of the samples through the joint structure consistency mechanism (JSCM). In addition, a sparse weighted fusion mechanism is designed to fuse the results of multiple classifiers. Two representative public datasets and one self-collected dataset are used to evaluate the proposed model. Experimental results show that the proposed method is significantly more effective than the compared algorithms. Besides, the ablation experiments show that the multi-type transformation mechanism is effective. Based on this study, the multi-type transformation of PD speech samples will help improve the quality of the existing segment samples in PD speech data and thereby improve classification accuracy.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	0924-669X 1573-7497
DOI:	10.1007/s10489-022-04345-y