Accuracy Optimization in Speech Pathology Diagnosis with Data Preprocessing Techniques

Bibliographic Details
Published in	Optimization, Learning Algorithms and Applications, pp. 287-299
Main Authors	Fernandes, Joana Filipa Teixeira; Freitas, Diamantino Rui; Teixeira, João Paulo
Format	Book Chapter
Language	English
Published	Cham: Springer Nature Switzerland, 2024
Series	Communications in Computer and Information Science
Subjects	Machine Learning; Normalization; Outliers; Speech Features; Speech Pathologies; Vocal Acoustic Analysis
Summary:	Non-invasive acoustic analysis for classifying and identifying speech disorders can reduce waiting times for patients and specialists while also increasing diagnostic accuracy. To select models for a vocal disease diagnosis system, we want to know which models are most successful at distinguishing between healthy and pathological sounds. For this purpose, data from 708 patients covering 19 pathologies and 194 control subjects were used. Each subject has nine sound files (three vowels in three tones), and 13 parameters were extracted from each sound file. For the classification of healthy/pathological individuals, a variety of Machine Learning classifiers were used, including decision trees, discriminant analyses, logistic regression classifiers, naive Bayes classifiers, support vector machines, classifiers of closely related variables, ensemble classifiers and artificial neural network classifiers. For each subject, 118 parameters were used initially. The first analysis aimed to find the best classifier, which was the Ensemble Subspace Discriminant classifier with an accuracy of 81.3%. The second and third analyses aimed to improve on this baseline accuracy using preprocessing methodologies. In the second analysis, the PCA technique was used, giving an accuracy of 80.2%. The third analysis combined several outlier treatment models with several data normalization models; in general, accuracy improved, and the best accuracy (82.9%) was obtained by combining the Grubbs model for outlier treatment with the range model for data normalization.
ISBN:	3031530241 9783031530241
ISSN:	1865-0929 1865-0937
DOI:	10.1007/978-3-031-53025-8_20
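
The summary names a range model for data normalization but does not define it. As a minimal sketch, assuming it means linear min-max rescaling of each feature to [0, 1] (an assumption, not confirmed by this record), the Python below shows the rescaling and how a single extreme value compresses the remaining values, which may be one reason outlier treatment and normalization were studied together. The function name `range_normalize` and the sample values are hypothetical and are not taken from the chapter.

```python
def range_normalize(values):
    """Rescale a list of numbers linearly to the interval [0, 1].

    Assumed reading of "range" normalization: (x - min) / (max - min).
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant feature has no spread; map every value to 0.0.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical values of one acoustic parameter for five subjects.
feature = [0.5, 1.0, 1.5, 2.0, 2.5]
print(range_normalize(feature))

# The same feature with one extreme value appended: the original five
# values are squeezed into the bottom tenth of the [0, 1] interval.
with_outlier = feature + [20.5]
print(range_normalize(with_outlier))
```

Treating the extreme value first (by whatever outlier model is chosen) and normalizing afterwards keeps the typical values spread across the full interval.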