Estimation of age from speech using excitation source features

Extraction of various distinct features from speech, Gaussian mixture models (GMMs) have been employed as classifiers. Age estimation performance has been inferred by employing excitation source features (LPCCs). The detected age of a speaker is deemed to belong to any of the 9 age groups of 5–10, 1...

Full description

Saved in:
Bibliographic Details
Published inMaterials today : proceedings Vol. 46; pp. 11046 - 11049
Main Authors Avikal, Shwetank, Sharma, Kritika, Barthwal, Anuragh, Nithin Kumar, K.C., Kumar Badhotiya, Gaurav
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Extraction of various distinct features from speech, Gaussian mixture models (GMMs) have been employed as classifiers. Age estimation performance has been inferred by employing excitation source features (LPCCs). The detected age of a speaker is deemed to belong to any of the 9 age groups of 5–10, 10–15, 15–20, 20–25, 25–30, 30–35, 35–40, 40–45 and 45–50. An age group speech corpus has been collected in this work by recording voice of speakers of Hindi language of different age groups and dialects using ten text prompts in neutral speech. Textually neutral Hindi words have been used to construct text prompts which have been recorded in neutral emotion. Different age group has been characterized by using these texts prompt. The average age performance of multispeaker (male + female) is around 94%. In this research, classification of different group of speakers has been done on the basis of excitation source in human speech.
ISSN:2214-7853
2214-7853
DOI:10.1016/j.matpr.2021.02.159