Automatic Recognition of Speaker Age and Gender Based on Deep Neural Networks

In the given article, we present a novel approach in the paralinguistic field of age and gender recognition by speaker voice based on deep neural networks. The training and testing of proposed models were implemented on the German speech corpus aGender. We conducted experiments using different netwo...

Full description

Saved in:

Bibliographic Details
Published in	Speech and Computer Vol. 11658; pp. 327 - 336
Main Authors	Markitantov, Maxim, Verkholyak, Oxana
Format	Book Chapter
Language	English
Published	Switzerland Springer International Publishing AG 2019 Springer International Publishing
Series	Lecture Notes in Computer Science
Subjects	Age and gender recognition Computational Paralinguistics Convolutional neural networks Deep neural networks Machine learning
Online Access	Get full text
ISBN	3030260607 9783030260606
ISSN	0302-9743 1611-3349
DOI	10.1007/978-3-030-26061-3_34

Cover

Abstract	In the given article, we present a novel approach in the paralinguistic field of age and gender recognition by speaker voice based on deep neural networks. The training and testing of proposed models were implemented on the German speech corpus aGender. We conducted experiments using different network topologies, including neural networks with fully-connected and convolutional layers. In a joint recognition of speaker age and gender, our system reached the recognition performance measured as unweighted accuracy of 48.41%. In a separate age and gender recognition setup, the obtained performance was 57.53% and 88.80%, respectively. Applied deep neural networks provide the best result of speaker age recognition in comparison to existing traditional classification methods.
AbstractList	In the given article, we present a novel approach in the paralinguistic field of age and gender recognition by speaker voice based on deep neural networks. The training and testing of proposed models were implemented on the German speech corpus aGender. We conducted experiments using different network topologies, including neural networks with fully-connected and convolutional layers. In a joint recognition of speaker age and gender, our system reached the recognition performance measured as unweighted accuracy of 48.41%. In a separate age and gender recognition setup, the obtained performance was 57.53% and 88.80%, respectively. Applied deep neural networks provide the best result of speaker age recognition in comparison to existing traditional classification methods.
Author	Verkholyak, Oxana Markitantov, Maxim
Author_xml	– sequence: 1 givenname: Maxim surname: Markitantov fullname: Markitantov, Maxim email: m.markitantov@yandex.ru organization: St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences (SPIIRAS), St. Petersburg, Russia – sequence: 2 givenname: Oxana surname: Verkholyak fullname: Verkholyak, Oxana email: overkholyak@gmail.com organization: St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences (SPIIRAS), St. Petersburg, Russia
BookMark	eNo1kNtOwzAMhsNRbGNvwEVfoGDXaZNcjtNAGiBxuI7S1h1j0JSmE69PxuHK9v_7t-RvLPZb37IQJwinCKDOjNIppUCQZgUUmJIluSOmUaYo_mi0K0ZYYPRImj0x_jdA7YvRtk-NknQoxogoc2Mk4ZGYhvAGAFkGBpUeibvZZvAfblhVySNXftmuhpVvE98kTx27NffJbMmJa-tkzm0dx3MXuE7iyiVzl9zzpnfvsQxfvl-HY3HQuPfA0786ES_XV88XN-niYX57MVukXSZpSCWDBm2K0ik0Ze1ko6q6gcJocmWtS6kU1oCYE5VFnZFEnSmpdQONQ9JEE5H93g1dv2qX3NvS-3WwCHZLz0ZMlmxkYH9I2S29GJK_oa73nxsOg-VtquJ2iD9Ur64buA82Nxma3FgyMSeRvgHW_21-
ContentType	Book Chapter
Copyright	Springer Nature Switzerland AG 2019
Copyright_xml	– notice: Springer Nature Switzerland AG 2019
DBID	FFUUA
DEWEY	6.35
DOI	10.1007/978-3-030-26061-3_34
DatabaseName	ProQuest Ebook Central - Book Chapters - Demo use only
DatabaseTitleList
DeliveryMethod	fulltext_linktorsrc
Discipline	Computer Science
EISBN	9783030260613 3030260615
EISSN	1611-3349
Editor	Salah, Albert Ali Potapova, Rodmonga Karpov, Alexey
Editor_xml	– sequence: 1 fullname: Salah, Albert Ali – sequence: 2 fullname: Karpov, Alexey – sequence: 3 fullname: Potapova, Rodmonga
EndPage	336
ExternalDocumentID	EBC5921959_391_341
GroupedDBID	38. AABBV AEDXK AEJLV AEKFX AIFIR ALMA_UNASSIGNED_HOLDINGS AYMPB BBABE CXBFT CZZ EXGDT FCSXQ FFUUA I4C IEZ MGZZY NSQWD OORQV SBO TPJZQ TSXQS Z5O Z7R Z7S Z7U Z7V Z7W Z7X Z7Y Z7Z Z81 Z82 Z83 Z84 Z85 Z87 Z88 -DT -~X 29L 2HA 2HV ACGFS ADCXD EJD F5P LAS LDH P2P RSU ~02
ID	FETCH-LOGICAL-p243t-4e080896ba719bda4f7cdf06983abd8b4771d011533b6d2341827488f0fa13833
ISBN	3030260607 9783030260606
ISSN	0302-9743
IngestDate	Tue Jul 29 19:56:57 EDT 2025 Fri Apr 11 21:40:43 EDT 2025
IsPeerReviewed	true
IsScholarly	true
LCCallNum	Q334-342
Language	English
LinkModel	OpenURL
MergedId	FETCHMERGED-LOGICAL-p243t-4e080896ba719bda4f7cdf06983abd8b4771d011533b6d2341827488f0fa13833
OCLC	1114599431
PQID	EBC5921959_391_341
PageCount	10
ParticipantIDs	springer_books_10_1007_978_3_030_26061_3_34 proquest_ebookcentralchapters_5921959_391_341
PublicationCentury	2000
PublicationDate	2019 20190724
PublicationDateYYYYMMDD	2019-01-01 2019-07-24
PublicationDate_xml	– year: 2019 text: 2019
PublicationDecade	2010
PublicationPlace	Switzerland
PublicationPlace_xml	– name: Switzerland – name: Cham
PublicationSeriesSubtitle	Lecture Notes in Artificial Intelligence
PublicationSeriesTitle	Lecture Notes in Computer Science
PublicationSeriesTitleAlternate	Lect.Notes Computer
PublicationSubtitle	21st International Conference, SPECOM 2019, Istanbul, Turkey, August 20-25, 2019, Proceedings
PublicationTitle	Speech and Computer
PublicationYear	2019
Publisher	Springer International Publishing AG Springer International Publishing
Publisher_xml	– name: Springer International Publishing AG – name: Springer International Publishing
RelatedPersons	Hartmanis, Juris Gao, Wen Bertino, Elisa Woeginger, Gerhard Goos, Gerhard Steffen, Bernhard Yung, Moti
RelatedPersons_xml	– sequence: 1 givenname: Gerhard surname: Goos fullname: Goos, Gerhard organization: Karlsruhe Institute of Technology, Karlsruhe, Germany – sequence: 2 givenname: Juris surname: Hartmanis fullname: Hartmanis, Juris organization: Cornell University, Ithaca, USA – sequence: 3 givenname: Elisa surname: Bertino fullname: Bertino, Elisa organization: Purdue University, West Lafayette, USA – sequence: 4 givenname: Wen surname: Gao fullname: Gao, Wen organization: Peking University, Beijing, China – sequence: 5 givenname: Bernhard surname: Steffen fullname: Steffen, Bernhard organization: TU Dortmund University, Dortmund, Germany – sequence: 6 givenname: Gerhard surname: Woeginger fullname: Woeginger, Gerhard organization: RWTH Aachen, Aachen, Germany – sequence: 7 givenname: Moti surname: Yung fullname: Yung, Moti organization: Columbia University, New York, USA
SSID	ssj0002209178 ssj0002792
Score	2.0375402
Snippet	In the given article, we present a novel approach in the paralinguistic field of age and gender recognition by speaker voice based on deep neural networks. The...
SourceID	springer proquest
SourceType	Publisher
StartPage	327
SubjectTerms	Age and gender recognition Computational Paralinguistics Convolutional neural networks Deep neural networks Machine learning
Title	Automatic Recognition of Speaker Age and Gender Based on Deep Neural Networks
URI	http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=5921959&ppg=341 http://link.springer.com/10.1007/978-3-030-26061-3_34
Volume	11658
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3Nb9MwFLdYuSAOjC8xYMgHbpWnxE5s58BhbEPTNIqEtombZScOQtPaqs2mjb-e57y4SbtdxiWqXMtx3899fn4fPxPyWSldyoSXrFCuYJlSOdO60kwWpeTcWtDLLdvnRB6fZye_8kGgva0uadxe-ffBupL_QRXaANdQJfsIZFeDQgN8BnzhCQjDc8P4XXezYg3H3PsS69Li1Qy9e3kBvcP1wDdYj3P75yp-d-EXl6Dx7myrBn_c2qkdLpv962aGLK4_Y2oRWpTwOnsZEup_Y8AB76Abf4VdsAoRh0Pv5-NA9QGYTzC3HM31IAm__HLaBSsms6bNAVtNOqqXof-hLXli_L7_ccOD2TvR1g6ssGEGEjOZyIGegyZYKsjWtOdRD8vAriiQzbTTrQJJBLptWiBvyr0dYJj0ASOz8DYYyohsi2wpnY3I0_2jk9OLlSOOczCZVE8xHxgVMfSEswoFQXHWCimb-l8xKMZ86JVrx5aNSHtrwJxtk-ehqIWGahOQ30vyxE9fkRcRAtpB8Jp8X-FPB_jTWU07_CngTwF_ivjTFn8KXQL-FPGnEf835Pzb0dnBMetu3GBznomGZR4OELqQzqq0cJXNalVWdSILLayrtIP_clqFQ4QQTlYcLCDNFWwBdVLbVGgh3pLRdDb17wi1Lq_LwvIyTR3sCxYsIZdlVorE8lTmfIewKBrT5gV0ycglCmJp8oIH4iMjihQkme6QcZSfCd2XJhJug-CNMCB40wreBMG_f1TvD-RZv7I_klGzuPa7YGs27lO3Wv4BUZNz5Q
linkProvider	Library Specific Holdings
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Speech+and+Computer&rft.au=Markitantov%2C+Maxim&rft.au=Verkholyak%2C+Oxana&rft.atitle=Automatic+Recognition+of+Speaker+Age+and+Gender+Based+on+Deep+Neural+Networks&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2019-07-24&rft.pub=Springer+International+Publishing&rft.isbn=9783030260606&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=327&rft.epage=336&rft_id=info:doi/10.1007%2F978-3-030-26061-3_34
thumbnail_s	http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F5921959-l.jpg