Automatic Recognition of Speaker Age and Gender Based on Deep Neural Networks

In the given article, we present a novel approach in the paralinguistic field of age and gender recognition by speaker voice based on deep neural networks. The training and testing of proposed models were implemented on the German speech corpus aGender. We conducted experiments using different netwo...

Full description

Saved in:
Bibliographic Details
Published inSpeech and Computer Vol. 11658; pp. 327 - 336
Main Authors Markitantov, Maxim, Verkholyak, Oxana
Format Book Chapter
LanguageEnglish
Published Switzerland Springer International Publishing AG 2019
Springer International Publishing
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text
ISBN3030260607
9783030260606
ISSN0302-9743
1611-3349
DOI10.1007/978-3-030-26061-3_34

Cover

Abstract In the given article, we present a novel approach in the paralinguistic field of age and gender recognition by speaker voice based on deep neural networks. The training and testing of proposed models were implemented on the German speech corpus aGender. We conducted experiments using different network topologies, including neural networks with fully-connected and convolutional layers. In a joint recognition of speaker age and gender, our system reached the recognition performance measured as unweighted accuracy of 48.41%. In a separate age and gender recognition setup, the obtained performance was 57.53% and 88.80%, respectively. Applied deep neural networks provide the best result of speaker age recognition in comparison to existing traditional classification methods.
AbstractList In the given article, we present a novel approach in the paralinguistic field of age and gender recognition by speaker voice based on deep neural networks. The training and testing of proposed models were implemented on the German speech corpus aGender. We conducted experiments using different network topologies, including neural networks with fully-connected and convolutional layers. In a joint recognition of speaker age and gender, our system reached the recognition performance measured as unweighted accuracy of 48.41%. In a separate age and gender recognition setup, the obtained performance was 57.53% and 88.80%, respectively. Applied deep neural networks provide the best result of speaker age recognition in comparison to existing traditional classification methods.
Author Verkholyak, Oxana
Markitantov, Maxim
Author_xml – sequence: 1
  givenname: Maxim
  surname: Markitantov
  fullname: Markitantov, Maxim
  email: m.markitantov@yandex.ru
  organization: St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences (SPIIRAS), St. Petersburg, Russia
– sequence: 2
  givenname: Oxana
  surname: Verkholyak
  fullname: Verkholyak, Oxana
  email: overkholyak@gmail.com
  organization: St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences (SPIIRAS), St. Petersburg, Russia
BookMark eNo1kNtOwzAMhsNRbGNvwEVfoGDXaZNcjtNAGiBxuI7S1h1j0JSmE69PxuHK9v_7t-RvLPZb37IQJwinCKDOjNIppUCQZgUUmJIluSOmUaYo_mi0K0ZYYPRImj0x_jdA7YvRtk-NknQoxogoc2Mk4ZGYhvAGAFkGBpUeibvZZvAfblhVySNXftmuhpVvE98kTx27NffJbMmJa-tkzm0dx3MXuE7iyiVzl9zzpnfvsQxfvl-HY3HQuPfA0786ES_XV88XN-niYX57MVukXSZpSCWDBm2K0ik0Ze1ko6q6gcJocmWtS6kU1oCYE5VFnZFEnSmpdQONQ9JEE5H93g1dv2qX3NvS-3WwCHZLz0ZMlmxkYH9I2S29GJK_oa73nxsOg-VtquJ2iD9Ur64buA82Nxma3FgyMSeRvgHW_21-
ContentType Book Chapter
Copyright Springer Nature Switzerland AG 2019
Copyright_xml – notice: Springer Nature Switzerland AG 2019
DBID FFUUA
DEWEY 6.35
DOI 10.1007/978-3-030-26061-3_34
DatabaseName ProQuest Ebook Central - Book Chapters - Demo use only
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 9783030260613
3030260615
EISSN 1611-3349
Editor Salah, Albert Ali
Potapova, Rodmonga
Karpov, Alexey
Editor_xml – sequence: 1
  fullname: Salah, Albert Ali
– sequence: 2
  fullname: Karpov, Alexey
– sequence: 3
  fullname: Potapova, Rodmonga
EndPage 336
ExternalDocumentID EBC5921959_391_341
GroupedDBID 38.
AABBV
AEDXK
AEJLV
AEKFX
AIFIR
ALMA_UNASSIGNED_HOLDINGS
AYMPB
BBABE
CXBFT
CZZ
EXGDT
FCSXQ
FFUUA
I4C
IEZ
MGZZY
NSQWD
OORQV
SBO
TPJZQ
TSXQS
Z5O
Z7R
Z7S
Z7U
Z7V
Z7W
Z7X
Z7Y
Z7Z
Z81
Z82
Z83
Z84
Z85
Z87
Z88
-DT
-~X
29L
2HA
2HV
ACGFS
ADCXD
EJD
F5P
LAS
LDH
P2P
RSU
~02
ID FETCH-LOGICAL-p243t-4e080896ba719bda4f7cdf06983abd8b4771d011533b6d2341827488f0fa13833
ISBN 3030260607
9783030260606
ISSN 0302-9743
IngestDate Tue Jul 29 19:56:57 EDT 2025
Fri Apr 11 21:40:43 EDT 2025
IsPeerReviewed true
IsScholarly true
LCCallNum Q334-342
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-p243t-4e080896ba719bda4f7cdf06983abd8b4771d011533b6d2341827488f0fa13833
OCLC 1114599431
PQID EBC5921959_391_341
PageCount 10
ParticipantIDs springer_books_10_1007_978_3_030_26061_3_34
proquest_ebookcentralchapters_5921959_391_341
PublicationCentury 2000
PublicationDate 2019
20190724
PublicationDateYYYYMMDD 2019-01-01
2019-07-24
PublicationDate_xml – year: 2019
  text: 2019
PublicationDecade 2010
PublicationPlace Switzerland
PublicationPlace_xml – name: Switzerland
– name: Cham
PublicationSeriesSubtitle Lecture Notes in Artificial Intelligence
PublicationSeriesTitle Lecture Notes in Computer Science
PublicationSeriesTitleAlternate Lect.Notes Computer
PublicationSubtitle 21st International Conference, SPECOM 2019, Istanbul, Turkey, August 20-25, 2019, Proceedings
PublicationTitle Speech and Computer
PublicationYear 2019
Publisher Springer International Publishing AG
Springer International Publishing
Publisher_xml – name: Springer International Publishing AG
– name: Springer International Publishing
RelatedPersons Hartmanis, Juris
Gao, Wen
Bertino, Elisa
Woeginger, Gerhard
Goos, Gerhard
Steffen, Bernhard
Yung, Moti
RelatedPersons_xml – sequence: 1
  givenname: Gerhard
  surname: Goos
  fullname: Goos, Gerhard
  organization: Karlsruhe Institute of Technology, Karlsruhe, Germany
– sequence: 2
  givenname: Juris
  surname: Hartmanis
  fullname: Hartmanis, Juris
  organization: Cornell University, Ithaca, USA
– sequence: 3
  givenname: Elisa
  surname: Bertino
  fullname: Bertino, Elisa
  organization: Purdue University, West Lafayette, USA
– sequence: 4
  givenname: Wen
  surname: Gao
  fullname: Gao, Wen
  organization: Peking University, Beijing, China
– sequence: 5
  givenname: Bernhard
  surname: Steffen
  fullname: Steffen, Bernhard
  organization: TU Dortmund University, Dortmund, Germany
– sequence: 6
  givenname: Gerhard
  surname: Woeginger
  fullname: Woeginger, Gerhard
  organization: RWTH Aachen, Aachen, Germany
– sequence: 7
  givenname: Moti
  surname: Yung
  fullname: Yung, Moti
  organization: Columbia University, New York, USA
SSID ssj0002209178
ssj0002792
Score 2.0375402
Snippet In the given article, we present a novel approach in the paralinguistic field of age and gender recognition by speaker voice based on deep neural networks. The...
SourceID springer
proquest
SourceType Publisher
StartPage 327
SubjectTerms Age and gender recognition
Computational Paralinguistics
Convolutional neural networks
Deep neural networks
Machine learning
Title Automatic Recognition of Speaker Age and Gender Based on Deep Neural Networks
URI http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=5921959&ppg=341
http://link.springer.com/10.1007/978-3-030-26061-3_34
Volume 11658
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3Nb9MwFLdYuSAOjC8xYMgHbpWnxE5s58BhbEPTNIqEtombZScOQtPaqs2mjb-e57y4SbtdxiWqXMtx3899fn4fPxPyWSldyoSXrFCuYJlSOdO60kwWpeTcWtDLLdvnRB6fZye_8kGgva0uadxe-ffBupL_QRXaANdQJfsIZFeDQgN8BnzhCQjDc8P4XXezYg3H3PsS69Li1Qy9e3kBvcP1wDdYj3P75yp-d-EXl6Dx7myrBn_c2qkdLpv962aGLK4_Y2oRWpTwOnsZEup_Y8AB76Abf4VdsAoRh0Pv5-NA9QGYTzC3HM31IAm__HLaBSsms6bNAVtNOqqXof-hLXli_L7_ccOD2TvR1g6ssGEGEjOZyIGegyZYKsjWtOdRD8vAriiQzbTTrQJJBLptWiBvyr0dYJj0ASOz8DYYyohsi2wpnY3I0_2jk9OLlSOOczCZVE8xHxgVMfSEswoFQXHWCimb-l8xKMZ86JVrx5aNSHtrwJxtk-ehqIWGahOQ30vyxE9fkRcRAtpB8Jp8X-FPB_jTWU07_CngTwF_ivjTFn8KXQL-FPGnEf835Pzb0dnBMetu3GBznomGZR4OELqQzqq0cJXNalVWdSILLayrtIP_clqFQ4QQTlYcLCDNFWwBdVLbVGgh3pLRdDb17wi1Lq_LwvIyTR3sCxYsIZdlVorE8lTmfIewKBrT5gV0ycglCmJp8oIH4iMjihQkme6QcZSfCd2XJhJug-CNMCB40wreBMG_f1TvD-RZv7I_klGzuPa7YGs27lO3Wv4BUZNz5Q
linkProvider Library Specific Holdings
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Speech+and+Computer&rft.au=Markitantov%2C+Maxim&rft.au=Verkholyak%2C+Oxana&rft.atitle=Automatic+Recognition+of+Speaker+Age+and+Gender+Based+on+Deep+Neural+Networks&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2019-07-24&rft.pub=Springer+International+Publishing&rft.isbn=9783030260606&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=327&rft.epage=336&rft_id=info:doi/10.1007%2F978-3-030-26061-3_34
thumbnail_s http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F5921959-l.jpg