Automatic Recognition of Speaker Age and Gender Based on Deep Neural Networks
In the given article, we present a novel approach in the paralinguistic field of age and gender recognition by speaker voice based on deep neural networks. The training and testing of proposed models were implemented on the German speech corpus aGender. We conducted experiments using different netwo...
Published in | Speech and Computer Vol. 11658; pp. 327 - 336 |
---|---|
Main Authors | Markitantov, Maxim; Verkholyak, Oxana |
Format | Book Chapter |
Language | English |
Published | Switzerland: Springer International Publishing AG, 2019 |
Series | Lecture Notes in Computer Science |
ISBN | 3030260607 9783030260606 |
ISSN | 0302-9743 1611-3349 |
DOI | 10.1007/978-3-030-26061-3_34 |
Abstract | In the given article, we present a novel approach in the paralinguistic field of age and gender recognition by speaker voice based on deep neural networks. The training and testing of proposed models were implemented on the German speech corpus aGender. We conducted experiments using different network topologies, including neural networks with fully-connected and convolutional layers. In a joint recognition of speaker age and gender, our system reached the recognition performance measured as unweighted accuracy of 48.41%. In a separate age and gender recognition setup, the obtained performance was 57.53% and 88.80%, respectively. Applied deep neural networks provide the best result of speaker age recognition in comparison to existing traditional classification methods. |
---|---|
Author | Verkholyak, Oxana; Markitantov, Maxim |
Author_xml | 1. Markitantov, Maxim (m.markitantov@yandex.ru), St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences (SPIIRAS), St. Petersburg, Russia; 2. Verkholyak, Oxana (overkholyak@gmail.com), St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences (SPIIRAS), St. Petersburg, Russia |
Copyright | Springer Nature Switzerland AG 2019 |
DEWEY | 6.35 |
Discipline | Computer Science |
EISBN | 9783030260613 3030260615 |
EISSN | 1611-3349 |
Editor | Salah, Albert Ali; Potapova, Rodmonga; Karpov, Alexey |
EndPage | 336 |
IsPeerReviewed | true |
IsScholarly | true |
LCCallNum | Q334-342 |
OCLC | 1114599431 |
PageCount | 10 |
PublicationCentury | 2000 |
PublicationDate | 2019 20190724 |
PublicationDateYYYYMMDD | 2019-01-01 2019-07-24 |
PublicationDecade | 2010 |
PublicationPlace | Switzerland |
PublicationPlace_xml | – name: Switzerland – name: Cham |
PublicationSeriesSubtitle | Lecture Notes in Artificial Intelligence |
PublicationSeriesTitle | Lecture Notes in Computer Science |
PublicationSeriesTitleAlternate | Lect.Notes Computer |
PublicationSubtitle | 21st International Conference, SPECOM 2019, Istanbul, Turkey, August 20-25, 2019, Proceedings |
PublicationTitle | Speech and Computer |
PublicationYear | 2019 |
Publisher | Springer International Publishing AG; Springer International Publishing |
RelatedPersons | Hartmanis, Juris; Gao, Wen; Bertino, Elisa; Woeginger, Gerhard; Goos, Gerhard; Steffen, Bernhard; Yung, Moti |
RelatedPersons_xml | 1. Goos, Gerhard (Karlsruhe Institute of Technology, Karlsruhe, Germany); 2. Hartmanis, Juris (Cornell University, Ithaca, USA); 3. Bertino, Elisa (Purdue University, West Lafayette, USA); 4. Gao, Wen (Peking University, Beijing, China); 5. Steffen, Bernhard (TU Dortmund University, Dortmund, Germany); 6. Woeginger, Gerhard (RWTH Aachen, Aachen, Germany); 7. Yung, Moti (Columbia University, New York, USA) |
StartPage | 327 |
SubjectTerms | Age and gender recognition; Computational Paralinguistics; Convolutional neural networks; Deep neural networks; Machine learning |
Title | Automatic Recognition of Speaker Age and Gender Based on Deep Neural Networks |
URI | http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=5921959&ppg=341 http://link.springer.com/10.1007/978-3-030-26061-3_34 |
Volume | 11658 |
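
The abstract in this record reports results as unweighted accuracy (48.41% for joint age and gender recognition). Assuming the term follows the usual convention in computational paralinguistics, that is, the plain mean of per-class recalls so that small classes count as much as large ones, a minimal Python sketch could look like the following. The function names and the toy labels are illustrative assumptions and are not taken from the chapter or from the aGender corpus.

```python
from collections import defaultdict


def unweighted_accuracy(y_true, y_pred):
    """Mean of per-class recalls: every class contributes equally,
    regardless of how many samples it has (assumed definition)."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for truth, pred in zip(y_true, y_pred):
        total[truth] += 1
        if truth == pred:
            correct[truth] += 1
    recalls = [correct[label] / total[label] for label in total]
    return sum(recalls) / len(recalls)


def weighted_accuracy(y_true, y_pred):
    """Ordinary accuracy: fraction of all samples classified correctly."""
    hits = sum(truth == pred for truth, pred in zip(y_true, y_pred))
    return hits / len(y_true)


# Hypothetical, deliberately imbalanced toy labels (not real data).
y_true = ["female"] * 8 + ["male"] * 2
y_pred = ["female"] * 8 + ["female", "male"]

ua = unweighted_accuracy(y_true, y_pred)  # (8/8 + 1/2) / 2
wa = weighted_accuracy(y_true, y_pred)    # 9 correct out of 10
print(f"UA = {ua:.2f}, WA = {wa:.2f}")
```

On imbalanced data the two measures diverge: here the majority class is recognized perfectly while half of the minority class is missed, so the unweighted figure is noticeably lower than plain accuracy. This is one reason class-balanced measures are often preferred when class sizes differ.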