An estimation method of voice timbre evaluation values using feature extraction with Gaussian mixture model based on reference singer

Bibliographic Details
Published in	2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) pp. 5265 - 5269
Main Authors	Yamane, Soichi; Kobayashi, Kazuhiro; Toda, Tomoki; Nakano, Tomoyasu; Goto, Masataka; Nakamura, Satoshi
Format	Conference Proceeding; Journal Article
Language	English
Published	IEEE 01.03.2016
Subjects	Acoustics; Age; Covariance matrices; Data models; Estimation; estimation of evaluation values; Feature extraction; Gaussian; Gaussian mixture model; Mathematical models; reference singer; Regression; Singing; singing voice synthesis; Timbre; Voice; voice timbre
Abstract	This paper presents a method for estimating voice timbre evaluation values for the singing voices of arbitrary singers generated with a singing voice synthesis system, as a step toward the development of a singing voice retrieval system. Voice timbre evaluation values are numerical values corresponding to voice timbre expression words, such as "Age" and "Gender", and they usually have to be assigned manually to each singer's singing voice through listening. To estimate them automatically from a given singer's singing voice, an acoustic feature that captures only that singer's voice timbre is extracted with a Gaussian mixture model trained on parallel data between singing voices sung by many pre-stored target singers and the same material sung by a reference singer. The voice timbre evaluation values are then estimated from the extracted feature using regression models. The experimental results show that the proposed method can accurately estimate those values for some expression words, such as "Age" and "Gender", and that nonlinear regression is effective for the expression words "Powerfulness" and "Uniqueness."
Author_xml
– sequence: 1 givenname: Soichi surname: Yamane fullname: Yamane, Soichi organization: Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
– sequence: 2 givenname: Kazuhiro surname: Kobayashi fullname: Kobayashi, Kazuhiro organization: Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
– sequence: 3 givenname: Tomoki surname: Toda fullname: Toda, Tomoki organization: Inf. Technol. Center, Nagoya Univ., Nagoya, Japan
– sequence: 4 givenname: Tomoyasu surname: Nakano fullname: Nakano, Tomoyasu organization: Nat. Inst. of Adv. Ind. Sci. & Technol., Tsukuba, Japan
– sequence: 5 givenname: Masataka surname: Goto fullname: Goto, Masataka organization: Nat. Inst. of Adv. Ind. Sci. & Technol., Tsukuba, Japan
– sequence: 6 givenname: Satoshi surname: Nakamura fullname: Nakamura, Satoshi organization: Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
DOI	10.1109/ICASSP.2016.7472682
Discipline	Engineering
EISBN	1479999881 9781479999880
EISSN	2379-190X
EndPage	5269
ExternalDocumentID	7472682
Genre	orig-research
IsPeerReviewed	false
IsScholarly	true
PageCount	5
PublicationDateYYYYMMDD	2016-03-01
PublicationTitle	2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
PublicationTitleAbbrev	ICASSP
URI	https://ieeexplore.ieee.org/document/7472682 https://search.proquest.com/docview/1825491061