An estimation method of voice timbre evaluation values using feature extraction with Gaussian mixture model based on reference singer
This paper presents an estimation method of voice timbre evaluation values for arbitrary singer's singing voices generated with a singing voice synthesis system towards the development of a singing voice retrieval system. The voice timbre evaluation values are numerical values corresponding to...
Saved in:
Published in | 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) pp. 5265 - 5269 |
---|---|
Main Authors | , , , , , |
Format | Conference Proceeding Journal Article |
Language | English |
Published |
IEEE
01.03.2016
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | This paper presents an estimation method of voice timbre evaluation values for arbitrary singer's singing voices generated with a singing voice synthesis system towards the development of a singing voice retrieval system. The voice timbre evaluation values are numerical values corresponding to voice timbre expression words, such as "Age" and "Gender", and they usually need to be manually assigned to individual singers' singing voices through listening. To make it possible to automatically estimate them from given singer's singing voices, an acoustic feature to well capture only each singer's voice timbre is extracted with a Gaussian mixture model trained using parallel data between singing voices sung by many pre-stored target singers and same voices sung by a reference singer. Then, the voice timbre evaluation values are estimated from the extracted feature using regression models. The experimental results showed that the proposed method is capable of accurately estimating those values for some expression words, such as "Age" and "Gender", and nonlinear regression is effective for the expression words, "Powerfulness" and "Uniqueness." |
---|---|
AbstractList | This paper presents an estimation method of voice timbre evaluation values for arbitrary singer's singing voices generated with a singing voice synthesis system towards the development of a singing voice retrieval system. The voice timbre evaluation values are numerical values corresponding to voice timbre expression words, such as "Age" and "Gender", and they usually need to be manually assigned to individual singers' singing voices through listening. To make it possible to automatically estimate them from given singer's singing voices, an acoustic feature to well capture only each singer's voice timbre is extracted with a Gaussian mixture model trained using parallel data between singing voices sung by many pre-stored target singers and same voices sung by a reference singer. Then, the voice timbre evaluation values are estimated from the extracted feature using regression models. The experimental results showed that the proposed method is capable of accurately estimating those values for some expression words, such as "Age" and "Gender", and nonlinear regression is effective for the expression words, "Powerfulness" and "Uniqueness." |
Author | Kobayashi, Kazuhiro Nakano, Tomoyasu Nakamura, Satoshi Yamane, Soichi Toda, Tomoki Goto, Masataka |
Author_xml | – sequence: 1 givenname: Soichi surname: Yamane fullname: Yamane, Soichi organization: Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan – sequence: 2 givenname: Kazuhiro surname: Kobayashi fullname: Kobayashi, Kazuhiro organization: Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan – sequence: 3 givenname: Tomoki surname: Toda fullname: Toda, Tomoki organization: Inf. Technol. Center, Nagoya Univ., Nagoya, Japan – sequence: 4 givenname: Tomoyasu surname: Nakano fullname: Nakano, Tomoyasu organization: Nat. Inst. of Adv. Ind. Sci. & Technol., Tsukuba, Japan – sequence: 5 givenname: Masataka surname: Goto fullname: Goto, Masataka organization: Nat. Inst. of Adv. Ind. Sci. & Technol., Tsukuba, Japan – sequence: 6 givenname: Satoshi surname: Nakamura fullname: Nakamura, Satoshi organization: Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan |
BookMark | eNotkM1OwzAQhA0CibbwBL34yKXFP2liH6sKClIlkAoSt8hO1tQoiYvtQHkA3hv3Zy-70nya1cwQXXSuA4TGlEwpJfLuaTFfr1-mjNB8WmQFywU7Q0OaFTKNEPQcDRgv5IRK8n6FhiF8EkJEkYkB-pt3GEK0rYrWdbiFuHE1dgZ_O1sBToL2gOFbNf2R2F8QcB9s94ENqNjv9V30qjroPzZu8FL1IViV_OzuALSuhgZrFSCZd9iDAQ9derC3AX-NLo1qAtyc9gi9Pdy_Lh4nq-dlCreaWEZEnCidSeBGqMoQxrKaazpTQAmtCqGFLojR3NQzQ3jNKacVEaKGXIOUBWEy03yEbo--W---UoxYtjZU0DSqA9eHkgo2yyQlOU3o-IhaACi3PjXkf8tTu_wf00N0Hg |
ContentType | Conference Proceeding Journal Article |
DBID | 6IE 6IH CBEJK RIE RIO 7SP 8FD L7M |
DOI | 10.1109/ICASSP.2016.7472682 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEL IEEE Proceedings Order Plans (POP) 1998-present Electronics & Communications Abstracts Technology Research Database Advanced Technologies Database with Aerospace |
DatabaseTitle | Technology Research Database Advanced Technologies Database with Aerospace Electronics & Communications Abstracts |
DatabaseTitleList | Technology Research Database |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Xplore url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering |
EISBN | 1479999881 9781479999880 |
EISSN | 2379-190X |
EndPage | 5269 |
ExternalDocumentID | 7472682 |
Genre | orig-research |
GroupedDBID | 23M 29P 6IE 6IF 6IH 6IK 6IL 6IM 6IN AAJGR ABLEC ACGFS ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IJVOP IPLJI JC5 M43 OCL RIE RIL RIO RNS 7SP 8FD L7M |
ID | FETCH-LOGICAL-i208t-ab49e3f8acf0224d3b15ae101c78b8b70fb3fd5f03d3131c088de6be9970294b3 |
IEDL.DBID | RIE |
IngestDate | Thu Apr 11 21:12:45 EDT 2024 Wed Jun 26 19:23:39 EDT 2024 |
IsPeerReviewed | false |
IsScholarly | true |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i208t-ab49e3f8acf0224d3b15ae101c78b8b70fb3fd5f03d3131c088de6be9970294b3 |
Notes | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Conference-1 ObjectType-Feature-3 content type line 23 SourceType-Conference Papers & Proceedings-2 |
PQID | 1825491061 |
PQPubID | 23500 |
PageCount | 5 |
ParticipantIDs | proquest_miscellaneous_1825491061 ieee_primary_7472682 |
PublicationCentury | 2000 |
PublicationDate | 20160301 |
PublicationDateYYYYMMDD | 2016-03-01 |
PublicationDate_xml | – month: 03 year: 2016 text: 20160301 day: 01 |
PublicationDecade | 2010 |
PublicationTitle | 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
PublicationTitleAbbrev | ICASSP |
PublicationYear | 2016 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0008748 |
Score | 2.0043318 |
Snippet | This paper presents an estimation method of voice timbre evaluation values for arbitrary singer's singing voices generated with a singing voice synthesis... |
SourceID | proquest ieee |
SourceType | Aggregation Database Publisher |
StartPage | 5265 |
SubjectTerms | Acoustics Age Covariance matrices Data models Estimation estimation of evaluation values Feature extraction Gaussian Gaussian mixture model Mathematical models reference singer Regression Singing singing voice synthesis Timbre Voice voice timbre |
Title | An estimation method of voice timbre evaluation values using feature extraction with Gaussian mixture model based on reference singer |
URI | https://ieeexplore.ieee.org/document/7472682 https://search.proquest.com/docview/1825491061 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1bS8MwFA5zT_riZRPnjSP4aLe2abv0cQznBSaDOdjbSJoTGWIrWyviu__bJO02UR98C7QJITn5Ti7n-w4hlzRJfCFD4aDLuBPIrtRLinlOlPicMak0Pho28vAhup0E99NwWiNXay4MItrgM2ybon3Ll1lSmKuyjt76-hHTgLvFXL_kaq1Rl3UDVqkKeW7cuev3xuORCd2K2lW1Kn_KL9C1nmSwS4arPpQBJM_tIhft5OOHPON_O7lHmhvOHozW3mif1DA9IDvf5AYb5LOXghHVKNmKUCaPhkzBW6bhAvQHfT6Gjf43mBIuwcTGP4FCqwEKGs0XJRsCzCUu3PBiaZiY8DJ_tz_Y5Dpg3KNuPIV1IhNY2ivEJpkMrh_7t06VhsGZ-y7LHS6CGKliPFHG4UsqvJCjXsqJnkcmuq4SVMlQuVRSj3qJxi2JkcA47rp-HAh6SOppluIRAe0vPSUY567hPni-QEYjxgOUVFuIH7ZIwwzo7LVU2phVY9kiF6spm2nrN08aPMWsWM48e8A1x9rjv6uekG1jA2XU2Cmp54sCz_Q2Ihfn1n6-AJ5sy9I |
link.rule.ids | 310,311,315,786,790,795,796,802,23958,23959,25170,27955,27956,55107 |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8QwEA6iB_XiG9-O4NGubdNHepTFdX2sCCp4K0kzERFbcbci3v3fTtK6inrwFmgTQjL5Jo_5vmFsjxdFqHSsPPSF9CKdalpSIvCSIpRCaEP4aNnIg4ukfxOd3sa3E2x_zIVBRBd8hh1bdG_5uipqe1V2QFvfMBEEuFPk5_20YWuNcVekkWh1hQI_OzjpHl5dXdrgraTTVmwzqPyCXedLenNs8NmLJoTkoVOPVKd4-yHQ-N9uzrPlL9YeXI790QKbwHKRzX4THFxi74clWFmNhq8ITfpoqAy8VAQYQB_ohAxfCuBgSzgEGx1_BwadCigQnj83fAiw17hwLOuh5WLC4_2r-8Gl1wHrIKnxEsapTGDoLhGX2U3v6Lrb99pEDN596IuRJ1WUITdCFsa6fM1VEEukxVzQTAqV-kZxo2Pjc80DHhSEXBoThVmW-mEWKb7CJsuqxFUG5DEDo4SUvmU_BKFCwRMhI9ScbCSM19iSHdD8qdHayNuxXGO7n1OWk_3bRw1ZYlUP88Adce3Bdv3vqjtsun89OM_PTy7ONtiMtYcmhmyTTY6ea9yiTcVIbTtb-gB-dM8m |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=2016+IEEE+International+Conference+on+Acoustics%2C+Speech+and+Signal+Processing+%28ICASSP%29&rft.atitle=An+estimation+method+of+voice+timbre+evaluation+values+using+feature+extraction+with+Gaussian+mixture+model+based+on+reference+singer&rft.au=Yamane%2C+Soichi&rft.au=Kobayashi%2C+Kazuhiro&rft.au=Toda%2C+Tomoki&rft.au=Nakano%2C+Tomoyasu&rft.date=2016-03-01&rft.pub=IEEE&rft.eissn=2379-190X&rft.spage=5265&rft.epage=5269&rft_id=info:doi/10.1109%2FICASSP.2016.7472682&rft.externalDocID=7472682 |