An estimation method of voice timbre evaluation values using feature extraction with Gaussian mixture model based on reference singer

Bibliographic Details
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5265-5269
Main Authors Yamane, Soichi; Kobayashi, Kazuhiro; Toda, Tomoki; Nakano, Tomoyasu; Goto, Masataka; Nakamura, Satoshi
Format Conference Proceeding; Journal Article
Language English
Published IEEE 01.03.2016
Abstract This paper presents an estimation method of voice timbre evaluation values for arbitrary singer's singing voices generated with a singing voice synthesis system towards the development of a singing voice retrieval system. The voice timbre evaluation values are numerical values corresponding to voice timbre expression words, such as "Age" and "Gender", and they usually need to be manually assigned to individual singers' singing voices through listening. To make it possible to automatically estimate them from given singer's singing voices, an acoustic feature to well capture only each singer's voice timbre is extracted with a Gaussian mixture model trained using parallel data between singing voices sung by many pre-stored target singers and same voices sung by a reference singer. Then, the voice timbre evaluation values are estimated from the extracted feature using regression models. The experimental results showed that the proposed method is capable of accurately estimating those values for some expression words, such as "Age" and "Gender", and nonlinear regression is effective for the expression words, "Powerfulness" and "Uniqueness."
Author Affiliations
Yamane, Soichi: Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
Kobayashi, Kazuhiro: Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
Toda, Tomoki: Inf. Technol. Center, Nagoya Univ., Nagoya, Japan
Nakano, Tomoyasu: Nat. Inst. of Adv. Ind. Sci. & Technol., Tsukuba, Japan
Goto, Masataka: Nat. Inst. of Adv. Ind. Sci. & Technol., Tsukuba, Japan
Nakamura, Satoshi: Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
DOI 10.1109/ICASSP.2016.7472682
Discipline Engineering
EISBN 1479999881
9781479999880
EISSN 2379-190X
Subject Terms
Acoustics
Age
Covariance matrices
Data models
Estimation
estimation of evaluation values
Feature extraction
Gaussian
Gaussian mixture model
Mathematical models
reference singer
Regression
Singing
singing voice synthesis
Timbre
Voice
voice timbre
URI https://ieeexplore.ieee.org/document/7472682
https://search.proquest.com/docview/1825491061