Speaker normalization for Chinese vowel recognition in cochlear implants
Because of the limited spectra-temporal resolution associated with cochlear implants, implant patients often have greater difficulty with multitalker speech recognition. The present study investigated whether multitalker speech recognition can be improved by applying speaker normalization techniques...
Saved in:
Published in | IEEE transactions on biomedical engineering Vol. 52; no. 7; pp. 1358 - 1361 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
United States
IEEE
01.07.2005
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Subjects | |
Online Access | Get full text |
ISSN | 0018-9294 1558-2531 |
DOI | 10.1109/TBME.2005.847530 |
Cover
Abstract | Because of the limited spectra-temporal resolution associated with cochlear implants, implant patients often have greater difficulty with multitalker speech recognition. The present study investigated whether multitalker speech recognition can be improved by applying speaker normalization techniques to cochlear implant speech processing. Multitalker Chinese vowel recognition was tested with normal-hearing Chinese-speaking subjects listening to a 4-channel cochlear implant simulation, with and without speaker normalization. For each subject, speaker normalization was referenced to the speaker that produced the best recognition performance under conditions without speaker normalization. To match the remaining speakers to this "optimal" output pattern, the overall frequency range of the analysis filter bank was adjusted for each speaker according to the ratio of the mean third formant frequency values between the specific speaker and the reference speaker. Results showed that speaker normalization provided a small but significant improvement in subjects' overall recognition performance. After speaker normalization, subjects' patterns of recognition performance across speakers changed, demonstrating the potential for speaker-dependent effects with the proposed normalization technique. |
---|---|
AbstractList | Because of the limited spectra-temporal resolution associated with cochlear implants, implant patients often have greater difficulty with multitalker speech recognition. Because of the limited spectra-temporal resolution associated with cochlear implants, implant patients often have greater difficulty with multitalker speech recognition. The present study investigated whether multitalker speech recognition can be improved by applying speaker normalization techniques to cochlear implant speech processing. Multitalker Chinese vowel recognition was tested with normal-hearing Chinese-speaking subjects listening to a 4-channel cochlear implant simulation, with and without speaker normalization. For each subject, speaker normalization was referenced to the speaker that produced the best recognition performance under conditions without speaker normalization. To match the remaining speakers to this "optimal" output pattern, the overall frequency range of the analysis filter bank was adjusted for each speaker according to the ratio of the mean third formant frequency values between the specific speaker and the reference speaker. Results showed that speaker normalization provided a small but significant improvement in subjects' overall recognition performance. After speaker normalization, subjects' patterns of recognition performance across speakers changed, demonstrating the potential for speaker-dependent effects with the proposed normalization technique.Because of the limited spectra-temporal resolution associated with cochlear implants, implant patients often have greater difficulty with multitalker speech recognition. The present study investigated whether multitalker speech recognition can be improved by applying speaker normalization techniques to cochlear implant speech processing. Multitalker Chinese vowel recognition was tested with normal-hearing Chinese-speaking subjects listening to a 4-channel cochlear implant simulation, with and without speaker normalization. For each subject, speaker normalization was referenced to the speaker that produced the best recognition performance under conditions without speaker normalization. To match the remaining speakers to this "optimal" output pattern, the overall frequency range of the analysis filter bank was adjusted for each speaker according to the ratio of the mean third formant frequency values between the specific speaker and the reference speaker. Results showed that speaker normalization provided a small but significant improvement in subjects' overall recognition performance. After speaker normalization, subjects' patterns of recognition performance across speakers changed, demonstrating the potential for speaker-dependent effects with the proposed normalization technique. Because of the limited spectra-temporal resolution associated with cochlear implants, implant patients often have greater difficulty with multitalker speech recognition. The present study investigated whether multitalker speech recognition can be improved by applying speaker normalization techniques to cochlear implant speech processing. Multitalker Chinese vowel recognition was tested with normal-hearing Chinese-speaking subjects listening to a 4-channel cochlear implant simulation, with and without speaker normalization. For each subject, speaker normalization was referenced to the speaker that produced the best recognition performance under conditions without speaker normalization. To match the remaining speakers to this "optimal" output pattern, the overall frequency range of the analysis filter bank was adjusted for each speaker according to the ratio of the mean third formant frequency values between the specific speaker and the reference speaker. Results showed that speaker normalization provided a small but significant improvement in subjects' overall recognition performance. After speaker normalization, subjects' patterns of recognition performance across speakers changed, demonstrating the potential for speaker-dependent effects with the proposed normalization technique. |
Author | Xin Luo Qian-Jie Fu |
Author_xml | – sequence: 1 givenname: X. surname: Luo fullname: Luo, X. – sequence: 2 givenname: Q.-J. surname: Fu fullname: Fu, Q.-J. |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/16042003$$D View this record in MEDLINE/PubMed |
BookMark | eNqFks9rFDEUx4NU7LZ6FwQZPPQ268uvmZejLrUVKh6s55DJvrGpM8mazCr61zvbrQoF2VMI-Xy-PPK-J-wopkiMPeew5BzM6-u3H86XAkAvUbVawiO24FpjLbTkR2wBwLE2wqhjdlLK7XxVqJon7Jg3oGZNLtjlpw25r5SrmPLohvDLTSHFqk-5Wt2ESIWq7-kHDVUmn77EcPcaYuWTvxnI5SqMm8HFqTxlj3s3FHp2f56yz-_Or1eX9dXHi_erN1e1V7KdaiWkN0ajFx0i7xF6hWsQBh0hQod9Z7BR2mnfCe6Mdp48uEYI03G-VlqesrN97ianb1sqkx1D8TTMQ1DaFtsgtNyAOQgKBANK88MgSA5Cq4MgN4qjkrvEVw_A27TNcf4Wi007TwitmKGX99C2G2ltNzmMLv-0f3YzA80e8DmVkqm3Pkx365myC4PlYHclsLsS2F0J7L4EswgPxL_Z_1de7JVARP9wpaDhKH8DYQG59w |
CODEN | IEBEAX |
CitedBy_id | crossref_primary_10_1121_1_2897047 crossref_primary_10_1016_j_heares_2014_11_003 crossref_primary_10_1097_AUD_0000000000001173 crossref_primary_10_1044_2014_JSLHR_H_12_0404 crossref_primary_10_1097_AUD_0000000000000265 crossref_primary_10_1016_j_ijporl_2011_03_009 crossref_primary_10_1002_lary_23744 |
Cites_doi | 10.1121/1.399052 10.1109/78.80902 10.1121/1.397688 10.1109/89.222875 10.1109/89.294352 10.1121/1.1911939 10.21437/ICSLP.1996-632 10.1109/TAU.1968.1161952 10.1126/science.270.5234.303 10.1109/89.650310 |
ContentType | Journal Article |
Copyright | Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2005 |
Copyright_xml | – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2005 |
DBID | 97E RIA RIE AAYXX CITATION CGR CUY CVF ECM EIF NPM 7QF 7QO 7QQ 7SC 7SE 7SP 7SR 7TA 7TB 7U5 8BQ 8FD F28 FR3 H8D JG9 JQ2 KR7 L7M L~C L~D P64 7X8 |
DOI | 10.1109/TBME.2005.847530 |
DatabaseName | IEEE Xplore (IEEE) IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE Electronic Library (IEL) CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed Aluminium Industry Abstracts Biotechnology Research Abstracts Ceramic Abstracts Computer and Information Systems Abstracts Corrosion Abstracts Electronics & Communications Abstracts Engineered Materials Abstracts Materials Business File Mechanical & Transportation Engineering Abstracts Solid State and Superconductivity Abstracts METADEX Technology Research Database ANTE: Abstracts in New Technology & Engineering Engineering Research Database Aerospace Database Materials Research Database ProQuest Computer Science Collection Civil Engineering Abstracts Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional Biotechnology and BioEngineering Abstracts MEDLINE - Academic |
DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) Materials Research Database Civil Engineering Abstracts Aluminium Industry Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Mechanical & Transportation Engineering Abstracts Electronics & Communications Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts Ceramic Abstracts Materials Business File METADEX Biotechnology and BioEngineering Abstracts Computer and Information Systems Abstracts Professional Aerospace Database Engineered Materials Abstracts Biotechnology Research Abstracts Solid State and Superconductivity Abstracts Engineering Research Database Corrosion Abstracts Advanced Technologies Database with Aerospace ANTE: Abstracts in New Technology & Engineering MEDLINE - Academic |
DatabaseTitleList | Materials Research Database MEDLINE - Academic Engineering Research Database Technology Research Database Engineering Research Database MEDLINE |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database – sequence: 3 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Medicine Engineering |
EISSN | 1558-2531 |
EndPage | 1361 |
ExternalDocumentID | 2352274921 16042003 10_1109_TBME_2005_847530 1440618 |
Genre | orig-research Research Support, U.S. Gov't, P.H.S Clinical Trial Journal Article Research Support, N.I.H., Extramural |
GeographicLocations | China |
GeographicLocations_xml | – name: China |
GrantInformation_xml | – fundername: NIDCD NIH HHS grantid: R01-DC04993 |
GroupedDBID | --- -~X .55 .DC .GJ 0R~ 29I 4.4 53G 5GY 5RE 5VS 6IF 6IK 6IL 6IN 85S 97E AAJGR AARMG AASAJ AAWTH AAYJJ ABAZT ABJNI ABQJQ ABVLG ACGFO ACGFS ACIWK ACKIV ACNCT ACPRK ADZIZ AENEX AETIX AFFNX AFRAH AGQYO AGSQL AHBIQ AI. AIBXA AKJIK AKQYR ALLEH ALMA_UNASSIGNED_HOLDINGS ASUFR ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CHZPO CS3 DU5 EBS EJD F5P HZ~ H~9 IAAWW IBMZZ ICLAB IDIHD IEGSK IFIPE IFJZH IPLJI JAVBF LAI MS~ O9- OCL P2P RIA RIE RIL RNS TAE TN5 VH1 VJK X7M ZGI ZXP AAYXX CITATION RIG CGR CUY CVF ECM EIF NPM PKN 7QF 7QO 7QQ 7SC 7SE 7SP 7SR 7TA 7TB 7U5 8BQ 8FD F28 FR3 H8D JG9 JQ2 KR7 L7M L~C L~D P64 7X8 |
ID | FETCH-LOGICAL-c437t-423c9958c2b881f80f48d0298ae880b8fb98645a5cb21a95acec0a6229b11d453 |
IEDL.DBID | RIE |
ISSN | 0018-9294 |
IngestDate | Fri Sep 05 06:40:24 EDT 2025 Fri Sep 05 12:12:14 EDT 2025 Thu Sep 04 15:58:18 EDT 2025 Fri Sep 05 11:49:54 EDT 2025 Mon Jun 30 05:24:53 EDT 2025 Wed Feb 19 01:53:28 EST 2025 Tue Jul 01 05:18:20 EDT 2025 Thu Apr 24 22:57:18 EDT 2025 Wed Aug 27 02:52:55 EDT 2025 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 7 |
Language | English |
License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c437t-423c9958c2b881f80f48d0298ae880b8fb98645a5cb21a95acec0a6229b11d453 |
Notes | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 content type line 23 ObjectType-Article-1 ObjectType-Feature-2 |
PMID | 16042003 |
PQID | 867680072 |
PQPubID | 23462 |
PageCount | 4 |
ParticipantIDs | proquest_miscellaneous_19418431 pubmed_primary_16042003 proquest_miscellaneous_68071909 proquest_miscellaneous_20310254 proquest_miscellaneous_28090451 crossref_primary_10_1109_TBME_2005_847530 ieee_primary_1440618 crossref_citationtrail_10_1109_TBME_2005_847530 proquest_journals_867680072 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2005-07-01 |
PublicationDateYYYYMMDD | 2005-07-01 |
PublicationDate_xml | – month: 07 year: 2005 text: 2005-07-01 day: 01 |
PublicationDecade | 2000 |
PublicationPlace | United States |
PublicationPlace_xml | – name: United States – name: New York |
PublicationTitle | IEEE transactions on biomedical engineering |
PublicationTitleAbbrev | TBME |
PublicationTitleAlternate | IEEE Trans Biomed Eng |
PublicationYear | 2005 |
Publisher | IEEE The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Publisher_xml | – name: IEEE – name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
References | ref13 ref12 ref14 ref1 Fu (ref4) 1997 ref8 ref7 ref9 ref3 ref6 Wang (ref11) 1993 Chang (ref2) ref5 Nordstrom (ref10) |
References_xml | – ident: ref14 doi: 10.1121/1.399052 – ident: ref5 doi: 10.1109/78.80902 – volume-title: 8th Annu. Grodins’ Graduate Research Symp. ident: ref2 article-title: The effects of talker variability on vowel recognition in cochlear implant simulation – ident: ref1 doi: 10.1121/1.397688 – volume-title: Speech perception in acoustic and electric hearing year: 1997 ident: ref4 – ident: ref6 doi: 10.1109/89.222875 – ident: ref7 doi: 10.1109/89.294352 – ident: ref12 doi: 10.1121/1.1911939 – volume-title: Int. Cong. Phonetic Sci. ident: ref10 article-title: A normalization procedure for vowel formant data – ident: ref3 doi: 10.21437/ICSLP.1996-632 – ident: ref9 doi: 10.1109/TAU.1968.1161952 – volume-title: Internal Materials year: 1993 ident: ref11 article-title: The standard Chinese database – ident: ref13 doi: 10.1126/science.270.5234.303 – ident: ref8 doi: 10.1109/89.650310 |
SSID | ssj0014846 |
Score | 1.8308452 |
Snippet | Because of the limited spectra-temporal resolution associated with cochlear implants, implant patients often have greater difficulty with multitalker speech... |
SourceID | proquest pubmed crossref ieee |
SourceType | Aggregation Database Index Database Enrichment Source Publisher |
StartPage | 1358 |
SubjectTerms | Artificial Intelligence Auditory implants China Cochlear Implants Computer-Aided Design Equipment Failure Analysis Filter bank Frequency Humans Pattern analysis Pattern matching Pattern recognition Phonation Prosthesis Design Sound Spectrography - methods speaker normalization Speech Acoustics Speech Perception Speech processing Speech recognition Speech Recognition Software Testing Transplants & implants vowel recognition |
Title | Speaker normalization for Chinese vowel recognition in cochlear implants |
URI | https://ieeexplore.ieee.org/document/1440618 https://www.ncbi.nlm.nih.gov/pubmed/16042003 https://www.proquest.com/docview/867680072 https://www.proquest.com/docview/19418431 https://www.proquest.com/docview/20310254 https://www.proquest.com/docview/28090451 https://www.proquest.com/docview/68071909 |
Volume | 52 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1Lb9swDCbaHobtsHXtHm730GGXAXNiO5JCHbeiRTAgu6wFejNkRcaKpk7QJBuwX19Sst1hWIrdDIgG9CDljyb5EeCDpp2va3Qpem1TaXOVWlXZdIx0MeSFR51xcfL0m55cyK-X6nIHPvW1MN77kHzmB_wYYvmzhdvwr7IhByJ1jruwS2oWa7X6iIHEWJST5WTAhZFdSDIzw_Mv09P494SuYjUKzd80KWvWdcpqv0ahvcp2pBm-OGfPYNrNNSaaXA8262rgfv9F4_i_i9mHpy30FJ-jrjyHHd8cwJM_CAkP4NG0DbUfwuT70ttrfysaRrXztlxTEMYV3HPbr7z4ufjl56JPQaLRq0bQBfuDW1GIq5vlnJNsXsDF2en5ySRt2y6kTo7G65QAljNGoSsqxLzGrJY4Y6Z268nYK6wrpnRXVrmqyK1R1nmXWV0Uhk53JtXoJew1i8a_BqEKTzdqpcjnRGlkbmpFTvEYtTUEjHCWwLDb_tK1nOTcGmNeBt8kMyWfHbfKVGU8uwQ-9m8sIx_HA7KHvO33cnHHEzjuTrhsDXZVoia_i2nUE3jfj5KlcfjENn6xWZU5LQAJb22XKJhnlTzuByQwM8zos12CZjEmkGYSeBWV7376rc4e_XtZx_A40MqGVOI3sLe-3fi3BJjW1btgKXdTHQ2y |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LbxMxEB6VIvE48Gh5LAXqAxckNtnd2M74CKhVgG4vpFJvK68zK6qGTdQkIPHrGXsfRYhU3FbyrOTX2N94Zr4BeKN55qsKXYykbSxtqmKrShuPkQ-GNCPUiU9Ozk_15Ex-PlfnO_Cuz4UhohB8RgP_GXz5s4Xb-KeyoXdE6hRvwW2-96VqsrV6n4HEJi0nSVmFMyM7p2RihtMP-VHzfsKHsRqF8m-at2vS1cpq76NQYGU71gx3zvFDyLveNqEml4PNuhy4X38ROf7vcB7BgxZ8ivfNbnkMO1Tvwf0_KAn34E7eOtv3YfJ1SfaSrkTtce28TdgUjHKFr7pNKxI_Fj9pLvogJG69qAUfsd98MQpx8X0592E2T-Ds-Gj6cRK3hRdiJ0fjdcwQyxmj0GUlYlphUkmcea52S6zuJValJ3VXVrkyS61R1pFLrM4yw-s7k2r0FHbrRU3PQaiM-EwtFVudKI1MTaXYLB6jtoahEc4iGHbTX7iWldwXx5gXwTpJTOHXzhfLVEWzdhG87f9YNowcN8ju-2m_lmtmPIKDboWLVmVXBWq2vDyRegSHfSvrmneg2JoWm1WR8gCQEdd2icwzrbLNfYMEJsZz-myX4F6MGaaZCJ41m--6--2effHvYR3C3ck0PylOPp1-OYB7gWQ2BBa_hN311YZeMXxal6-D1vwGkqYQ_w |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Speaker+Normalization+for+Chinese+Vowel+Recognition+in+Cochlear+Implants&rft.jtitle=IEEE+transactions+on+biomedical+engineering&rft.au=Luo%2C+X.&rft.au=Fu%2C+Q.-J.&rft.date=2005-07-01&rft.issn=0018-9294&rft.volume=52&rft.issue=7&rft.spage=1358&rft.epage=1361&rft_id=info:doi/10.1109%2FTBME.2005.847530&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_TBME_2005_847530 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0018-9294&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0018-9294&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0018-9294&client=summon |