Speaker normalization for Chinese vowel recognition in cochlear implants

Because of the limited spectra-temporal resolution associated with cochlear implants, implant patients often have greater difficulty with multitalker speech recognition. The present study investigated whether multitalker speech recognition can be improved by applying speaker normalization techniques...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on biomedical engineering Vol. 52; no. 7; pp. 1358 - 1361
Main Authors Luo, X., Fu, Q.-J.
Format Journal Article
LanguageEnglish
Published United States IEEE 01.07.2005
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text
ISSN0018-9294
1558-2531
DOI10.1109/TBME.2005.847530

Cover

Abstract Because of the limited spectra-temporal resolution associated with cochlear implants, implant patients often have greater difficulty with multitalker speech recognition. The present study investigated whether multitalker speech recognition can be improved by applying speaker normalization techniques to cochlear implant speech processing. Multitalker Chinese vowel recognition was tested with normal-hearing Chinese-speaking subjects listening to a 4-channel cochlear implant simulation, with and without speaker normalization. For each subject, speaker normalization was referenced to the speaker that produced the best recognition performance under conditions without speaker normalization. To match the remaining speakers to this "optimal" output pattern, the overall frequency range of the analysis filter bank was adjusted for each speaker according to the ratio of the mean third formant frequency values between the specific speaker and the reference speaker. Results showed that speaker normalization provided a small but significant improvement in subjects' overall recognition performance. After speaker normalization, subjects' patterns of recognition performance across speakers changed, demonstrating the potential for speaker-dependent effects with the proposed normalization technique.
AbstractList Because of the limited spectra-temporal resolution associated with cochlear implants, implant patients often have greater difficulty with multitalker speech recognition.
Because of the limited spectra-temporal resolution associated with cochlear implants, implant patients often have greater difficulty with multitalker speech recognition. The present study investigated whether multitalker speech recognition can be improved by applying speaker normalization techniques to cochlear implant speech processing. Multitalker Chinese vowel recognition was tested with normal-hearing Chinese-speaking subjects listening to a 4-channel cochlear implant simulation, with and without speaker normalization. For each subject, speaker normalization was referenced to the speaker that produced the best recognition performance under conditions without speaker normalization. To match the remaining speakers to this "optimal" output pattern, the overall frequency range of the analysis filter bank was adjusted for each speaker according to the ratio of the mean third formant frequency values between the specific speaker and the reference speaker. Results showed that speaker normalization provided a small but significant improvement in subjects' overall recognition performance. After speaker normalization, subjects' patterns of recognition performance across speakers changed, demonstrating the potential for speaker-dependent effects with the proposed normalization technique.Because of the limited spectra-temporal resolution associated with cochlear implants, implant patients often have greater difficulty with multitalker speech recognition. The present study investigated whether multitalker speech recognition can be improved by applying speaker normalization techniques to cochlear implant speech processing. Multitalker Chinese vowel recognition was tested with normal-hearing Chinese-speaking subjects listening to a 4-channel cochlear implant simulation, with and without speaker normalization. For each subject, speaker normalization was referenced to the speaker that produced the best recognition performance under conditions without speaker normalization. To match the remaining speakers to this "optimal" output pattern, the overall frequency range of the analysis filter bank was adjusted for each speaker according to the ratio of the mean third formant frequency values between the specific speaker and the reference speaker. Results showed that speaker normalization provided a small but significant improvement in subjects' overall recognition performance. After speaker normalization, subjects' patterns of recognition performance across speakers changed, demonstrating the potential for speaker-dependent effects with the proposed normalization technique.
Because of the limited spectra-temporal resolution associated with cochlear implants, implant patients often have greater difficulty with multitalker speech recognition. The present study investigated whether multitalker speech recognition can be improved by applying speaker normalization techniques to cochlear implant speech processing. Multitalker Chinese vowel recognition was tested with normal-hearing Chinese-speaking subjects listening to a 4-channel cochlear implant simulation, with and without speaker normalization. For each subject, speaker normalization was referenced to the speaker that produced the best recognition performance under conditions without speaker normalization. To match the remaining speakers to this "optimal" output pattern, the overall frequency range of the analysis filter bank was adjusted for each speaker according to the ratio of the mean third formant frequency values between the specific speaker and the reference speaker. Results showed that speaker normalization provided a small but significant improvement in subjects' overall recognition performance. After speaker normalization, subjects' patterns of recognition performance across speakers changed, demonstrating the potential for speaker-dependent effects with the proposed normalization technique.
Author Xin Luo
Qian-Jie Fu
Author_xml – sequence: 1
  givenname: X.
  surname: Luo
  fullname: Luo, X.
– sequence: 2
  givenname: Q.-J.
  surname: Fu
  fullname: Fu, Q.-J.
BackLink https://www.ncbi.nlm.nih.gov/pubmed/16042003$$D View this record in MEDLINE/PubMed
BookMark eNqFks9rFDEUx4NU7LZ6FwQZPPQ268uvmZejLrUVKh6s55DJvrGpM8mazCr61zvbrQoF2VMI-Xy-PPK-J-wopkiMPeew5BzM6-u3H86XAkAvUbVawiO24FpjLbTkR2wBwLE2wqhjdlLK7XxVqJon7Jg3oGZNLtjlpw25r5SrmPLohvDLTSHFqk-5Wt2ESIWq7-kHDVUmn77EcPcaYuWTvxnI5SqMm8HFqTxlj3s3FHp2f56yz-_Or1eX9dXHi_erN1e1V7KdaiWkN0ajFx0i7xF6hWsQBh0hQod9Z7BR2mnfCe6Mdp48uEYI03G-VlqesrN97ianb1sqkx1D8TTMQ1DaFtsgtNyAOQgKBANK88MgSA5Cq4MgN4qjkrvEVw_A27TNcf4Wi007TwitmKGX99C2G2ltNzmMLv-0f3YzA80e8DmVkqm3Pkx365myC4PlYHclsLsS2F0J7L4EswgPxL_Z_1de7JVARP9wpaDhKH8DYQG59w
CODEN IEBEAX
CitedBy_id crossref_primary_10_1121_1_2897047
crossref_primary_10_1016_j_heares_2014_11_003
crossref_primary_10_1097_AUD_0000000000001173
crossref_primary_10_1044_2014_JSLHR_H_12_0404
crossref_primary_10_1097_AUD_0000000000000265
crossref_primary_10_1016_j_ijporl_2011_03_009
crossref_primary_10_1002_lary_23744
Cites_doi 10.1121/1.399052
10.1109/78.80902
10.1121/1.397688
10.1109/89.222875
10.1109/89.294352
10.1121/1.1911939
10.21437/ICSLP.1996-632
10.1109/TAU.1968.1161952
10.1126/science.270.5234.303
10.1109/89.650310
ContentType Journal Article
Copyright Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2005
Copyright_xml – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2005
DBID 97E
RIA
RIE
AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
7QF
7QO
7QQ
7SC
7SE
7SP
7SR
7TA
7TB
7U5
8BQ
8FD
F28
FR3
H8D
JG9
JQ2
KR7
L7M
L~C
L~D
P64
7X8
DOI 10.1109/TBME.2005.847530
DatabaseName IEEE Xplore (IEEE)
IEEE All-Society Periodicals Package (ASPP) 1998–Present
IEEE Electronic Library (IEL)
CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
Aluminium Industry Abstracts
Biotechnology Research Abstracts
Ceramic Abstracts
Computer and Information Systems Abstracts
Corrosion Abstracts
Electronics & Communications Abstracts
Engineered Materials Abstracts
Materials Business File
Mechanical & Transportation Engineering Abstracts
Solid State and Superconductivity Abstracts
METADEX
Technology Research Database
ANTE: Abstracts in New Technology & Engineering
Engineering Research Database
Aerospace Database
Materials Research Database
ProQuest Computer Science Collection
Civil Engineering Abstracts
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
Biotechnology and BioEngineering Abstracts
MEDLINE - Academic
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Materials Research Database
Civil Engineering Abstracts
Aluminium Industry Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
Mechanical & Transportation Engineering Abstracts
Electronics & Communications Abstracts
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Ceramic Abstracts
Materials Business File
METADEX
Biotechnology and BioEngineering Abstracts
Computer and Information Systems Abstracts Professional
Aerospace Database
Engineered Materials Abstracts
Biotechnology Research Abstracts
Solid State and Superconductivity Abstracts
Engineering Research Database
Corrosion Abstracts
Advanced Technologies Database with Aerospace
ANTE: Abstracts in New Technology & Engineering
MEDLINE - Academic
DatabaseTitleList Materials Research Database
MEDLINE - Academic
Engineering Research Database
Technology Research Database

Engineering Research Database
MEDLINE
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 3
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Engineering
EISSN 1558-2531
EndPage 1361
ExternalDocumentID 2352274921
16042003
10_1109_TBME_2005_847530
1440618
Genre orig-research
Research Support, U.S. Gov't, P.H.S
Clinical Trial
Journal Article
Research Support, N.I.H., Extramural
GeographicLocations China
GeographicLocations_xml – name: China
GrantInformation_xml – fundername: NIDCD NIH HHS
  grantid: R01-DC04993
GroupedDBID ---
-~X
.55
.DC
.GJ
0R~
29I
4.4
53G
5GY
5RE
5VS
6IF
6IK
6IL
6IN
85S
97E
AAJGR
AARMG
AASAJ
AAWTH
AAYJJ
ABAZT
ABJNI
ABQJQ
ABVLG
ACGFO
ACGFS
ACIWK
ACKIV
ACNCT
ACPRK
ADZIZ
AENEX
AETIX
AFFNX
AFRAH
AGQYO
AGSQL
AHBIQ
AI.
AIBXA
AKJIK
AKQYR
ALLEH
ALMA_UNASSIGNED_HOLDINGS
ASUFR
ATWAV
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CHZPO
CS3
DU5
EBS
EJD
F5P
HZ~
H~9
IAAWW
IBMZZ
ICLAB
IDIHD
IEGSK
IFIPE
IFJZH
IPLJI
JAVBF
LAI
MS~
O9-
OCL
P2P
RIA
RIE
RIL
RNS
TAE
TN5
VH1
VJK
X7M
ZGI
ZXP
AAYXX
CITATION
RIG
CGR
CUY
CVF
ECM
EIF
NPM
PKN
7QF
7QO
7QQ
7SC
7SE
7SP
7SR
7TA
7TB
7U5
8BQ
8FD
F28
FR3
H8D
JG9
JQ2
KR7
L7M
L~C
L~D
P64
7X8
ID FETCH-LOGICAL-c437t-423c9958c2b881f80f48d0298ae880b8fb98645a5cb21a95acec0a6229b11d453
IEDL.DBID RIE
ISSN 0018-9294
IngestDate Fri Sep 05 06:40:24 EDT 2025
Fri Sep 05 12:12:14 EDT 2025
Thu Sep 04 15:58:18 EDT 2025
Fri Sep 05 11:49:54 EDT 2025
Mon Jun 30 05:24:53 EDT 2025
Wed Feb 19 01:53:28 EST 2025
Tue Jul 01 05:18:20 EDT 2025
Thu Apr 24 22:57:18 EDT 2025
Wed Aug 27 02:52:55 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 7
Language English
License https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c437t-423c9958c2b881f80f48d0298ae880b8fb98645a5cb21a95acec0a6229b11d453
Notes ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
content type line 23
ObjectType-Article-1
ObjectType-Feature-2
PMID 16042003
PQID 867680072
PQPubID 23462
PageCount 4
ParticipantIDs proquest_miscellaneous_19418431
pubmed_primary_16042003
proquest_miscellaneous_68071909
proquest_miscellaneous_20310254
proquest_miscellaneous_28090451
crossref_primary_10_1109_TBME_2005_847530
ieee_primary_1440618
crossref_citationtrail_10_1109_TBME_2005_847530
proquest_journals_867680072
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2005-07-01
PublicationDateYYYYMMDD 2005-07-01
PublicationDate_xml – month: 07
  year: 2005
  text: 2005-07-01
  day: 01
PublicationDecade 2000
PublicationPlace United States
PublicationPlace_xml – name: United States
– name: New York
PublicationTitle IEEE transactions on biomedical engineering
PublicationTitleAbbrev TBME
PublicationTitleAlternate IEEE Trans Biomed Eng
PublicationYear 2005
Publisher IEEE
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Publisher_xml – name: IEEE
– name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
References ref13
ref12
ref14
ref1
Fu (ref4) 1997
ref8
ref7
ref9
ref3
ref6
Wang (ref11) 1993
Chang (ref2)
ref5
Nordstrom (ref10)
References_xml – ident: ref14
  doi: 10.1121/1.399052
– ident: ref5
  doi: 10.1109/78.80902
– volume-title: 8th Annu. Grodins’ Graduate Research Symp.
  ident: ref2
  article-title: The effects of talker variability on vowel recognition in cochlear implant simulation
– ident: ref1
  doi: 10.1121/1.397688
– volume-title: Speech perception in acoustic and electric hearing
  year: 1997
  ident: ref4
– ident: ref6
  doi: 10.1109/89.222875
– ident: ref7
  doi: 10.1109/89.294352
– ident: ref12
  doi: 10.1121/1.1911939
– volume-title: Int. Cong. Phonetic Sci.
  ident: ref10
  article-title: A normalization procedure for vowel formant data
– ident: ref3
  doi: 10.21437/ICSLP.1996-632
– ident: ref9
  doi: 10.1109/TAU.1968.1161952
– volume-title: Internal Materials
  year: 1993
  ident: ref11
  article-title: The standard Chinese database
– ident: ref13
  doi: 10.1126/science.270.5234.303
– ident: ref8
  doi: 10.1109/89.650310
SSID ssj0014846
Score 1.8308452
Snippet Because of the limited spectra-temporal resolution associated with cochlear implants, implant patients often have greater difficulty with multitalker speech...
SourceID proquest
pubmed
crossref
ieee
SourceType Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 1358
SubjectTerms Artificial Intelligence
Auditory implants
China
Cochlear Implants
Computer-Aided Design
Equipment Failure Analysis
Filter bank
Frequency
Humans
Pattern analysis
Pattern matching
Pattern recognition
Phonation
Prosthesis Design
Sound Spectrography - methods
speaker normalization
Speech Acoustics
Speech Perception
Speech processing
Speech recognition
Speech Recognition Software
Testing
Transplants & implants
vowel recognition
Title Speaker normalization for Chinese vowel recognition in cochlear implants
URI https://ieeexplore.ieee.org/document/1440618
https://www.ncbi.nlm.nih.gov/pubmed/16042003
https://www.proquest.com/docview/867680072
https://www.proquest.com/docview/19418431
https://www.proquest.com/docview/20310254
https://www.proquest.com/docview/28090451
https://www.proquest.com/docview/68071909
Volume 52
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1Lb9swDCbaHobtsHXtHm730GGXAXNiO5JCHbeiRTAgu6wFejNkRcaKpk7QJBuwX19Sst1hWIrdDIgG9CDljyb5EeCDpp2va3Qpem1TaXOVWlXZdIx0MeSFR51xcfL0m55cyK-X6nIHPvW1MN77kHzmB_wYYvmzhdvwr7IhByJ1jruwS2oWa7X6iIHEWJST5WTAhZFdSDIzw_Mv09P494SuYjUKzd80KWvWdcpqv0ahvcp2pBm-OGfPYNrNNSaaXA8262rgfv9F4_i_i9mHpy30FJ-jrjyHHd8cwJM_CAkP4NG0DbUfwuT70ttrfysaRrXztlxTEMYV3HPbr7z4ufjl56JPQaLRq0bQBfuDW1GIq5vlnJNsXsDF2en5ySRt2y6kTo7G65QAljNGoSsqxLzGrJY4Y6Z268nYK6wrpnRXVrmqyK1R1nmXWV0Uhk53JtXoJew1i8a_BqEKTzdqpcjnRGlkbmpFTvEYtTUEjHCWwLDb_tK1nOTcGmNeBt8kMyWfHbfKVGU8uwQ-9m8sIx_HA7KHvO33cnHHEzjuTrhsDXZVoia_i2nUE3jfj5KlcfjENn6xWZU5LQAJb22XKJhnlTzuByQwM8zos12CZjEmkGYSeBWV7376rc4e_XtZx_A40MqGVOI3sLe-3fi3BJjW1btgKXdTHQ2y
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LbxMxEB6VIvE48Gh5LAXqAxckNtnd2M74CKhVgG4vpFJvK68zK6qGTdQkIPHrGXsfRYhU3FbyrOTX2N94Zr4BeKN55qsKXYykbSxtqmKrShuPkQ-GNCPUiU9Ozk_15Ex-PlfnO_Cuz4UhohB8RgP_GXz5s4Xb-KeyoXdE6hRvwW2-96VqsrV6n4HEJi0nSVmFMyM7p2RihtMP-VHzfsKHsRqF8m-at2vS1cpq76NQYGU71gx3zvFDyLveNqEml4PNuhy4X38ROf7vcB7BgxZ8ivfNbnkMO1Tvwf0_KAn34E7eOtv3YfJ1SfaSrkTtce28TdgUjHKFr7pNKxI_Fj9pLvogJG69qAUfsd98MQpx8X0592E2T-Ds-Gj6cRK3hRdiJ0fjdcwQyxmj0GUlYlphUkmcea52S6zuJValJ3VXVrkyS61R1pFLrM4yw-s7k2r0FHbrRU3PQaiM-EwtFVudKI1MTaXYLB6jtoahEc4iGHbTX7iWldwXx5gXwTpJTOHXzhfLVEWzdhG87f9YNowcN8ju-2m_lmtmPIKDboWLVmVXBWq2vDyRegSHfSvrmneg2JoWm1WR8gCQEdd2icwzrbLNfYMEJsZz-myX4F6MGaaZCJ41m--6--2effHvYR3C3ck0PylOPp1-OYB7gWQ2BBa_hN311YZeMXxal6-D1vwGkqYQ_w
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Speaker+Normalization+for+Chinese+Vowel+Recognition+in+Cochlear+Implants&rft.jtitle=IEEE+transactions+on+biomedical+engineering&rft.au=Luo%2C+X.&rft.au=Fu%2C+Q.-J.&rft.date=2005-07-01&rft.issn=0018-9294&rft.volume=52&rft.issue=7&rft.spage=1358&rft.epage=1361&rft_id=info:doi/10.1109%2FTBME.2005.847530&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_TBME_2005_847530
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0018-9294&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0018-9294&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0018-9294&client=summon