Enhancement of connected words in an extremely noisy environment
Published in | IEEE transactions on speech and audio processing Vol. 5; no. 2; pp. 141 - 148 |
---|---|
Main Authors | Cohen, Y., Erell, A., Bistritz, Y. |
Format | Journal Article |
Language | English |
Published | New York, NY: IEEE (Institute of Electrical and Electronics Engineers), 01.03.1997 |
Subjects | |
ISSN | 1063-6676 |
DOI | 10.1109/89.554776 |
Abstract | A speech enhancement algorithm that is based on a connected-word hidden Markov model (HMM) is developed. Speech is assumed to be highly degraded by statistically independent additive noise. The minimum mean square error estimator is derived for a connected-word HMM. Further, we derive an estimator based on a connected-word HMM with explicit state duration. Listening experiments performed with digit strings have shown an increase of intelligibility. The best results were achieved when subjects who listened to the enhanced speech were given the results of an automatic recognition system. |
---|---|
Author | Cohen, Y.; Bistritz, Y.; Erell, A. |
Author_xml | – sequence: 1 givenname: Y. surname: Cohen fullname: Cohen, Y. organization: Electr. Eng. Group, RND Networks Ltd., Tel Aviv, Israel – sequence: 2 givenname: A. surname: Erell fullname: Erell, A. – sequence: 3 givenname: Y. surname: Bistritz fullname: Bistritz, Y. |
BackLink | http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=2585374$$DView record in Pascal Francis |
CODEN | IESPEJ |
CitedBy_id | crossref_primary_10_1080_09720529_2000_10697904 crossref_primary_10_1109_89_952492 crossref_primary_10_1155_2016_9161723 crossref_primary_10_1016_j_artint_2009_11_011 |
Cites_doi | 10.1109/ICASSP.1984.1172716 10.1109/ICASSP.1990.115960 10.1016/S0885-2308(86)80009-2 10.1109/78.80762 10.1109/78.127947 10.1109/PROC.1979.11540 10.1109/29.45532 10.1016/S0885-2308(86)80021-3 10.1109/5.18626 10.1109/5.168664 |
ContentType | Journal Article |
Copyright | 1997 INIST-CNRS |
Copyright_xml | – notice: 1997 INIST-CNRS |
DBID | AAYXX CITATION IQODW 7QO 8FD FR3 P64 7SC JQ2 L7M L~C L~D |
DOI | 10.1109/89.554776 |
DatabaseName | CrossRef Pascal-Francis Biotechnology Research Abstracts Technology Research Database Engineering Research Database Biotechnology and BioEngineering Abstracts Computer and Information Systems Abstracts ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
DatabaseTitle | CrossRef Engineering Research Database Biotechnology Research Abstracts Technology Research Database Biotechnology and BioEngineering Abstracts Computer and Information Systems Abstracts Computer and Information Systems Abstracts – Academic Advanced Technologies Database with Aerospace ProQuest Computer Science Collection Computer and Information Systems Abstracts Professional |
DatabaseTitleList | Engineering Research Database Computer and Information Systems Abstracts |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering Applied Sciences |
EndPage | 148 |
ExternalDocumentID | 2585374 10_1109_89_554776 554776 |
GroupedDBID | -~X 0R~ 29I 5GY 6IK 97E AAJGR AAWTH ABAZT ABJNI ABQJQ ABVLG ACGFS AETIX AGQYO AHBIQ AI. AIBXA ALLEH ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 EBS EJD HZ~ H~9 ICLAB IFIPE IFJZH IPLJI JAVBF LAI M43 O9- OCL RIA RIE RNS TN5 VH1 AAYOK AAYXX CITATION RIG IQODW 7QO 8FD FR3 P64 7SC JQ2 L7M L~C L~D |
ID | FETCH-LOGICAL-c337t-32bd16734c5bc9deba91562fc0078f0893c4a9ff95b9168cb608dc17d66920f83 |
IEDL.DBID | RIE |
ISSN | 1063-6676 |
IngestDate | Thu Sep 04 19:56:45 EDT 2025 Thu Sep 04 18:25:40 EDT 2025 Wed Apr 02 07:18:17 EDT 2025 Thu Apr 24 22:53:47 EDT 2025 Tue Jul 01 00:47:24 EDT 2025 Wed Aug 27 02:56:47 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | false |
Issue | 2 |
Keywords | Additive noise; Performance evaluation; Error estimation; Least squares method; Noise reduction; Speech recognition; Markov model; Algorithm; Speech processing |
Language | English |
License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html CC BY 4.0 |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c337t-32bd16734c5bc9deba91562fc0078f0893c4a9ff95b9168cb608dc17d66920f83 |
Notes | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 |
PQID | 15943723 |
PQPubID | 23462 |
PageCount | 8 |
ParticipantIDs | crossref_primary_10_1109_89_554776 pascalfrancis_primary_2585374 ieee_primary_554776 proquest_miscellaneous_28433778 proquest_miscellaneous_15943723 crossref_citationtrail_10_1109_89_554776 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 1900 |
PublicationDate | 1997-03-01 |
PublicationDateYYYYMMDD | 1997-03-01 |
PublicationDate_xml | – month: 03 year: 1997 text: 1997-03-01 day: 01 |
PublicationDecade | 1990 |
PublicationPlace | New York, NY |
PublicationPlace_xml | – name: New York, NY |
PublicationTitle | IEEE transactions on speech and audio processing |
PublicationTitleAbbrev | T-SAP |
PublicationYear | 1997 |
Publisher | IEEE Institute of Electrical and Electronics Engineers |
Publisher_xml | – name: IEEE – name: Institute of Electrical and Electronics Engineers |
SSID | ssj0008174 |
Score | 1.2596253 |
SourceID | proquest pascalfrancis crossref ieee |
SourceType | Aggregation Database Index Database Enrichment Source Publisher |
StartPage | 141 |
SubjectTerms | Additive noise; Applied sciences; Automatic speech recognition; Degradation; Exact sciences and technology; Hidden Markov models; Information, signal and communications theory; Mean square error methods; Noise level; Signal processing; Signal to noise ratio; Speech enhancement; Speech processing; Speech recognition; Telecommunications and information theory; Working environment noise |
Title | Enhancement of connected words in an extremely noisy environment |
URI | https://ieeexplore.ieee.org/document/554776 https://www.proquest.com/docview/15943723 https://www.proquest.com/docview/28433778 |
Volume | 5 |
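
Illustrative note on the abstract: the minimum mean square error (MMSE) estimator it mentions can be sketched in a toy setting. The record does not give the paper's signal representation, model sizes, or formulas, so everything below is an assumption made for illustration: scalar observations, one Gaussian per HMM state for the clean signal, and zero-mean Gaussian noise that is independent of the speech. The function names (`gaussian_pdf`, `state_posteriors`, `mmse_enhance`) are hypothetical. Under these assumptions the MMSE estimate at each frame is the sum over states of the state posterior (from forward-backward on the noisy data) times the per-state conditional mean, which takes a Wiener-like form. Explicit state durations, which the abstract also mentions, are not modeled here.

```python
import numpy as np

def gaussian_pdf(y, mean, var):
    """Density of N(mean, var) at y (broadcasts over arrays)."""
    return np.exp(-0.5 * (y - mean) ** 2 / var) / np.sqrt(2.0 * np.pi * var)

def state_posteriors(y, pi, A, means, clean_vars, noise_var):
    """Scaled forward-backward on the noisy sequence y.

    Returns gamma[t, s] = P(state_t = s | y_1..y_T).
    Assumption: noise is zero-mean and independent of the clean signal,
    so the noisy emission in state s is N(means[s], clean_vars[s] + noise_var).
    """
    y = np.asarray(y, dtype=float)
    pi = np.asarray(pi, dtype=float)
    A = np.asarray(A, dtype=float)
    means = np.asarray(means, dtype=float)
    clean_vars = np.asarray(clean_vars, dtype=float)
    T, S = len(y), len(pi)

    # B[t, s]: likelihood of noisy frame t under state s.
    B = gaussian_pdf(y[:, None], means[None, :], clean_vars[None, :] + noise_var)

    alpha = np.zeros((T, S))
    beta = np.ones((T, S))
    scale = np.zeros(T)

    # Forward pass, normalized per frame to avoid underflow.
    alpha[0] = pi * B[0]
    scale[0] = alpha[0].sum()
    alpha[0] /= scale[0]
    for t in range(1, T):
        alpha[t] = (alpha[t - 1] @ A) * B[t]
        scale[t] = alpha[t].sum()
        alpha[t] /= scale[t]

    # Backward pass, using the same scale factors.
    for t in range(T - 2, -1, -1):
        beta[t] = (A @ (B[t + 1] * beta[t + 1])) / scale[t + 1]

    gamma = alpha * beta
    return gamma / gamma.sum(axis=1, keepdims=True)

def mmse_enhance(y, pi, A, means, clean_vars, noise_var):
    """MMSE estimate of the clean sequence: sum_s gamma[t, s] * E[x_t | y_t, s]."""
    y = np.asarray(y, dtype=float)
    means = np.asarray(means, dtype=float)
    clean_vars = np.asarray(clean_vars, dtype=float)
    gamma = state_posteriors(y, pi, A, means, clean_vars, noise_var)
    # Per-state Wiener-like gain for Gaussian signal plus Gaussian noise.
    gains = clean_vars / (clean_vars + noise_var)
    cond_means = means[None, :] + gains[None, :] * (y[:, None] - means[None, :])
    return (gamma * cond_means).sum(axis=1)
```

In this sketch, a connected-word model would simply be a larger transition matrix `A` that chains the word models together; the estimator itself is unchanged. With a single state the estimate reduces to a fixed Wiener-like shrinkage of each observation toward the state mean.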