Post-filtering Technique Using Band Importance Function for Speech Intelligibility Enhancement
Published in | 2016 IEEE Second International Conference on Multimedia Big Data (BigMM), pp. 487-491 |
---|---|
Main Authors | Ying-Hui Lai, Shih-Tsang Tang, Pei-Chun Li |
Format | Conference Proceeding |
Language | English |
Published | IEEE, 01.04.2016 |
Subjects | |
Abstract | Conventional speech enhancement (SE) algorithms are mainly designed to improve the signal-to-noise levels of noisy speech signals. However, many applications treat the enhancement of speech intelligibility as the goal of an SE system. In this study, we propose a maximum speech intelligibility (MSI) post-filter that aims to enhance the intelligibility of processed speech signals. The MSI post-filter assigns a weight to each frequency band of the speech signal based on the critical band importance function. To evaluate the MSI post-filter, we combine it with a recently proposed generalized maximum a posteriori spectral amplitude estimation (GMAPA) SE algorithm. Previous studies have verified that GMAPA outperforms several well-known spectral restoration approaches in objective evaluations and speech recognition tests. Experimental results from the present study confirm that GMAPA also provides better results in a set of subjective intelligibility tests conducted with human subjects. Moreover, integrating GMAPA with MSI further improves the intelligibility scores over GMAPA alone under signal-to-noise ratio conditions from -10 dB to 5 dB. |
---|---|
Author | Shih-Tsang Tang; Pei-Chun Li; Ying-Hui Lai |
Author_xml | – sequence: 1 surname: Ying-Hui Lai fullname: Ying-Hui Lai email: jackylai@citi.sinica.edu.tw organization: Res. Center for Inf., Technol. Innovation, Taipei, Taiwan – sequence: 2 surname: Shih-Tsang Tang fullname: Shih-Tsang Tang email: sttang@mail.mcu.edu.tw organization: Dept. of Biomed. Eng., Ming Chuan Univ., Taoyuan, Taiwan – sequence: 3 surname: Pei-Chun Li fullname: Pei-Chun Li email: ankh_li@mmc.edu.tw organization: Dept. of Audiology & Speech-Language Pathology, Mackay Med. Coll., Taipei, Taiwan |
CODEN | IEEPAD |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/BigMM.2016.90 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE/IET Electronic Library IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 1509021795 9781509021796 |
EndPage | 491 |
ExternalDocumentID | 7545074 |
Genre | orig-research |
GroupedDBID | 6IE 6IL CBEJK RIE RIL |
ID | FETCH-LOGICAL-i208t-4c0350e3fd274f84f9b25ae4b6433131e0bfa9a3ae68d8970d7e8eb0fc1a9f893 |
IEDL.DBID | RIE |
IngestDate | Thu Jun 29 18:35:57 EDT 2023 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i208t-4c0350e3fd274f84f9b25ae4b6433131e0bfa9a3ae68d8970d7e8eb0fc1a9f893 |
PageCount | 5 |
ParticipantIDs | ieee_primary_7545074 |
PublicationCentury | 2000 |
PublicationDate | 20160401 |
PublicationDateYYYYMMDD | 2016-04-01 |
PublicationDate_xml | – month: 04 year: 2016 text: 20160401 day: 01 |
PublicationDecade | 2010 |
PublicationTitle | 2016 IEEE Second International Conference on Multimedia Big Data (BigMM) |
PublicationTitleAbbrev | BigMM |
PublicationYear | 2016 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
Score | 1.596239 |
SourceID | ieee |
SourceType | Publisher |
StartPage | 487 |
SubjectTerms | Algorithm design and analysis; GMAPA algorithm; intelligibility-oriented speech enhancement; Noise measurement; Noise reduction; Signal to noise ratio; spectral restoration; Speech; Speech enhancement |
Title | Post-filtering Technique Using Band Importance Function for Speech Intelligibility Enhancement |
URI | https://ieeexplore.ieee.org/document/7545074 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
linkProvider | IEEE |
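
The abstract describes the MSI post-filter as assigning a weight to each frequency band of the speech signal. The record does not give the actual band edges, the band importance values, or how the weights are derived, so the following is only a minimal sketch of per-band weighting applied to a magnitude spectrum. The function name `apply_band_weights`, the band edges, and the weight values are all hypothetical and are not taken from the paper.

```python
from bisect import bisect_right


def apply_band_weights(magnitudes, bin_freqs_hz, band_edges_hz, band_weights):
    """Scale each spectral magnitude by the weight of the band containing it.

    band_edges_hz is an ascending list with len(band_weights) + 1 entries;
    band i covers [band_edges_hz[i], band_edges_hz[i + 1]).
    Bins outside all bands are passed through unchanged.
    NOTE: illustrative only; the weights are placeholders, not the
    band importance function values used by the MSI post-filter.
    """
    if len(band_edges_hz) != len(band_weights) + 1:
        raise ValueError("need exactly one more band edge than band weights")
    if len(magnitudes) != len(bin_freqs_hz):
        raise ValueError("magnitudes and bin_freqs_hz must have equal length")

    weighted = []
    for mag, freq in zip(magnitudes, bin_freqs_hz):
        if freq < band_edges_hz[0] or freq >= band_edges_hz[-1]:
            weighted.append(mag)  # outside every band: leave untouched
            continue
        band = bisect_right(band_edges_hz, freq) - 1  # index of containing band
        weighted.append(mag * band_weights[band])
    return weighted


# Hypothetical three-band layout with made-up weights.
example = apply_band_weights(
    magnitudes=[2.0, 2.0, 2.0, 2.0],
    bin_freqs_hz=[500, 1000, 3000, 5000],
    band_edges_hz=[0, 1000, 2000, 4000],
    band_weights=[0.5, 1.0, 2.0],
)
```

In a full system such a weighting would be applied frame by frame to the output of the enhancement stage (GMAPA in the paper) before resynthesis; that pipeline is not shown here.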