Post-filtering Technique Using Band Importance Function for Speech Intelligibility Enhancement

Conventional speech enhancement (SE) algorithms are mainly designed with the aim of improving signal-to-noise levels of noisy speech signals. However, many applications consider the enhancement of speech intelligibility as the goal for an SE system. In this study, we propose a maximum speech intelli...

Bibliographic Details
Published in 2016 IEEE Second International Conference on Multimedia Big Data (BigMM), pp. 487-491
Main Authors Ying-Hui Lai, Shih-Tsang Tang, Pei-Chun Li
Format Conference Proceeding
Language English
Published IEEE 01.04.2016
Subjects

Abstract Conventional speech enhancement (SE) algorithms are mainly designed with the aim of improving signal-to-noise levels of noisy speech signals. However, many applications consider the enhancement of speech intelligibility as the goal for an SE system. In this study, we propose a maximum speech intelligibility (MSI) post-filter that aims to enhance the intelligibility of processed speech signals. The MSI post-filter is designed to specify a weight for each frequency band of the speech signal based on the critical band importance function. To evaluate the MSI post-filter, we combine it with a recently proposed generalized maximum a posteriori spectral amplitude estimation (GMAPA) SE algorithm. In previous studies, it has been verified that GMAPA outperforms several well-known spectral restoration approaches in terms of objective evaluations and speech recognition tests. Experimental results from the present study confirm that GMAPA also provides better results in a set of subjective intelligibility tests conducted with human subjects. Moreover, the integration of GMAPA and MSI can further improve the intelligibility scores over GMAPA alone under -10 dB to 5 dB signal-to-noise ratio conditions.
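The abstract states only that the MSI post-filter assigns a weight to each frequency band according to a band importance function; it gives no formula. The following is therefore a minimal sketch of per-band weighting in general, not the authors' method. The function names, the band edges, the importance values, and the choice to normalize so that the most important band keeps unit gain are all assumptions made for illustration.

```python
from bisect import bisect_right


def band_gains(importance):
    """Scale importance values so the largest one maps to a gain of 1.0.

    The normalization rule is an assumption, not taken from the paper.
    """
    peak = max(importance)
    if peak <= 0:
        raise ValueError("importance must contain a positive value")
    return [w / peak for w in importance]


def apply_post_filter(magnitudes, bin_freqs_hz, band_upper_edges_hz, importance):
    """Multiply each spectral magnitude by the gain of the band it falls in.

    band_upper_edges_hz must be sorted ascending, one edge per band.
    A bin exactly on an edge is assigned to the next band; bins above
    the last edge are clamped into the last band.
    """
    gains = band_gains(importance)
    filtered = []
    for mag, freq in zip(magnitudes, bin_freqs_hz):
        band = min(bisect_right(band_upper_edges_hz, freq), len(gains) - 1)
        filtered.append(mag * gains[band])
    return filtered


# Hypothetical three-band example; these are NOT published importance values.
example = apply_post_filter(
    magnitudes=[1.0, 1.0, 1.0],
    bin_freqs_hz=[250.0, 1000.0, 4000.0],
    band_upper_edges_hz=[500.0, 2000.0, 8000.0],
    importance=[0.1, 0.4, 0.2],
)
```

In this toy setup the middle band has the largest importance value and so passes unchanged, while the other two bands are attenuated in proportion to their relative importance. A real system would apply such gains to the output of the enhancement stage (GMAPA in the paper) frame by frame.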
Author Shih-Tsang Tang
Pei-Chun Li
Ying-Hui Lai
Author_xml – sequence: 1
  surname: Ying-Hui Lai
  fullname: Ying-Hui Lai
  email: jackylai@citi.sinica.edu.tw
  organization: Res. Center for Inf., Technol. Innovation, Taipei, Taiwan
– sequence: 2
  surname: Shih-Tsang Tang
  fullname: Shih-Tsang Tang
  email: sttang@mail.mcu.edu.tw
  organization: Dept. of Biomed. Eng., Ming Chuan Univ., Taoyuan, Taiwan
– sequence: 3
  surname: Pei-Chun Li
  fullname: Pei-Chun Li
  email: ankh_li@mmc.edu.tw
  organization: Dept. of Audiology & Speech-Language Pathology, Mackay Med. Coll., Taipei, Taiwan
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/BigMM.2016.90
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE/IET Electronic Library
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 1509021795
9781509021796
EndPage 491
ExternalDocumentID 7545074
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i208t-4c0350e3fd274f84f9b25ae4b6433131e0bfa9a3ae68d8970d7e8eb0fc1a9f893
IEDL.DBID RIE
IngestDate Thu Jun 29 18:35:57 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i208t-4c0350e3fd274f84f9b25ae4b6433131e0bfa9a3ae68d8970d7e8eb0fc1a9f893
PageCount 5
ParticipantIDs ieee_primary_7545074
PublicationCentury 2000
PublicationDate 20160401
PublicationDateYYYYMMDD 2016-04-01
PublicationDate_xml – month: 04
  year: 2016
  text: 20160401
  day: 01
PublicationDecade 2010
PublicationTitle 2016 IEEE Second International Conference on Multimedia Big Data (BigMM)
PublicationTitleAbbrev BigMM
PublicationYear 2016
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.596239
Snippet Conventional speech enhancement (SE) algorithms are mainly designed with the aim of improving signal-to-noise levels of noisy speech signals. However, many...
SourceID ieee
SourceType Publisher
StartPage 487
SubjectTerms Algorithm design and analysis
GMAPA algorithm
intelligibility-oriented speech enhancement
Noise measurement
Noise reduction
Signal to noise ratio
spectral restoration
Speech
Speech enhancement
Title Post-filtering Technique Using Band Importance Function for Speech Intelligibility Enhancement
URI https://ieeexplore.ieee.org/document/7545074
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
linkProvider IEEE
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2016+IEEE+Second+International+Conference+on+Multimedia+Big+Data+%28BigMM%29&rft.atitle=Post-filtering+Technique+Using+Band+Importance+Function+for+Speech+Intelligibility+Enhancement&rft.au=Ying-Hui+Lai&rft.au=Shih-Tsang+Tang&rft.au=Pei-Chun+Li&rft.date=2016-04-01&rft.pub=IEEE&rft.spage=487&rft.epage=491&rft_id=info:doi/10.1109%2FBigMM.2016.90&rft.externalDocID=7545074