A speech enhancement algorithm using computational auditory scene analysis with spectral subtraction

Bibliographic Details
Published in 2016 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT), pp. 6-10
Main Authors Cong Guo, Like Hui, Wei-Qiang Zhang, Jia Liu
Format Conference Proceeding
Language English
Published IEEE 01.12.2016
Abstract Computational auditory scene analysis (CASA) systems have been widely used in speech enhancement in recent years. We propose a new system that combines CASA with spectral subtraction to obtain better enhanced speech. The CASA part is built on a recent method, deep neural networks (DNNs). The conventional way to reconstruct the denoised signal is to apply the estimated masks with a direct overlap-add method, which ignores the noise information within the frames. In our system, we use a Gaussian mixture model (GMM) to estimate a self-adapted threshold for each channel from the estimated ratio masks (ERMs), and use these thresholds to separate noise and speech in each channel. In this way, we make full use of the information within frames. The results show improvements in both objective and subjective evaluations.
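The per-channel thresholding step mentioned in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: a two-component 1-D mixture per channel, a threshold placed where the higher-mean component becomes at least as likely as the lower-mean one, and zeroing of mask values below that threshold. The function names (`fit_gmm_1d`, `channel_threshold`, `apply_thresholds`) are hypothetical, and the sketch does not cover the DNN mask estimation, spectral subtraction, or overlap-add resynthesis.

```python
import math
import random


def gauss_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def fit_gmm_1d(values, n_iter=50, var_floor=1e-6):
    """Fit a two-component 1-D GMM with plain EM (illustrative assumption).

    Returns (weights, means, variances), ordered so that component 0
    has the lower mean (treated here as the noise-dominated component).
    """
    lo, hi = min(values), max(values)
    span = max(hi - lo, 1e-3)
    means = [lo + 0.25 * span, lo + 0.75 * span]
    variances = [(span / 4.0) ** 2, (span / 4.0) ** 2]
    weights = [0.5, 0.5]
    for _ in range(n_iter):
        # E-step: responsibility of each component for each mask value.
        resp = []
        for x in values:
            p = [w * gauss_pdf(x, m, v) for w, m, v in zip(weights, means, variances)]
            total = sum(p) or 1e-300
            resp.append([pk / total for pk in p])
        # M-step: re-estimate weights, means, and variances.
        for k in range(2):
            nk = sum(r[k] for r in resp) + 1e-12
            means[k] = sum(r[k] * x for r, x in zip(resp, values)) / nk
            var = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, values)) / nk
            variances[k] = max(var, var_floor)
            weights[k] = nk / len(values)
    order = sorted(range(2), key=lambda k: means[k])
    return ([weights[k] for k in order], [means[k] for k in order],
            [variances[k] for k in order])


def channel_threshold(mask_values, grid=1000):
    """Mask value between the two component means where the higher-mean
    (speech-like) component first becomes at least as likely as the other."""
    weights, means, variances = fit_gmm_1d(mask_values)
    for i in range(grid + 1):
        x = means[0] + (means[1] - means[0]) * i / grid
        noise = weights[0] * gauss_pdf(x, means[0], variances[0])
        speech = weights[1] * gauss_pdf(x, means[1], variances[1])
        if speech >= noise:
            return x
    return 0.5 * (means[0] + means[1])


def apply_thresholds(erm):
    """erm[c][t] is the estimated ratio mask of channel c at frame t.
    Returns a copy in which values below each channel's threshold are zeroed
    (the zeroing rule is an assumption of this sketch)."""
    out = []
    for channel in erm:
        thr = channel_threshold(channel)
        out.append([m if m >= thr else 0.0 for m in channel])
    return out
```

Because each channel gets its own fitted mixture, the threshold adapts to how the mask values are distributed in that channel rather than relying on one global cutoff.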
Author affiliations
Cong Guo, Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Like Hui, Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Wei-Qiang Zhang (wqzhang@tsinghua.edu.cn), Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Jia Liu, Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
DOI 10.1109/ISSPIT.2016.7886000
EISBN 9781509058440
1509058443
Subject Terms Computational auditory scene analysis (CASA)
Deep neural network (DNN)
Neural networks
Signal processing algorithms
Signal to noise ratio
Speech
Speech enhancement
Time-frequency analysis
Training
URI https://ieeexplore.ieee.org/document/7886000