A speech enhancement algorithm using computational auditory scene analysis with spectral subtraction

Computational auditory scene analysis (CASA) system is well used in speech enhancement area in recent years. We propose a new system that combines CASA and spectral subtraction to get better enhanced speech. The CASA part consists of the latest method deep neural networks (DNNs). The original way to...

Full description

Saved in:

Bibliographic Details
Published in	2016 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) pp. 6 - 10
Main Authors	Cong Guo, Like Hui, Wei-Qiang Zhang, Jia Liu
Format	Conference Proceeding
Language	English
Published	IEEE 01.12.2016
Subjects	computational auditory scene analysis (CASA) deep neural network (DNN) Neural networks Signal processing algorithms Signal to noise ratio Speech Speech enhancement Time-frequency analysis Training
Online Access	Get full text

Cover

Loading…

Abstract	Computational auditory scene analysis (CASA) system is well used in speech enhancement area in recent years. We propose a new system that combines CASA and spectral subtraction to get better enhanced speech. The CASA part consists of the latest method deep neural networks (DNNs). The original way to reconstruct the denoise signal is to use the estimated masks with direct overlap-add method ignoring the information of noise within the frames. In our system, we estimate self-adapted thresholds for each channel by Gaussian Mixture Model from the estimated ratio masks (ERMs) to separate noise and speech of each channel. In this way, we make full use of the information within frames. The results show increase in both objective and subjective evaluation.
AbstractList	Computational auditory scene analysis (CASA) system is well used in speech enhancement area in recent years. We propose a new system that combines CASA and spectral subtraction to get better enhanced speech. The CASA part consists of the latest method deep neural networks (DNNs). The original way to reconstruct the denoise signal is to use the estimated masks with direct overlap-add method ignoring the information of noise within the frames. In our system, we estimate self-adapted thresholds for each channel by Gaussian Mixture Model from the estimated ratio masks (ERMs) to separate noise and speech of each channel. In this way, we make full use of the information within frames. The results show increase in both objective and subjective evaluation.
Author	Wei-Qiang Zhang Jia Liu Cong Guo Like Hui
Author_xml	– sequence: 1 surname: Cong Guo fullname: Cong Guo organization: Dept. of Electron. Eng., Tsinghua Univ., Beijing, China – sequence: 2 surname: Like Hui fullname: Like Hui organization: Dept. of Electron. Eng., Tsinghua Univ., Beijing, China – sequence: 3 surname: Wei-Qiang Zhang fullname: Wei-Qiang Zhang email: wqzhang@tsinghua.edu.cn organization: Dept. of Electron. Eng., Tsinghua Univ., Beijing, China – sequence: 4 surname: Jia Liu fullname: Jia Liu organization: Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
BookMark	eNotj8tqwzAURFVoF22aL8hGP2D3Sn5qGUIfgUAL8T5cS9exwJaNJVP893VoVgPDnAPzwh7d4IixnYBYCFBvx_P551jFEkQeF2WZA8AD26qiFBkoyMo0hWdm9tyPRLrl5Fp0mnpygWN3HSYb2p7P3ror10M_zgGDHRx2HGdjwzAt3GtyxHHtFm89_12Jm02HaV35uV5T35hX9tRg52l7zw2rPt6rw1d0-v48HvanyCoIkRZJlkrVFChSyo0C0Qhdm0ZSognIqJoKUCpvcoOJ1CWiASkVUYaGhM6TDdv9ay0RXcbJ9jgtl_v15A9VI1cB
ContentType	Conference Proceeding
DBID	6IE 6IL CBEJK RIE RIL
DOI	10.1109/ISSPIT.2016.7886000
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Xplore IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE Xplore url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
EISBN	9781509058440 1509058443
EndPage	10
ExternalDocumentID	7886000
Genre	orig-research
GroupedDBID	6IE 6IL CBEJK RIE RIL
ID	FETCH-LOGICAL-i90t-c135429f7a14e6d901f1cbdf2e3ce0ed9be70996f6da32c8aad0229ee5ade1c63
IEDL.DBID	RIE
IngestDate	Thu Jun 29 18:37:44 EDT 2023
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-i90t-c135429f7a14e6d901f1cbdf2e3ce0ed9be70996f6da32c8aad0229ee5ade1c63
PageCount	5
ParticipantIDs	ieee_primary_7886000
PublicationCentury	2000
PublicationDate	2016-Dec.
PublicationDateYYYYMMDD	2016-12-01
PublicationDate_xml	– month: 12 year: 2016 text: 2016-Dec.
PublicationDecade	2010
PublicationTitle	2016 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)
PublicationTitleAbbrev	ISSPIT
PublicationYear	2016
Publisher	IEEE
Publisher_xml	– name: IEEE
Score	1.6908963
Snippet	Computational auditory scene analysis (CASA) system is well used in speech enhancement area in recent years. We propose a new system that combines CASA and...
SourceID	ieee
SourceType	Publisher
StartPage	6
SubjectTerms	computational auditory scene analysis (CASA) deep neural network (DNN) Neural networks Signal processing algorithms Signal to noise ratio Speech Speech enhancement Time-frequency analysis Training
Title	A speech enhancement algorithm using computational auditory scene analysis with spectral subtraction
URI	https://ieeexplore.ieee.org/document/7886000
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NS8NAEF1qT55UWvGbPXg0aTYf2-YoYmmFSqEVeiu7s5NW1KSkyUF_vbNJrCgevIVlQ8LO4c2bnfeGsWsCKQgSCQ4EvnFCUJ6jIkWJnIzNwE8IAYXVDk8e5egpfFhEixa72WlhELFqPkPXPlZ3-SaD0pbKekTXCJ-JoO8NPL_WajVGQsKLe-PZbDqe224t6TY7f4xMqRBjeMAmX9-qG0Ve3LLQLnz8smH8788csu63No9Pd6hzxFqYdpi55dsNIqw5pmsbR_suV6-rjLj_-o3b7vYVh2qCQ1P948rqMbL8nVs_J-SqcSfhtjLLKwFmTru2pS7yWvzQZfPh_fxu5DTzE5zn2CscEIEdRpX0lQhRGgL-RIA2iY8BoIcm1tin_FAm0qjAh4FShgA9RoyUQQEyOGbtNEvxhHFibbExdsS8JCrdD1VEmVIIRH-0F2lhTlnHHtByUztkLJuzOft7-Zzt2yDVTSEXrF3kJV4StBf6qorpJ4-EqPY
link.rule.ids	310,311,786,790,795,796,802,27958,55109
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NT8JAEN0QPOhJDRi_3YNHW_q50KMxElAgJNSEG9nOTsGoLSntQX-9s23FaDx4a5vdtNk5vHnT92YYuyaQAjcWYIDrKMMDaRnSl5TIiUD1nJgQ0Nbe4fFEDJ68h7k_b7CbrRcGEUvxGZr6svyXr1IodKmsQ3SN8JkI-g7hvBVUbq26lRDdd4az2XQYar2WMOu1P4amlJjR32fjr7dVUpEXs8gjEz5-NWL87-ccsPa3O49Pt7hzyBqYtJi65Zs1Iqw4JisdSb2Xy9dlSux_9ca1vn3JoZzhUNf_uNSOjDR757qjE3JZ9yfhujbLSwtmRqs2RZRnlf2hzcL-fXg3MOoJCsZzYOUG2K4eRxV3pe2hUAT9sQ2Rih10AS1UQYRdyhBFLJR0HehJqQjSA0RfKrRBuEesmaQJHjNOvC1QSg-ZF0Smu570KVfygAhQZPmRrU5YSx_QYl31yFjUZ3P69-MrtjsIx6PFaDh5PGN7OmCVROScNfOswAsC-jy6LOP7CaNOrEw
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2016+IEEE+International+Symposium+on+Signal+Processing+and+Information+Technology+%28ISSPIT%29&rft.atitle=A+speech+enhancement+algorithm+using+computational+auditory+scene+analysis+with+spectral+subtraction&rft.au=Cong+Guo&rft.au=Like+Hui&rft.au=Wei-Qiang+Zhang&rft.au=Jia+Liu&rft.date=2016-12-01&rft.pub=IEEE&rft.spage=6&rft.epage=10&rft_id=info:doi/10.1109%2FISSPIT.2016.7886000&rft.externalDocID=7886000