A speech enhancement algorithm using computational auditory scene analysis with spectral subtraction
Computational auditory scene analysis (CASA) system is well used in speech enhancement area in recent years. We propose a new system that combines CASA and spectral subtraction to get better enhanced speech. The CASA part consists of the latest method deep neural networks (DNNs). The original way to...
Saved in:
Published in | 2016 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) pp. 6 - 10 |
---|---|
Main Authors | , , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.12.2016
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Computational auditory scene analysis (CASA) system is well used in speech enhancement area in recent years. We propose a new system that combines CASA and spectral subtraction to get better enhanced speech. The CASA part consists of the latest method deep neural networks (DNNs). The original way to reconstruct the denoise signal is to use the estimated masks with direct overlap-add method ignoring the information of noise within the frames. In our system, we estimate self-adapted thresholds for each channel by Gaussian Mixture Model from the estimated ratio masks (ERMs) to separate noise and speech of each channel. In this way, we make full use of the information within frames. The results show increase in both objective and subjective evaluation. |
---|---|
AbstractList | Computational auditory scene analysis (CASA) system is well used in speech enhancement area in recent years. We propose a new system that combines CASA and spectral subtraction to get better enhanced speech. The CASA part consists of the latest method deep neural networks (DNNs). The original way to reconstruct the denoise signal is to use the estimated masks with direct overlap-add method ignoring the information of noise within the frames. In our system, we estimate self-adapted thresholds for each channel by Gaussian Mixture Model from the estimated ratio masks (ERMs) to separate noise and speech of each channel. In this way, we make full use of the information within frames. The results show increase in both objective and subjective evaluation. |
Author | Wei-Qiang Zhang Jia Liu Cong Guo Like Hui |
Author_xml | – sequence: 1 surname: Cong Guo fullname: Cong Guo organization: Dept. of Electron. Eng., Tsinghua Univ., Beijing, China – sequence: 2 surname: Like Hui fullname: Like Hui organization: Dept. of Electron. Eng., Tsinghua Univ., Beijing, China – sequence: 3 surname: Wei-Qiang Zhang fullname: Wei-Qiang Zhang email: wqzhang@tsinghua.edu.cn organization: Dept. of Electron. Eng., Tsinghua Univ., Beijing, China – sequence: 4 surname: Jia Liu fullname: Jia Liu organization: Dept. of Electron. Eng., Tsinghua Univ., Beijing, China |
BookMark | eNotj8tqwzAURFVoF22aL8hGP2D3Sn5qGUIfgUAL8T5cS9exwJaNJVP893VoVgPDnAPzwh7d4IixnYBYCFBvx_P551jFEkQeF2WZA8AD26qiFBkoyMo0hWdm9tyPRLrl5Fp0mnpygWN3HSYb2p7P3ror10M_zgGDHRx2HGdjwzAt3GtyxHHtFm89_12Jm02HaV35uV5T35hX9tRg52l7zw2rPt6rw1d0-v48HvanyCoIkRZJlkrVFChSyo0C0Qhdm0ZSognIqJoKUCpvcoOJ1CWiASkVUYaGhM6TDdv9ay0RXcbJ9jgtl_v15A9VI1cB |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/ISSPIT.2016.7886000 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Xplore IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Xplore url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 9781509058440 1509058443 |
EndPage | 10 |
ExternalDocumentID | 7886000 |
Genre | orig-research |
GroupedDBID | 6IE 6IL CBEJK RIE RIL |
ID | FETCH-LOGICAL-i90t-c135429f7a14e6d901f1cbdf2e3ce0ed9be70996f6da32c8aad0229ee5ade1c63 |
IEDL.DBID | RIE |
IngestDate | Thu Jun 29 18:37:44 EDT 2023 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i90t-c135429f7a14e6d901f1cbdf2e3ce0ed9be70996f6da32c8aad0229ee5ade1c63 |
PageCount | 5 |
ParticipantIDs | ieee_primary_7886000 |
PublicationCentury | 2000 |
PublicationDate | 2016-Dec. |
PublicationDateYYYYMMDD | 2016-12-01 |
PublicationDate_xml | – month: 12 year: 2016 text: 2016-Dec. |
PublicationDecade | 2010 |
PublicationTitle | 2016 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT) |
PublicationTitleAbbrev | ISSPIT |
PublicationYear | 2016 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
Score | 1.6908963 |
Snippet | Computational auditory scene analysis (CASA) system is well used in speech enhancement area in recent years. We propose a new system that combines CASA and... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 6 |
SubjectTerms | computational auditory scene analysis (CASA) deep neural network (DNN) Neural networks Signal processing algorithms Signal to noise ratio Speech Speech enhancement Time-frequency analysis Training |
Title | A speech enhancement algorithm using computational auditory scene analysis with spectral subtraction |
URI | https://ieeexplore.ieee.org/document/7886000 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NS8NAEF1qT55UWvGbPXg0aTYf2-YoYmmFSqEVeiu7s5NW1KSkyUF_vbNJrCgevIVlQ8LO4c2bnfeGsWsCKQgSCQ4EvnFCUJ6jIkWJnIzNwE8IAYXVDk8e5egpfFhEixa72WlhELFqPkPXPlZ3-SaD0pbKekTXCJ-JoO8NPL_WajVGQsKLe-PZbDqe224t6TY7f4xMqRBjeMAmX9-qG0Ve3LLQLnz8smH8788csu63No9Pd6hzxFqYdpi55dsNIqw5pmsbR_suV6-rjLj_-o3b7vYVh2qCQ1P948rqMbL8nVs_J-SqcSfhtjLLKwFmTru2pS7yWvzQZfPh_fxu5DTzE5zn2CscEIEdRpX0lQhRGgL-RIA2iY8BoIcm1tin_FAm0qjAh4FShgA9RoyUQQEyOGbtNEvxhHFibbExdsS8JCrdD1VEmVIIRH-0F2lhTlnHHtByUztkLJuzOft7-Zzt2yDVTSEXrF3kJV4StBf6qorpJ4-EqPY |
link.rule.ids | 310,311,786,790,795,796,802,27958,55109 |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NT8JAEN0QPOhJDRi_3YNHW_q50KMxElAgJNSEG9nOTsGoLSntQX-9s23FaDx4a5vdtNk5vHnT92YYuyaQAjcWYIDrKMMDaRnSl5TIiUD1nJgQ0Nbe4fFEDJ68h7k_b7CbrRcGEUvxGZr6svyXr1IodKmsQ3SN8JkI-g7hvBVUbq26lRDdd4az2XQYar2WMOu1P4amlJjR32fjr7dVUpEXs8gjEz5-NWL87-ccsPa3O49Pt7hzyBqYtJi65Zs1Iqw4JisdSb2Xy9dlSux_9ca1vn3JoZzhUNf_uNSOjDR757qjE3JZ9yfhujbLSwtmRqs2RZRnlf2hzcL-fXg3MOoJCsZzYOUG2K4eRxV3pe2hUAT9sQ2Rih10AS1UQYRdyhBFLJR0HehJqQjSA0RfKrRBuEesmaQJHjNOvC1QSg-ZF0Smu570KVfygAhQZPmRrU5YSx_QYl31yFjUZ3P69-MrtjsIx6PFaDh5PGN7OmCVROScNfOswAsC-jy6LOP7CaNOrEw |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2016+IEEE+International+Symposium+on+Signal+Processing+and+Information+Technology+%28ISSPIT%29&rft.atitle=A+speech+enhancement+algorithm+using+computational+auditory+scene+analysis+with+spectral+subtraction&rft.au=Cong+Guo&rft.au=Like+Hui&rft.au=Wei-Qiang+Zhang&rft.au=Jia+Liu&rft.date=2016-12-01&rft.pub=IEEE&rft.spage=6&rft.epage=10&rft_id=info:doi/10.1109%2FISSPIT.2016.7886000&rft.externalDocID=7886000 |