Data Augmentation and Loss Normalization for Deep Noise Suppression
Speech enhancement using neural networks is recently receiving large attention in research and being integrated in commercial devices and applications. In this work, we investigate data augmentation techniques for supervised deep learning-based speech enhancement. We show that not only augmenting SN...
Saved in:
Published in | Speech and Computer Vol. 12335; pp. 79 - 86 |
---|---|
Main Authors | , |
Format | Book Chapter |
Language | English |
Published |
Switzerland
Springer International Publishing AG
2020
Springer International Publishing |
Series | Lecture Notes in Computer Science |
Subjects | |
Online Access | Get full text |
ISBN | 3030602753 9783030602758 |
ISSN | 0302-9743 1611-3349 |
DOI | 10.1007/978-3-030-60276-5_8 |
Cover
Loading…
Abstract | Speech enhancement using neural networks is recently receiving large attention in research and being integrated in commercial devices and applications. In this work, we investigate data augmentation techniques for supervised deep learning-based speech enhancement. We show that not only augmenting SNR values to a broader range and a continuous distribution helps to regularize training, but also augmenting the spectral and dynamic level diversity. However, to not degrade training by level augmentation, we propose a modification to signal-based loss functions by applying sequence level normalization. We show in experiments that this normalization overcomes the degradation caused by training on sequences with imbalanced signal levels, when using a level-dependent loss function. |
---|---|
AbstractList | Speech enhancement using neural networks is recently receiving large attention in research and being integrated in commercial devices and applications. In this work, we investigate data augmentation techniques for supervised deep learning-based speech enhancement. We show that not only augmenting SNR values to a broader range and a continuous distribution helps to regularize training, but also augmenting the spectral and dynamic level diversity. However, to not degrade training by level augmentation, we propose a modification to signal-based loss functions by applying sequence level normalization. We show in experiments that this normalization overcomes the degradation caused by training on sequences with imbalanced signal levels, when using a level-dependent loss function. |
Author | Braun, Sebastian Tashev, Ivan |
Author_xml | – sequence: 1 givenname: Sebastian orcidid: 0000-0001-9060-223X surname: Braun fullname: Braun, Sebastian email: sebastian.braun@microsoft.com – sequence: 2 givenname: Ivan orcidid: 0000-0002-2263-2047 surname: Tashev fullname: Tashev, Ivan |
BookMark | eNqVkMtOwzAQRQ0URFv6BWzyA4bxI7azrNrykCpYAGvLaSclpY2DnW74etyHxJrV3Lmjc6W5A9JrfIOE3DK4YwD6vtCGCgoCqAKuFc2tOSMDkYzDzs5JnynGqBCyuPg75KJH-klzWmgprsiAcRCFAS71NRnFuAZImiuuTJ9Mpq5z2Xi32mLTua72TeaaZTb3MWYvPmzdpv452pUP2RSxTXYdMXvbtW3AGNPphlxWbhNxdJpD8vEwe5880fnr4_NkPKdrrqShCg0zBmBRLHUlcwmOaZdrxiVKKfOl4WWJRjN0ildoEMCVWrNCCYd5rlAMCTvmxjbUzQqDLb3_ipaB3fdlU19W2PS5PfRjU1-J4UemDf57h7GzuIcW6dvgNotP13YYolVCaalSBLOF_BfE5Qn6Bd7Fewg |
ContentType | Book Chapter |
Copyright | Springer Nature Switzerland AG 2020 |
Copyright_xml | – notice: Springer Nature Switzerland AG 2020 |
DBID | FFUUA |
DEWEY | 006.454 |
DOI | 10.1007/978-3-030-60276-5_8 |
DatabaseName | ProQuest Ebook Central - Book Chapters - Demo use only |
DatabaseTitleList | |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Computer Science |
EISBN | 3030602761 9783030602765 |
EISSN | 1611-3349 |
Editor | Potapova, Rodmonga Karpov, Alexey |
Editor_xml | – sequence: 1 fullname: Karpov, Alexey – sequence: 2 fullname: Potapova, Rodmonga |
EndPage | 86 |
ExternalDocumentID | EBC6367467_91_94 EBC6367424_91_94 |
GroupedDBID | 38. AABBV ACGCR AEDXK AEHEY AEJLV AEJNW AEKFX ALMA_UNASSIGNED_HOLDINGS APEJL AVCSZ AZTDL BBABE CYNQG CZZ DACMV ESBCR FFUUA I4C IEZ OAOFD OPOMJ SBO TPJZQ TSXQS Z5O Z7R Z7S Z7U Z7V Z7W Z7X Z7Y Z7Z Z81 Z82 Z83 Z84 Z85 Z87 Z88 -DT -~X 29L 2HA 2HV ACGFS ADCXD EJD F5P LAS LDH P2P RSU ~02 |
ID | FETCH-LOGICAL-j2648-6e818800c9d7f4540a17a57124e4445d82bbe871ea62fe8e00ab771963ae556e3 |
ISBN | 3030602753 9783030602758 |
ISSN | 0302-9743 |
IngestDate | Tue Jul 29 20:38:08 EDT 2025 Thu Jul 10 08:26:32 EDT 2025 Thu Jul 10 08:26:39 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | false |
LCCallNum | QA75.5-76.95 |
Language | English |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-j2648-6e818800c9d7f4540a17a57124e4445d82bbe871ea62fe8e00ab771963ae556e3 |
OCLC | 1203980247 |
ORCID | 0000-0001-9060-223X 0000-0002-2263-2047 |
PQID | EBC6367424_91_94 |
PageCount | 8 |
ParticipantIDs | springer_books_10_1007_978_3_030_60276_5_8 proquest_ebookcentralchapters_6367467_91_94 proquest_ebookcentralchapters_6367424_91_94 |
PublicationCentury | 2000 |
PublicationDate | 2020-00-00 |
PublicationDateYYYYMMDD | 2020-01-01 |
PublicationDate_xml | – year: 2020 text: 2020-00-00 |
PublicationDecade | 2020 |
PublicationPlace | Switzerland |
PublicationPlace_xml | – name: Switzerland – name: Cham |
PublicationSeriesSubtitle | Lecture Notes in Artificial Intelligence |
PublicationSeriesTitle | Lecture Notes in Computer Science |
PublicationSeriesTitleAlternate | Lect.Notes Computer |
PublicationSubtitle | 22nd International Conference, SPECOM 2020, St. Petersburg, Russia, October 7-9, 2020, Proceedings |
PublicationTitle | Speech and Computer |
PublicationYear | 2020 |
Publisher | Springer International Publishing AG Springer International Publishing |
Publisher_xml | – name: Springer International Publishing AG – name: Springer International Publishing |
RelatedPersons | Hartmanis, Juris Gao, Wen Bertino, Elisa Woeginger, Gerhard Goos, Gerhard Steffen, Bernhard Yung, Moti |
RelatedPersons_xml | – sequence: 1 givenname: Gerhard surname: Goos fullname: Goos, Gerhard – sequence: 2 givenname: Juris surname: Hartmanis fullname: Hartmanis, Juris – sequence: 3 givenname: Elisa surname: Bertino fullname: Bertino, Elisa – sequence: 4 givenname: Wen surname: Gao fullname: Gao, Wen – sequence: 5 givenname: Bernhard surname: Steffen fullname: Steffen, Bernhard – sequence: 6 givenname: Gerhard surname: Woeginger fullname: Woeginger, Gerhard – sequence: 7 givenname: Moti surname: Yung fullname: Yung, Moti |
SSID | ssj0002426268 ssj0002792 |
Score | 2.1100743 |
Snippet | Speech enhancement using neural networks is recently receiving large attention in research and being integrated in commercial devices and applications. In this... |
SourceID | springer proquest |
SourceType | Publisher |
StartPage | 79 |
SubjectTerms | Data augmentation Deep noise suppression Speech enhancement |
Title | Data Augmentation and Loss Normalization for Deep Noise Suppression |
URI | http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=6367424&ppg=94 http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=6367467&ppg=94 http://link.springer.com/10.1007/978-3-030-60276-5_8 |
Volume | 12335 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnZ1LT9tAEIBXEC5VDwVKVSigPXBqZOTY-_IxhABC0BNU3FZre1y1goBIuPTXd8brTWy3h8LFilbraOLPmszMzoOxo8yhlWBcETncEQlJ0wALDVFWqapEj0vldS-962_q4lZc3sm71ZzPurpkkR8Xv_9ZV_IWqriGXKlK9hVkl1-KC_gZ-eIVCeO1Z_x2w6y-huMJoPB1aWE0Q5v_qVu44fjlx0NTXOSTjq8eSbGRnXrfFGDWeYanAE-4_HMOQxrz6XNjZ-2IQBL3IgIhItiLKbbCWuPzjheZkt9Ax5emoxaT1DcS-UvJtvMq8NaI7lWRtGb1nxLO0f0A415H6-nJRKUKfXJhs5HNxDpb10YM2MZ4enn1fRkhI-MhUYYKcoKAqW-ZtBJ42UfKtwruydPxGnoH3bX9cLPJ3lNNCadiDxRxi63BbJt9CNh4o2A_sglR421qHKlxosY71DhS40SN19R4i9oOuz2b3kwuombMRfSL0gsjBYaa4sVFVuqKGiK6kXZSo-EFQghZmiTPAf1acCqpwEAcu1xr0pwOpFSQfmKD2eMMPjNO3fd0XKJbWSqRi9TIKpNZgTa-K0YG9C4bhgdi68P4JgO48D9_bjtc_m-30mH31_CELW2e29ARG8nY1CIZW5OxSGbvVYJ8Ye9W7_k-GyyeX-AAbcFFfti8NH8ARBdbIg |
linkProvider | Library Specific Holdings |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Speech+and+Computer&rft.atitle=Data+Augmentation+and+Loss+Normalization+for+Deep+Noise+Suppression&rft.date=2020-01-01&rft.pub=Springer+International+Publishing+AG&rft.isbn=9783030602758&rft.volume=12335&rft_id=info:doi/10.1007%2F978-3-030-60276-5_8&rft.externalDBID=94&rft.externalDocID=EBC6367424_91_94 |
thumbnail_s | http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F6367424-l.jpg http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F6367467-l.jpg |