Data Augmentation and Loss Normalization for Deep Noise Suppression

Speech enhancement using neural networks is recently receiving large attention in research and being integrated in commercial devices and applications. In this work, we investigate data augmentation techniques for supervised deep learning-based speech enhancement. We show that not only augmenting SN...

Full description

Saved in:
Bibliographic Details
Published inSpeech and Computer Vol. 12335; pp. 79 - 86
Main Authors Braun, Sebastian, Tashev, Ivan
Format Book Chapter
LanguageEnglish
Published Switzerland Springer International Publishing AG 2020
Springer International Publishing
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text
ISBN3030602753
9783030602758
ISSN0302-9743
1611-3349
DOI10.1007/978-3-030-60276-5_8

Cover

Loading…
Abstract Speech enhancement using neural networks is recently receiving large attention in research and being integrated in commercial devices and applications. In this work, we investigate data augmentation techniques for supervised deep learning-based speech enhancement. We show that not only augmenting SNR values to a broader range and a continuous distribution helps to regularize training, but also augmenting the spectral and dynamic level diversity. However, to not degrade training by level augmentation, we propose a modification to signal-based loss functions by applying sequence level normalization. We show in experiments that this normalization overcomes the degradation caused by training on sequences with imbalanced signal levels, when using a level-dependent loss function.
AbstractList Speech enhancement using neural networks is recently receiving large attention in research and being integrated in commercial devices and applications. In this work, we investigate data augmentation techniques for supervised deep learning-based speech enhancement. We show that not only augmenting SNR values to a broader range and a continuous distribution helps to regularize training, but also augmenting the spectral and dynamic level diversity. However, to not degrade training by level augmentation, we propose a modification to signal-based loss functions by applying sequence level normalization. We show in experiments that this normalization overcomes the degradation caused by training on sequences with imbalanced signal levels, when using a level-dependent loss function.
Author Braun, Sebastian
Tashev, Ivan
Author_xml – sequence: 1
  givenname: Sebastian
  orcidid: 0000-0001-9060-223X
  surname: Braun
  fullname: Braun, Sebastian
  email: sebastian.braun@microsoft.com
– sequence: 2
  givenname: Ivan
  orcidid: 0000-0002-2263-2047
  surname: Tashev
  fullname: Tashev, Ivan
BookMark eNqVkMtOwzAQRQ0URFv6BWzyA4bxI7azrNrykCpYAGvLaSclpY2DnW74etyHxJrV3Lmjc6W5A9JrfIOE3DK4YwD6vtCGCgoCqAKuFc2tOSMDkYzDzs5JnynGqBCyuPg75KJH-klzWmgprsiAcRCFAS71NRnFuAZImiuuTJ9Mpq5z2Xi32mLTua72TeaaZTb3MWYvPmzdpv452pUP2RSxTXYdMXvbtW3AGNPphlxWbhNxdJpD8vEwe5880fnr4_NkPKdrrqShCg0zBmBRLHUlcwmOaZdrxiVKKfOl4WWJRjN0ildoEMCVWrNCCYd5rlAMCTvmxjbUzQqDLb3_ipaB3fdlU19W2PS5PfRjU1-J4UemDf57h7GzuIcW6dvgNotP13YYolVCaalSBLOF_BfE5Qn6Bd7Fewg
ContentType Book Chapter
Copyright Springer Nature Switzerland AG 2020
Copyright_xml – notice: Springer Nature Switzerland AG 2020
DBID FFUUA
DEWEY 006.454
DOI 10.1007/978-3-030-60276-5_8
DatabaseName ProQuest Ebook Central - Book Chapters - Demo use only
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 3030602761
9783030602765
EISSN 1611-3349
Editor Potapova, Rodmonga
Karpov, Alexey
Editor_xml – sequence: 1
  fullname: Karpov, Alexey
– sequence: 2
  fullname: Potapova, Rodmonga
EndPage 86
ExternalDocumentID EBC6367467_91_94
EBC6367424_91_94
GroupedDBID 38.
AABBV
ACGCR
AEDXK
AEHEY
AEJLV
AEJNW
AEKFX
ALMA_UNASSIGNED_HOLDINGS
APEJL
AVCSZ
AZTDL
BBABE
CYNQG
CZZ
DACMV
ESBCR
FFUUA
I4C
IEZ
OAOFD
OPOMJ
SBO
TPJZQ
TSXQS
Z5O
Z7R
Z7S
Z7U
Z7V
Z7W
Z7X
Z7Y
Z7Z
Z81
Z82
Z83
Z84
Z85
Z87
Z88
-DT
-~X
29L
2HA
2HV
ACGFS
ADCXD
EJD
F5P
LAS
LDH
P2P
RSU
~02
ID FETCH-LOGICAL-j2648-6e818800c9d7f4540a17a57124e4445d82bbe871ea62fe8e00ab771963ae556e3
ISBN 3030602753
9783030602758
ISSN 0302-9743
IngestDate Tue Jul 29 20:38:08 EDT 2025
Thu Jul 10 08:26:32 EDT 2025
Thu Jul 10 08:26:39 EDT 2025
IsPeerReviewed false
IsScholarly false
LCCallNum QA75.5-76.95
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-j2648-6e818800c9d7f4540a17a57124e4445d82bbe871ea62fe8e00ab771963ae556e3
OCLC 1203980247
ORCID 0000-0001-9060-223X
0000-0002-2263-2047
PQID EBC6367424_91_94
PageCount 8
ParticipantIDs springer_books_10_1007_978_3_030_60276_5_8
proquest_ebookcentralchapters_6367467_91_94
proquest_ebookcentralchapters_6367424_91_94
PublicationCentury 2000
PublicationDate 2020-00-00
PublicationDateYYYYMMDD 2020-01-01
PublicationDate_xml – year: 2020
  text: 2020-00-00
PublicationDecade 2020
PublicationPlace Switzerland
PublicationPlace_xml – name: Switzerland
– name: Cham
PublicationSeriesSubtitle Lecture Notes in Artificial Intelligence
PublicationSeriesTitle Lecture Notes in Computer Science
PublicationSeriesTitleAlternate Lect.Notes Computer
PublicationSubtitle 22nd International Conference, SPECOM 2020, St. Petersburg, Russia, October 7-9, 2020, Proceedings
PublicationTitle Speech and Computer
PublicationYear 2020
Publisher Springer International Publishing AG
Springer International Publishing
Publisher_xml – name: Springer International Publishing AG
– name: Springer International Publishing
RelatedPersons Hartmanis, Juris
Gao, Wen
Bertino, Elisa
Woeginger, Gerhard
Goos, Gerhard
Steffen, Bernhard
Yung, Moti
RelatedPersons_xml – sequence: 1
  givenname: Gerhard
  surname: Goos
  fullname: Goos, Gerhard
– sequence: 2
  givenname: Juris
  surname: Hartmanis
  fullname: Hartmanis, Juris
– sequence: 3
  givenname: Elisa
  surname: Bertino
  fullname: Bertino, Elisa
– sequence: 4
  givenname: Wen
  surname: Gao
  fullname: Gao, Wen
– sequence: 5
  givenname: Bernhard
  surname: Steffen
  fullname: Steffen, Bernhard
– sequence: 6
  givenname: Gerhard
  surname: Woeginger
  fullname: Woeginger, Gerhard
– sequence: 7
  givenname: Moti
  surname: Yung
  fullname: Yung, Moti
SSID ssj0002426268
ssj0002792
Score 2.1100743
Snippet Speech enhancement using neural networks is recently receiving large attention in research and being integrated in commercial devices and applications. In this...
SourceID springer
proquest
SourceType Publisher
StartPage 79
SubjectTerms Data augmentation
Deep noise suppression
Speech enhancement
Title Data Augmentation and Loss Normalization for Deep Noise Suppression
URI http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=6367424&ppg=94
http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=6367467&ppg=94
http://link.springer.com/10.1007/978-3-030-60276-5_8
Volume 12335
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnZ1LT9tAEIBXEC5VDwVKVSigPXBqZOTY-_IxhABC0BNU3FZre1y1goBIuPTXd8brTWy3h8LFilbraOLPmszMzoOxo8yhlWBcETncEQlJ0wALDVFWqapEj0vldS-962_q4lZc3sm71ZzPurpkkR8Xv_9ZV_IWqriGXKlK9hVkl1-KC_gZ-eIVCeO1Z_x2w6y-huMJoPB1aWE0Q5v_qVu44fjlx0NTXOSTjq8eSbGRnXrfFGDWeYanAE-4_HMOQxrz6XNjZ-2IQBL3IgIhItiLKbbCWuPzjheZkt9Ax5emoxaT1DcS-UvJtvMq8NaI7lWRtGb1nxLO0f0A415H6-nJRKUKfXJhs5HNxDpb10YM2MZ4enn1fRkhI-MhUYYKcoKAqW-ZtBJ42UfKtwruydPxGnoH3bX9cLPJ3lNNCadiDxRxi63BbJt9CNh4o2A_sglR421qHKlxosY71DhS40SN19R4i9oOuz2b3kwuombMRfSL0gsjBYaa4sVFVuqKGiK6kXZSo-EFQghZmiTPAf1acCqpwEAcu1xr0pwOpFSQfmKD2eMMPjNO3fd0XKJbWSqRi9TIKpNZgTa-K0YG9C4bhgdi68P4JgO48D9_bjtc_m-30mH31_CELW2e29ARG8nY1CIZW5OxSGbvVYJ8Ye9W7_k-GyyeX-AAbcFFfti8NH8ARBdbIg
linkProvider Library Specific Holdings
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Speech+and+Computer&rft.atitle=Data+Augmentation+and+Loss+Normalization+for+Deep+Noise+Suppression&rft.date=2020-01-01&rft.pub=Springer+International+Publishing+AG&rft.isbn=9783030602758&rft.volume=12335&rft_id=info:doi/10.1007%2F978-3-030-60276-5_8&rft.externalDBID=94&rft.externalDocID=EBC6367424_91_94
thumbnail_s http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F6367424-l.jpg
http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F6367467-l.jpg