An adaptive autoregressive pre-whitener for speech and acoustic signals based on parametric NMF

A common assumption in many speech and acoustic processing methods is that the noise is white and Gaussian (WGN). Although making this assumption results in simple and computationally attractive methods, the assumption is often too simple and crude in many applications. In this paper, we introduce a...

Full description

Saved in:
Bibliographic Details
Published inSpeech communication Vol. 151; pp. 9 - 23
Main Authors Jaramillo, Alfredo Esquivel, Nielsen, Jesper Kjær, Christensen, Mads Græsbøll
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.06.2023
Subjects
Online AccessGet full text

Cover

Loading…
Abstract A common assumption in many speech and acoustic processing methods is that the noise is white and Gaussian (WGN). Although making this assumption results in simple and computationally attractive methods, the assumption is often too simple and crude in many applications. In this paper, we introduce a general purpose and online pre-whitener which can be used as a pre-processor with methods based on the WGN assumption, improving their reliability and performance in applications with colored noise. The pre-whitener is a time-varying filter whose coefficients are found using a parametric non-negative matrix factorization (NMF), based on autoregressive (AR) mixture modeling of both the noise component and the signal component constituting the noisy signal. Compared to other types of pre-whiteners, we show that the proposed pre-whitener has the best performance, especially in applications with non-stationary noise. We also perform a large number of experiments to quantify the benefits of using a pre-whitener as a pre-processor for methods based on the WGN-assumption. The applications of interest were pitch estimation and time-of-arrival (TOA) estimation, where the WGN assumption is very popular. •A pre-processing scheme which renders noise closer to white is introduced.•The introduced pre-whitener simplifies the computations of parameter estimation in colored noise.•Pre-whitening based on parametric NMF offers benefit in nonstationary scenarios.•Pre-whitening is preferred over enhancement as a pre-processor for parametric pitch estimation.•Time of arrival estimation accuracy gets also benefit from pre-whitening.
AbstractList A common assumption in many speech and acoustic processing methods is that the noise is white and Gaussian (WGN). Although making this assumption results in simple and computationally attractive methods, the assumption is often too simple and crude in many applications. In this paper, we introduce a general purpose and online pre-whitener which can be used as a pre-processor with methods based on the WGN assumption, improving their reliability and performance in applications with colored noise. The pre-whitener is a time-varying filter whose coefficients are found using a parametric non-negative matrix factorization (NMF), based on autoregressive (AR) mixture modeling of both the noise component and the signal component constituting the noisy signal. Compared to other types of pre-whiteners, we show that the proposed pre-whitener has the best performance, especially in applications with non-stationary noise. We also perform a large number of experiments to quantify the benefits of using a pre-whitener as a pre-processor for methods based on the WGN-assumption. The applications of interest were pitch estimation and time-of-arrival (TOA) estimation, where the WGN assumption is very popular. •A pre-processing scheme which renders noise closer to white is introduced.•The introduced pre-whitener simplifies the computations of parameter estimation in colored noise.•Pre-whitening based on parametric NMF offers benefit in nonstationary scenarios.•Pre-whitening is preferred over enhancement as a pre-processor for parametric pitch estimation.•Time of arrival estimation accuracy gets also benefit from pre-whitening.
Author Christensen, Mads Græsbøll
Jaramillo, Alfredo Esquivel
Nielsen, Jesper Kjær
Author_xml – sequence: 1
  givenname: Alfredo Esquivel
  orcidid: 0000-0002-8994-9479
  surname: Jaramillo
  fullname: Jaramillo, Alfredo Esquivel
  email: a.e.jaramillo@sheffield.ac.uk
  organization: The University Of Sheffield, Speech and Hearing Group, Department of Computer Science, United Kingdom
– sequence: 2
  givenname: Jesper Kjær
  surname: Nielsen
  fullname: Nielsen, Jesper Kjær
  email: jesperkn.research@gmail.com
  organization: Aalborg University, Audio Analysis Lab, Department of Electronic Systems, Denmark
– sequence: 3
  givenname: Mads Græsbøll
  surname: Christensen
  fullname: Christensen, Mads Græsbøll
  email: mgc@es.aau.dk
  organization: Aalborg University, Audio Analysis Lab, Department of Electronic Systems, Denmark
BookMark eNp9UMtOwzAQtFCRaAt_wME_kOBHcJwLUlVRQCpwgbPlOOvWFbUj2y3i73FVzpxGq9mZ2Z0ZmvjgAaFbSmpKqLjb1WkEE_Y1I4zXpKkJYRdoSmXLqpZKNkHTstZWgnf8Cs1S2hFCGinZFKmFx3rQY3ZHwPqQQ4RNhJRO4xih-t66DB4itiHikgJmi7UfsDbhkLIzOLmN118J9zrBgIPHo456DzkW7u11dY0ubaHh5g_n6HP1-LF8rtbvTy_LxboynNBcDcAYF70BI5i0zIimLz-ANoJ2bScG0RDJmxYk74i11DLLewumgNb2nks-R83Z18SQUgSrxuj2Ov4oStSpJLVT55LUqSRFGlUCiuzhLINy29FBVMk48AYGF8FkNQT3v8EvsMF1og
Cites_doi 10.1109/97.736233
10.1109/ICASSP40776.2020.9053018
10.1016/j.apacoust.2020.107236
10.1109/ICASSP.2018.8461683
10.1109/97.1001645
10.23919/EUSIPCO.2018.8553039
10.1109/10.4597
10.23919/EUSIPCO.2019.8902763
10.1016/0165-1684(83)90022-1
10.1561/0100000006
10.1109/48.972119
10.1109/TGRS.2005.856633
10.1121/1.1910339
10.1109/TSA.2005.854113
10.1016/j.sigpro.2017.01.011
10.1109/TASLP.2017.2775800
10.1121/1.399408
10.1109/TASL.2013.2265085
10.1121/1.2951592
10.1155/2007/92953
10.1121/1.4837238
10.1109/TCOM.1980.1094577
10.1016/S0167-6393(98)00041-7
10.1109/ACSSC.2007.4487291
10.1109/ICASSP.2002.5743722
10.1016/j.sigpro.2020.107860
10.21437/ICSLP.2000-743
10.1109/TASLP.2016.2636445
10.1109/ICASSP40776.2020.9053746
10.1109/TASLP.2013.2295918
10.1109/89.928915
10.1109/TASL.2006.881696
10.1186/s13638-017-0983-3
10.1016/0167-6393(93)90095-3
10.1162/NECO_a_00168
10.1109/29.1552
10.1016/j.apacoust.2011.07.004
10.1049/el.2009.1977
10.1162/neco.2008.04-08-771
10.1109/WASPAA.2019.8937252
10.1109/TASL.2011.2180896
10.1186/1687-6180-2012-111
10.1109/ICASSP.2019.8683653
10.1109/TASLP.2019.2930917
10.1109/TASLP.2016.2608948
10.1093/biomet/78.1.65
10.1109/MSP.2004.1311138
10.1109/IWAENC.2014.6953334
ContentType Journal Article
Copyright 2023 The Author(s)
Copyright_xml – notice: 2023 The Author(s)
DBID 6I.
AAFTH
AAYXX
CITATION
DOI 10.1016/j.specom.2023.04.002
DatabaseName ScienceDirect Open Access Titles
Elsevier:ScienceDirect:Open Access
CrossRef
DatabaseTitle CrossRef
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Languages & Literatures
Social Welfare & Social Work
Psychology
EISSN 1872-7182
EndPage 23
ExternalDocumentID 10_1016_j_specom_2023_04_002
S0167639323000572
GrantInformation_xml – fundername: National Council of Science and Technology (CONACYT)
  grantid: 418437
GroupedDBID --K
--M
-~X
.DC
.~1
07C
0R~
123
1B1
1~.
1~5
4.4
457
4G.
53G
5VS
6I.
7-5
71M
8P~
9JN
9JO
AACTN
AADFP
AAEDT
AAEDW
AAFJI
AAFTH
AAGJA
AAGJQ
AAGUQ
AAIAV
AAIKJ
AAKOC
AALRI
AAOAW
AAQFI
AAQXK
AAXUO
AAYFN
ABBOA
ABFNM
ABIVO
ABJNI
ABMAC
ABMMH
ABOYX
ABXDB
ABYKQ
ACDAQ
ACGFS
ACNNM
ACRLP
ACXNI
ACZNC
ADBBV
ADEZE
ADIYS
ADJOM
ADMUD
ADTZH
AEBSH
AECPX
AEKER
AENEX
AFKWA
AFTJW
AFYLN
AGHFR
AGUBO
AGYEJ
AHHHB
AHJVU
AHZHX
AIALX
AIEXJ
AIKHN
AITUG
AJBFU
AJOXV
AKYCK
ALMA_UNASSIGNED_HOLDINGS
AMFUW
AMRAJ
AOMHK
AOUOD
ASPBG
AVARZ
AVWKF
AXJTR
AZFZN
BJAXD
BKOJK
BLXMC
CS3
DU5
EBS
EFJIC
EFLBG
EJD
EO8
EO9
EP2
EP3
F0J
F5P
FDB
FEDTE
FGOYB
FIRID
FNPLU
FYGXN
G-Q
G8K
GBLVA
GBOLZ
HLZ
HVGLF
HZ~
IHE
J1W
JJJVA
KOM
LG9
M41
MO0
N9A
O-L
O9-
OAUVE
OKEIE
OZT
P-8
P-9
P2P
PC.
PQQKQ
PRBVW
Q38
R2-
RIG
ROL
RPZ
SBC
SDF
SDG
SDP
SES
SEW
SPC
SPCBC
SSB
SSO
SST
SSV
SSY
SSZ
T5K
WUQ
XFK
XJE
~G-
AAXKI
AAYXX
AFJKZ
AKRWK
CITATION
ID FETCH-LOGICAL-c301t-de2236bcec628f2c64b002eac619796d6408347e8390ff1f2f3bfec2f3aaf5383
IEDL.DBID .~1
ISSN 0167-6393
IngestDate Thu Sep 26 15:18:13 EDT 2024
Fri Feb 23 02:37:03 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Keywords Pitch
Colored
Pre-whitening
Enhancement
NMF
TOA
Language English
License This is an open access article under the CC BY license.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c301t-de2236bcec628f2c64b002eac619796d6408347e8390ff1f2f3bfec2f3aaf5383
ORCID 0000-0002-8994-9479
OpenAccessLink https://www.sciencedirect.com/science/article/pii/S0167639323000572
PageCount 15
ParticipantIDs crossref_primary_10_1016_j_specom_2023_04_002
elsevier_sciencedirect_doi_10_1016_j_specom_2023_04_002
PublicationCentury 2000
PublicationDate June 2023
2023-06-00
PublicationDateYYYYMMDD 2023-06-01
PublicationDate_xml – month: 06
  year: 2023
  text: June 2023
PublicationDecade 2020
PublicationTitle Speech communication
PublicationYear 2023
Publisher Elsevier B.V
Publisher_xml – name: Elsevier B.V
References Madhu (b37) 2009; 45
Birch, Lawrence, Lind, Hare (b3) 1988; 35
Plante, Meyer, Ainsworth (b46) 1995
Févotte, Bertin, Durrieu (b13) 2009; 21
Cohen (b9) 2002; 9
Bao, Dou, Jia, Bao (b2) 2014
Feder, Weinstein (b12) 1988; 36
Emiya, V., Badeau, R., David, B., 2007. Multipitch estimation of quasi-harmonic sounds in colored noise. In: 10th Int. Conf. on Digital Audio Effects (DAFx-07). p. 1,5.
Stoica, Selen (b56) 2004; 21
Kominek, J., Black, A.W., 2004. The CMU Arctic speech databases. In: Fifth ISCA Workshop on Speech Synthesis.
Shi, Nielsen, Jensen, Little, Christensen (b51) 2019; 27
Christensen, Jakobsson (b7) 2009
Trucco (b62) 2001; 26
Christensen (b6) 2013; 21
Févotte, Idier (b14) 2011; 23
Zhao, Y., Hu, R., Nakamura, S., 2003. Whitening processing for blind separation of speech signals. In: Proc. ICABSS. pp. 331–336.
Rosenkranz, T., 2010. Noise codebook adaptation for codebook-based noise reduction. In: Proceedings of International Workshop on Acoustic Echo and Noise Control (IWAENC). Tel Aviv.
Varga, Steeneken (b63) 1993; 12
Stoica, Moses (b55) 2005
Strauss, Mordel, Miguet, Deleforge (b57) 2018
Srinivasan, Samuelsson, Kleijn (b53) 2006; 14
Martin (b38) 2001; 9
Hirsch, H., Pearce, D., 2000. The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: ASR2000-Automatic Speech Recognition: Challenges for the New Millenium ISCA Tutorial and Research Workshop. ITRW.
Jakobsson, Mossberg, Rowe, Smith (b26) 2005; 43
Linde, Buzo, Gray (b36) 1980; 28
Zou, Y., Liu, H., 2020. A Simple and Efficient Iterative Method for TOA Localization. In: IEEE International Conference on Acoustics, Speech and Signal Processing. ICASSP, pp. 4881–4884.
Gonzalez, Brookes (b17) 2014; 22
Kavalekalam, M.S., Nielsen, J.K., Shi, L., Christensen, M.G., Boldt, J., 2018. Online Parametric NMF for Speech Enhancement. In: 2018 26th European Signal Processing Conference. EUSIPCO, pp. 2320–2324.
Okamoto, Iwaya, Suzuki (b45) 2012; 73
Yoshii, Goto (b64) 2012
Blanco, Nájar (b4) 2012
Swärd, Li, Jakobsson (b59) 2017; 26
Gerkmann, Hendriks (b16) 2012; 20
Itakura, F., 1968. Analysis synthesis telephony based on the maximum likelihood method. In: The 6th International Congress on Acoustics, 1968. pp. 280–292.
Jaramillo, Nielsen, Christensen (b28) 2018
Srinivasan, Samuelsson, Kleijn (b54) 2007; 15
Sohn, Kim, Sung (b52) 1999; 6
Jensen, J.R., Saqib, U., Gannot, S., 2019. An EM Method for Multichannel TOA and DOA Estimation of Acoustic Echoes. In: IEEE Workshop on Applications of Signal Processing To Audio and Acoustics. WASPAA, pp. 120–124.
Therrien (b61) 1992
Nielsen, Jensen, Jensen, Christensen, Jensen (b41) 2017; 135
Quinn, B.G., 2007. Efficient estimation of the parameters in a sum of complex sinusoids in complex autoregressive noise. In: Conference Record of the Forty-First Asilomar Conference on Signals, Systems and Computers. pp. 636–640.
Sun, X., 2002. Pitch determination and voice quality analysis using Subharmonic-to-Harmonic Ratio. In: IEEE International Conference on Acoustics, Speech and Signal Processing. ICASSP, vol. 1, pp. I–333–I–336.
Quinn, Thomson (b49) 1991; 78
Jaramillo, A.E., Nielsen, J.K., Christensen, M.G., 2019a. Adaptive Pre-whitening Based on Parametric NMF. In: 2019 27th European Signal Processing Conference. EUSIPCO, pp. 1–5.
Hansen, Jensen (b19) 2007; 2007
Nielsen, J.K., Kavalekalam, M.S., Christensen, M.G., Boldt, J.B., 2018. Model-based noise PSD estimation from speech in non-stationary noise. In: IEEE International Conference on Acoustics, Speech and Signal Processing. ICASSP.
Huang, Zhao (b24) 1998; 26
Talkin (b60) 1995; 495
Quinn, Nielsen, Christensen (b48) 2021; 180
Dou, Shi, Lin, Li (b10) 2017; 2017
Hilkhuysen, Gaubitch, Brookes, Huckvale (b21) 2014; 135
Ney (b39) 1983; 5
Févotte, Vincent, Ozerov (b15) 2018
Gray (b18) 2006; 2
Jaramillo, A.E., Jakobsson, A., Nielsen, J.K., Christensen, M.G., 2020. Robust fundamental frequency estimation in coloured noise. In: IEEE International Conference on Acoustics, Speech and Signal Processing. ICASSP, pp. 741–745.
Nielsen, J.K., Jensen, J.R., Jensen, S.H., Christensen, M.G., 2014. The single- and multichannel audio recordings database (SMARD). In: 14th International Workshop on Acoustic Signal Enhancement. IWAENC, pp. 40–44.
Noll (b43) 1967; 41
Nørholm, Jensen, Christensen (b44) 2016; 24
He, Bao, Bao (b20) 2017; 25
Kay, Salisbury (b34) 1990; 87
Camacho, Harris (b5) 2008; 124
Jaramillo, Nielsen, Christensen (b31) 2021
Jaramillo, A.E., Nielsen, J.K., Christensen, M.G., 2019b. A Study on How Pre-whitening Influences Fundamental Frequency Estimation. In: IEEE, ICASSP International Conference on Acoustics, Speech and Signal Processing. (ISSN: 1520-6149) pp. 6495–6499.
Chu, W., Alwan, A., 2009. Reducing F0 Frame Error of F0 tracking algorithms under noisy conditions with an unvoiced/voiced classification frontend. In: IEEE International Conference on Acoustics, Speech and Signal Processing. ICASSP, (ISSN: 2379-190X) pp. 3969–3972.
Huang, Bao, Wang, Xiang (b23) 2020; 163
Al-Aboosi, Sha’ameri (b1) 2017; 123
Varga (10.1016/j.specom.2023.04.002_b63) 1993; 12
Camacho (10.1016/j.specom.2023.04.002_b5) 2008; 124
Noll (10.1016/j.specom.2023.04.002_b43) 1967; 41
Christensen (10.1016/j.specom.2023.04.002_b7) 2009
Martin (10.1016/j.specom.2023.04.002_b38) 2001; 9
Jaramillo (10.1016/j.specom.2023.04.002_b28) 2018
Cohen (10.1016/j.specom.2023.04.002_b9) 2002; 9
10.1016/j.specom.2023.04.002_b47
Birch (10.1016/j.specom.2023.04.002_b3) 1988; 35
He (10.1016/j.specom.2023.04.002_b20) 2017; 25
10.1016/j.specom.2023.04.002_b42
10.1016/j.specom.2023.04.002_b40
Huang (10.1016/j.specom.2023.04.002_b23) 2020; 163
Stoica (10.1016/j.specom.2023.04.002_b55) 2005
Gray (10.1016/j.specom.2023.04.002_b18) 2006; 2
Ney (10.1016/j.specom.2023.04.002_b39) 1983; 5
Gerkmann (10.1016/j.specom.2023.04.002_b16) 2012; 20
Dou (10.1016/j.specom.2023.04.002_b10) 2017; 2017
Févotte (10.1016/j.specom.2023.04.002_b15) 2018
Christensen (10.1016/j.specom.2023.04.002_b6) 2013; 21
Srinivasan (10.1016/j.specom.2023.04.002_b54) 2007; 15
Nielsen (10.1016/j.specom.2023.04.002_b41) 2017; 135
10.1016/j.specom.2023.04.002_b58
Kay (10.1016/j.specom.2023.04.002_b34) 1990; 87
10.1016/j.specom.2023.04.002_b8
10.1016/j.specom.2023.04.002_b50
10.1016/j.specom.2023.04.002_b11
Hansen (10.1016/j.specom.2023.04.002_b19) 2007; 2007
Srinivasan (10.1016/j.specom.2023.04.002_b53) 2006; 14
Huang (10.1016/j.specom.2023.04.002_b24) 1998; 26
Bao (10.1016/j.specom.2023.04.002_b2) 2014
10.1016/j.specom.2023.04.002_b27
10.1016/j.specom.2023.04.002_b25
Févotte (10.1016/j.specom.2023.04.002_b14) 2011; 23
10.1016/j.specom.2023.04.002_b29
Al-Aboosi (10.1016/j.specom.2023.04.002_b1) 2017; 123
Jakobsson (10.1016/j.specom.2023.04.002_b26) 2005; 43
Therrien (10.1016/j.specom.2023.04.002_b61) 1992
Shi (10.1016/j.specom.2023.04.002_b51) 2019; 27
Talkin (10.1016/j.specom.2023.04.002_b60) 1995; 495
Sohn (10.1016/j.specom.2023.04.002_b52) 1999; 6
10.1016/j.specom.2023.04.002_b65
10.1016/j.specom.2023.04.002_b22
10.1016/j.specom.2023.04.002_b66
Swärd (10.1016/j.specom.2023.04.002_b59) 2017; 26
Févotte (10.1016/j.specom.2023.04.002_b13) 2009; 21
Nørholm (10.1016/j.specom.2023.04.002_b44) 2016; 24
Gonzalez (10.1016/j.specom.2023.04.002_b17) 2014; 22
Feder (10.1016/j.specom.2023.04.002_b12) 1988; 36
Trucco (10.1016/j.specom.2023.04.002_b62) 2001; 26
Jaramillo (10.1016/j.specom.2023.04.002_b31) 2021
Linde (10.1016/j.specom.2023.04.002_b36) 1980; 28
Hilkhuysen (10.1016/j.specom.2023.04.002_b21) 2014; 135
Stoica (10.1016/j.specom.2023.04.002_b56) 2004; 21
Plante (10.1016/j.specom.2023.04.002_b46) 1995
Quinn (10.1016/j.specom.2023.04.002_b48) 2021; 180
Okamoto (10.1016/j.specom.2023.04.002_b45) 2012; 73
Blanco (10.1016/j.specom.2023.04.002_b4) 2012
Strauss (10.1016/j.specom.2023.04.002_b57) 2018
10.1016/j.specom.2023.04.002_b30
Yoshii (10.1016/j.specom.2023.04.002_b64) 2012
Quinn (10.1016/j.specom.2023.04.002_b49) 1991; 78
10.1016/j.specom.2023.04.002_b35
10.1016/j.specom.2023.04.002_b32
10.1016/j.specom.2023.04.002_b33
Madhu (10.1016/j.specom.2023.04.002_b37) 2009; 45
References_xml – volume: 6
  start-page: 1
  year: 1999
  end-page: 3
  ident: b52
  article-title: A statistical model-based voice activity detection
  publication-title: IEEE Signal Process. Lett.
  contributor:
    fullname: Sung
– start-page: 1
  year: 2018
  end-page: 8
  ident: b57
  article-title: Dregon: Dataset and methods for uav-embedded sound source localization
  publication-title: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems
  contributor:
    fullname: Deleforge
– volume: 5
  start-page: 163
  year: 1983
  end-page: 173
  ident: b39
  article-title: A dynamic programming algorithm for nonlinear smoothing
  publication-title: Signal Process.
  contributor:
    fullname: Ney
– volume: 15
  start-page: 441
  year: 2007
  end-page: 452
  ident: b54
  article-title: Codebook-based Bayesian speech enhancement for nonstationary environments
  publication-title: IEEE Trans. Audio Speech Lang. Process.
  contributor:
    fullname: Kleijn
– volume: 135
  start-page: 188
  year: 2017
  end-page: 197
  ident: b41
  article-title: Fast fundamental frequency estimation: Making a statistically efficient estimator computationally efficient
  publication-title: Signal Process.
  contributor:
    fullname: Jensen
– volume: 26
  start-page: 296
  year: 2017
  end-page: 303
  ident: b59
  article-title: Off-grid fundamental frequency estimation
  publication-title: IEEE/ACM Trans. Audio Speech Lang. Process.
  contributor:
    fullname: Jakobsson
– year: 2009
  ident: b7
  publication-title: Multi-Pitch Estimation
  contributor:
    fullname: Jakobsson
– volume: 73
  start-page: 50
  year: 2012
  end-page: 55
  ident: b45
  article-title: Wide-band dereverberation method based on multichannel linear prediction using prewhitening filter
  publication-title: Appl. Acoust.
  contributor:
    fullname: Suzuki
– start-page: 79
  year: 2012
  end-page: 84
  ident: b64
  article-title: Infinite composite autoregressive models for music signal analysis.
  publication-title: ISMIR
  contributor:
    fullname: Goto
– volume: 2
  start-page: 155
  year: 2006
  end-page: 239
  ident: b18
  article-title: Toeplitz and circulant matrices: A review
  publication-title: Found. Trends® Commun. Inf. Theory
  contributor:
    fullname: Gray
– start-page: 2325
  year: 2018
  end-page: 2329
  ident: b28
  article-title: On optimal filtering for speech decomposition
  publication-title: 2018 26th European Signal Processing Conference
  contributor:
    fullname: Christensen
– volume: 28
  start-page: 84
  year: 1980
  end-page: 95
  ident: b36
  article-title: An algorithm for vector quantizer design
  publication-title: IEEE Trans. Commun.
  contributor:
    fullname: Gray
– year: 1992
  ident: b61
  article-title: Discrete Random Signals and Statistical Signal Processing
  contributor:
    fullname: Therrien
– volume: 2017
  start-page: 1
  year: 2017
  end-page: 11
  ident: b10
  article-title: Modeling of non-Gaussian colored noise and application in CR multi-sensor networks
  publication-title: EURASIP J. Wireless Commun. Networking
  contributor:
    fullname: Li
– volume: 9
  start-page: 504
  year: 2001
  end-page: 512
  ident: b38
  article-title: Noise power spectral density estimation based on optimal smoothing and minimum statistics
  publication-title: IEEE Trans. Speech Audio Process.
  contributor:
    fullname: Martin
– volume: 35
  start-page: 640
  year: 1988
  end-page: 645
  ident: b3
  article-title: Application of prewhitening to AR spectral estimation of EEG
  publication-title: IEEE Trans. Biomed. Eng.
  contributor:
    fullname: Hare
– volume: 24
  start-page: 2354
  year: 2016
  end-page: 2367
  ident: b44
  article-title: Instantaneous fundamental frequency estimation with optimal segmentation for nonstationary voiced speech
  publication-title: IEEE/ACM Trans. Audio Speech Lang. Process.
  contributor:
    fullname: Christensen
– volume: 9
  start-page: 113
  year: 2002
  end-page: 116
  ident: b9
  article-title: Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude estimator
  publication-title: IEEE Signal Process. Lett.
  contributor:
    fullname: Cohen
– volume: 26
  start-page: 165
  year: 1998
  end-page: 181
  ident: b24
  article-title: An energy-constrained signal subspace method for speech enhancement and recognition in white and colored noises
  publication-title: Speech Commun.
  contributor:
    fullname: Zhao
– start-page: 1
  year: 2018
  end-page: 24
  ident: b15
  article-title: Single-channel audio source separation with NMF: divergences, constraints and algorithms
  publication-title: Audio Source Separation
  contributor:
    fullname: Ozerov
– year: 2021
  ident: b31
  article-title: Speech decomposition based on a hybrid speech model and optimal segmentation
  publication-title: Interspeech
  contributor:
    fullname: Christensen
– start-page: 90
  year: 2014
  end-page: 94
  ident: b2
  article-title: Speech enhancement based on a few shapes of speech spectrum
  publication-title: 2014 IEEE China Summit & International Conference on Signal and Information Processing (ChinaSIP)
  contributor:
    fullname: Bao
– volume: 2007
  start-page: 092953
  year: 2007
  ident: b19
  article-title: Subspace-based noise reduction for speech signals via diagonal and triangular matrix decompositions: Survey and analysis
  publication-title: EURASIP J. Adv. Signal Process.
  contributor:
    fullname: Jensen
– year: 1995
  ident: b46
  article-title: A pitch extraction reference database
  publication-title: EUROSPEECH
  contributor:
    fullname: Ainsworth
– volume: 12
  start-page: 247
  year: 1993
  end-page: 251
  ident: b63
  article-title: Assessment for automatic speech recognition: Ii. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
  publication-title: Speech Commun.
  contributor:
    fullname: Steeneken
– volume: 27
  start-page: 1737
  year: 2019
  end-page: 1751
  ident: b51
  article-title: Robust Bayesian pitch tracking based on the harmonic model
  publication-title: IEEE/ACM Trans. Audio Speech Lang. Process.
  contributor:
    fullname: Christensen
– volume: 21
  start-page: 36
  year: 2004
  end-page: 47
  ident: b56
  article-title: Model-order selection: a review of information criterion rules
  publication-title: IEEE Signal Process. Mag.
  contributor:
    fullname: Selen
– volume: 495
  start-page: 518
  year: 1995
  ident: b60
  article-title: A robust algorithm for pitch tracking (RAPT)
  publication-title: Speech Coding and Synth.
  contributor:
    fullname: Talkin
– volume: 36
  start-page: 477
  year: 1988
  end-page: 489
  ident: b12
  article-title: Parameter estimation of superimposed signals using the EM algorithm
  publication-title: IEEE Trans. Acoust. Speech Signal Process.
  contributor:
    fullname: Weinstein
– volume: 124
  start-page: 1638
  year: 2008
  end-page: 1652
  ident: b5
  article-title: A sawtooth waveform inspired pitch estimator for speech and music
  publication-title: J. Acoust. Soc. Am.
  contributor:
    fullname: Harris
– volume: 25
  start-page: 457
  year: 2017
  end-page: 468
  ident: b20
  article-title: Multiplicative update of auto-regressive gains for codebook-based speech enhancement
  publication-title: IEEE/ACM Trans. Audio Speech Lang. Process.
  contributor:
    fullname: Bao
– volume: 21
  start-page: 793
  year: 2009
  end-page: 830
  ident: b13
  article-title: Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis
  publication-title: Neural Comput.
  contributor:
    fullname: Durrieu
– volume: 41
  start-page: 293
  year: 1967
  end-page: 309
  ident: b43
  article-title: Cepstrum pitch determination
  publication-title: J. Acoust. Soc. Am.
  contributor:
    fullname: Noll
– year: 2012
  ident: b4
  article-title: Sparse covariance fitting for direction of arrival estimation
  publication-title: EURASIP J. Adv. Signal Process.
  contributor:
    fullname: Nájar
– volume: 22
  start-page: 518
  year: 2014
  end-page: 530
  ident: b17
  article-title: PEFAC-a pitch estimation algorithm robust to high levels of noise
  publication-title: IEEE/ACM Trans. Audio Speech Lang. Process.
  contributor:
    fullname: Brookes
– volume: 163
  year: 2020
  ident: b23
  article-title: Speech enhancement method based on multi-band excitation model
  publication-title: Appl. Acoust.
  contributor:
    fullname: Xiang
– volume: 14
  start-page: 163
  year: 2006
  end-page: 176
  ident: b53
  article-title: Codebook driven short-term predictor parameter estimation for speech enhancement
  publication-title: IEEE Trans. Audio Speech Lang. Process.
  contributor:
    fullname: Kleijn
– volume: 135
  start-page: 439
  year: 2014
  end-page: 450
  ident: b21
  article-title: Effects of noise suppression on intelligibility. II: An attempt to validate physical metrics
  publication-title: J. Acoust. Soc. Am.
  contributor:
    fullname: Huckvale
– volume: 23
  start-page: 2421
  year: 2011
  end-page: 2456
  ident: b14
  article-title: Algorithms for nonnegative matrix factorization with the
  publication-title: Neural Comput.
  contributor:
    fullname: Idier
– volume: 78
  start-page: 65
  year: 1991
  end-page: 74
  ident: b49
  article-title: Estimating the frequency of a periodic function
  publication-title: Biometrika
  contributor:
    fullname: Thomson
– volume: 87
  start-page: 1603
  year: 1990
  end-page: 1611
  ident: b34
  article-title: Improved active sonar detection using autoregressive prewhiteners
  publication-title: J. Acoust. Soc. Am.
  contributor:
    fullname: Salisbury
– volume: 45
  start-page: 1195
  year: 2009
  end-page: 1196
  ident: b37
  article-title: Note on measures for spectral flatness
  publication-title: Electron. Lett.
  contributor:
    fullname: Madhu
– volume: 43
  start-page: 2659
  year: 2005
  end-page: 2665
  ident: b26
  article-title: Frequency-selective detection of nuclear quadrupole resonance signals
  publication-title: IEEE Trans. Geosci. Remote Sens.
  contributor:
    fullname: Smith
– volume: 21
  start-page: 2042
  year: 2013
  end-page: 2056
  ident: b6
  article-title: Accurate estimation of low fundamental frequencies from real-valued measurements
  publication-title: IEEE Trans. Audio Speech Lang. Process.
  contributor:
    fullname: Christensen
– volume: 180
  year: 2021
  ident: b48
  article-title: Fast algorithms for fundamental frequency estimation in autoregressive noise
  publication-title: Signal Process.
  contributor:
    fullname: Christensen
– year: 2005
  ident: b55
  article-title: Spectral analysis of signals
  publication-title: Pearson
  contributor:
    fullname: Moses
– volume: 26
  start-page: 783
  year: 2001
  end-page: 794
  ident: b62
  article-title: Experimental results on the detection of embedded objects by a prewhitening filter
  publication-title: IEEE J. Ocean. Eng.
  contributor:
    fullname: Trucco
– volume: 123
  year: 2017
  ident: b1
  article-title: Improved underwater signal detection using efficient time–frequency de-noising technique and Pre-whitening filter
  publication-title: Appl. Acoust.
  contributor:
    fullname: Sha’ameri
– volume: 20
  start-page: 1383
  year: 2012
  end-page: 1393
  ident: b16
  article-title: Unbiased MMSE-based noise power estimation with low complexity and low tracking delay
  publication-title: IEEE Trans. Audio Speech Lang. Process.
  contributor:
    fullname: Hendriks
– volume: 6
  start-page: 1
  issue: 1
  year: 1999
  ident: 10.1016/j.specom.2023.04.002_b52
  article-title: A statistical model-based voice activity detection
  publication-title: IEEE Signal Process. Lett.
  doi: 10.1109/97.736233
  contributor:
    fullname: Sohn
– ident: 10.1016/j.specom.2023.04.002_b27
  doi: 10.1109/ICASSP40776.2020.9053018
– volume: 163
  year: 2020
  ident: 10.1016/j.specom.2023.04.002_b23
  article-title: Speech enhancement method based on multi-band excitation model
  publication-title: Appl. Acoust.
  doi: 10.1016/j.apacoust.2020.107236
  contributor:
    fullname: Huang
– ident: 10.1016/j.specom.2023.04.002_b42
  doi: 10.1109/ICASSP.2018.8461683
– volume: 9
  start-page: 113
  issue: 4
  year: 2002
  ident: 10.1016/j.specom.2023.04.002_b9
  article-title: Optimal speech enhancement under signal presence uncertainty using log-spectral amplitude estimator
  publication-title: IEEE Signal Process. Lett.
  doi: 10.1109/97.1001645
  contributor:
    fullname: Cohen
– ident: 10.1016/j.specom.2023.04.002_b33
  doi: 10.23919/EUSIPCO.2018.8553039
– volume: 35
  start-page: 640
  issue: 8
  year: 1988
  ident: 10.1016/j.specom.2023.04.002_b3
  article-title: Application of prewhitening to AR spectral estimation of EEG
  publication-title: IEEE Trans. Biomed. Eng.
  doi: 10.1109/10.4597
  contributor:
    fullname: Birch
– ident: 10.1016/j.specom.2023.04.002_b11
– ident: 10.1016/j.specom.2023.04.002_b29
  doi: 10.23919/EUSIPCO.2019.8902763
– volume: 5
  start-page: 163
  issue: 2
  year: 1983
  ident: 10.1016/j.specom.2023.04.002_b39
  article-title: A dynamic programming algorithm for nonlinear smoothing
  publication-title: Signal Process.
  doi: 10.1016/0165-1684(83)90022-1
  contributor:
    fullname: Ney
– volume: 2
  start-page: 155
  issue: 3
  year: 2006
  ident: 10.1016/j.specom.2023.04.002_b18
  article-title: Toeplitz and circulant matrices: A review
  publication-title: Found. Trends® Commun. Inf. Theory
  doi: 10.1561/0100000006
  contributor:
    fullname: Gray
– volume: 26
  start-page: 783
  issue: 4
  year: 2001
  ident: 10.1016/j.specom.2023.04.002_b62
  article-title: Experimental results on the detection of embedded objects by a prewhitening filter
  publication-title: IEEE J. Ocean. Eng.
  doi: 10.1109/48.972119
  contributor:
    fullname: Trucco
– volume: 43
  start-page: 2659
  issue: 11
  year: 2005
  ident: 10.1016/j.specom.2023.04.002_b26
  article-title: Frequency-selective detection of nuclear quadrupole resonance signals
  publication-title: IEEE Trans. Geosci. Remote Sens.
  doi: 10.1109/TGRS.2005.856633
  contributor:
    fullname: Jakobsson
– ident: 10.1016/j.specom.2023.04.002_b50
– volume: 41
  start-page: 293
  issue: 2
  year: 1967
  ident: 10.1016/j.specom.2023.04.002_b43
  article-title: Cepstrum pitch determination
  publication-title: J. Acoust. Soc. Am.
  doi: 10.1121/1.1910339
  contributor:
    fullname: Noll
– volume: 14
  start-page: 163
  issue: 1
  year: 2006
  ident: 10.1016/j.specom.2023.04.002_b53
  article-title: Codebook driven short-term predictor parameter estimation for speech enhancement
  publication-title: IEEE Trans. Audio Speech Lang. Process.
  doi: 10.1109/TSA.2005.854113
  contributor:
    fullname: Srinivasan
– start-page: 1
  year: 2018
  ident: 10.1016/j.specom.2023.04.002_b15
  article-title: Single-channel audio source separation with NMF: divergences, constraints and algorithms
  contributor:
    fullname: Févotte
– volume: 135
  start-page: 188
  issue: Supplement C
  year: 2017
  ident: 10.1016/j.specom.2023.04.002_b41
  article-title: Fast fundamental frequency estimation: Making a statistically efficient estimator computationally efficient
  publication-title: Signal Process.
  doi: 10.1016/j.sigpro.2017.01.011
  contributor:
    fullname: Nielsen
– year: 2005
  ident: 10.1016/j.specom.2023.04.002_b55
  article-title: Spectral analysis of signals
  publication-title: Pearson
  contributor:
    fullname: Stoica
– volume: 26
  start-page: 296
  issue: 2
  year: 2017
  ident: 10.1016/j.specom.2023.04.002_b59
  article-title: Off-grid fundamental frequency estimation
  publication-title: IEEE/ACM Trans. Audio Speech Lang. Process.
  doi: 10.1109/TASLP.2017.2775800
  contributor:
    fullname: Swärd
– volume: 87
  start-page: 1603
  issue: 4
  year: 1990
  ident: 10.1016/j.specom.2023.04.002_b34
  article-title: Improved active sonar detection using autoregressive prewhiteners
  publication-title: J. Acoust. Soc. Am.
  doi: 10.1121/1.399408
  contributor:
    fullname: Kay
– volume: 21
  start-page: 2042
  issue: 10
  year: 2013
  ident: 10.1016/j.specom.2023.04.002_b6
  article-title: Accurate estimation of low fundamental frequencies from real-valued measurements
  publication-title: IEEE Trans. Audio Speech Lang. Process.
  doi: 10.1109/TASL.2013.2265085
  contributor:
    fullname: Christensen
– start-page: 79
  year: 2012
  ident: 10.1016/j.specom.2023.04.002_b64
  article-title: Infinite composite autoregressive models for music signal analysis.
  contributor:
    fullname: Yoshii
– volume: 124
  start-page: 1638
  issue: 3
  year: 2008
  ident: 10.1016/j.specom.2023.04.002_b5
  article-title: A sawtooth waveform inspired pitch estimator for speech and music
  publication-title: J. Acoust. Soc. Am.
  doi: 10.1121/1.2951592
  contributor:
    fullname: Camacho
– volume: 2007
  start-page: 092953
  year: 2007
  ident: 10.1016/j.specom.2023.04.002_b19
  article-title: Subspace-based noise reduction for speech signals via diagonal and triangular matrix decompositions: Survey and analysis
  publication-title: EURASIP J. Adv. Signal Process.
  doi: 10.1155/2007/92953
  contributor:
    fullname: Hansen
– volume: 135
  start-page: 439
  issue: 1
  year: 2014
  ident: 10.1016/j.specom.2023.04.002_b21
  article-title: Effects of noise suppression on intelligibility. II: An attempt to validate physical metrics
  publication-title: J. Acoust. Soc. Am.
  doi: 10.1121/1.4837238
  contributor:
    fullname: Hilkhuysen
– volume: 28
  start-page: 84
  issue: 1
  year: 1980
  ident: 10.1016/j.specom.2023.04.002_b36
  article-title: An algorithm for vector quantizer design
  publication-title: IEEE Trans. Commun.
  doi: 10.1109/TCOM.1980.1094577
  contributor:
    fullname: Linde
– volume: 26
  start-page: 165
  issue: 3
  year: 1998
  ident: 10.1016/j.specom.2023.04.002_b24
  article-title: An energy-constrained signal subspace method for speech enhancement and recognition in white and colored noises
  publication-title: Speech Commun.
  doi: 10.1016/S0167-6393(98)00041-7
  contributor:
    fullname: Huang
– ident: 10.1016/j.specom.2023.04.002_b47
  doi: 10.1109/ACSSC.2007.4487291
– ident: 10.1016/j.specom.2023.04.002_b58
  doi: 10.1109/ICASSP.2002.5743722
– volume: 180
  year: 2021
  ident: 10.1016/j.specom.2023.04.002_b48
  article-title: Fast algorithms for fundamental frequency estimation in autoregressive noise
  publication-title: Signal Process.
  doi: 10.1016/j.sigpro.2020.107860
  contributor:
    fullname: Quinn
– ident: 10.1016/j.specom.2023.04.002_b22
  doi: 10.21437/ICSLP.2000-743
– volume: 25
  start-page: 457
  issue: 3
  year: 2017
  ident: 10.1016/j.specom.2023.04.002_b20
  article-title: Multiplicative update of auto-regressive gains for codebook-based speech enhancement
  publication-title: IEEE/ACM Trans. Audio Speech Lang. Process.
  doi: 10.1109/TASLP.2016.2636445
  contributor:
    fullname: He
– ident: 10.1016/j.specom.2023.04.002_b66
  doi: 10.1109/ICASSP40776.2020.9053746
– ident: 10.1016/j.specom.2023.04.002_b65
– volume: 22
  start-page: 518
  issue: 2
  year: 2014
  ident: 10.1016/j.specom.2023.04.002_b17
  article-title: PEFAC-a pitch estimation algorithm robust to high levels of noise
  publication-title: IEEE/ACM Trans. Audio Speech Lang. Process.
  doi: 10.1109/TASLP.2013.2295918
  contributor:
    fullname: Gonzalez
– volume: 9
  start-page: 504
  issue: 5
  year: 2001
  ident: 10.1016/j.specom.2023.04.002_b38
  article-title: Noise power spectral density estimation based on optimal smoothing and minimum statistics
  publication-title: IEEE Trans. Speech Audio Process.
  doi: 10.1109/89.928915
  contributor:
    fullname: Martin
– volume: 15
  start-page: 441
  issue: 2
  year: 2007
  ident: 10.1016/j.specom.2023.04.002_b54
  article-title: Codebook-based Bayesian speech enhancement for nonstationary environments
  publication-title: IEEE Trans. Audio Speech Lang. Process.
  doi: 10.1109/TASL.2006.881696
  contributor:
    fullname: Srinivasan
– start-page: 1
  year: 2018
  ident: 10.1016/j.specom.2023.04.002_b57
  article-title: Dregon: Dataset and methods for uav-embedded sound source localization
  contributor:
    fullname: Strauss
– volume: 2017
  start-page: 1
  issue: 1
  year: 2017
  ident: 10.1016/j.specom.2023.04.002_b10
  article-title: Modeling of non-Gaussian colored noise and application in CR multi-sensor networks
  publication-title: EURASIP J. Wireless Commun. Networking
  doi: 10.1186/s13638-017-0983-3
  contributor:
    fullname: Dou
– year: 1992
  ident: 10.1016/j.specom.2023.04.002_b61
  contributor:
    fullname: Therrien
– start-page: 2325
  year: 2018
  ident: 10.1016/j.specom.2023.04.002_b28
  article-title: On optimal filtering for speech decomposition
  contributor:
    fullname: Jaramillo
– ident: 10.1016/j.specom.2023.04.002_b8
– volume: 12
  start-page: 247
  issue: 3
  year: 1993
  ident: 10.1016/j.specom.2023.04.002_b63
  article-title: Assessment for automatic speech recognition: Ii. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems
  publication-title: Speech Commun.
  doi: 10.1016/0167-6393(93)90095-3
  contributor:
    fullname: Varga
– volume: 23
  start-page: 2421
  issue: 9
  year: 2011
  ident: 10.1016/j.specom.2023.04.002_b14
  article-title: Algorithms for nonnegative matrix factorization with the β-divergence
  publication-title: Neural Comput.
  doi: 10.1162/NECO_a_00168
  contributor:
    fullname: Févotte
– volume: 36
  start-page: 477
  issue: 4
  year: 1988
  ident: 10.1016/j.specom.2023.04.002_b12
  article-title: Parameter estimation of superimposed signals using the EM algorithm
  publication-title: IEEE Trans. Acoust. Speech Signal Process.
  doi: 10.1109/29.1552
  contributor:
    fullname: Feder
– start-page: 90
  year: 2014
  ident: 10.1016/j.specom.2023.04.002_b2
  article-title: Speech enhancement based on a few shapes of speech spectrum
  contributor:
    fullname: Bao
– ident: 10.1016/j.specom.2023.04.002_b25
– volume: 73
  start-page: 50
  year: 2012
  ident: 10.1016/j.specom.2023.04.002_b45
  article-title: Wide-band dereverberation method based on multichannel linear prediction using prewhitening filter
  publication-title: Appl. Acoust.
  doi: 10.1016/j.apacoust.2011.07.004
  contributor:
    fullname: Okamoto
– volume: 45
  start-page: 1195
  issue: 23
  year: 2009
  ident: 10.1016/j.specom.2023.04.002_b37
  article-title: Note on measures for spectral flatness
  publication-title: Electron. Lett.
  doi: 10.1049/el.2009.1977
  contributor:
    fullname: Madhu
– volume: 123
  year: 2017
  ident: 10.1016/j.specom.2023.04.002_b1
  article-title: Improved underwater signal detection using efficient time–frequency de-noising technique and Pre-whitening filter
  publication-title: Appl. Acoust.
  contributor:
    fullname: Al-Aboosi
– year: 2021
  ident: 10.1016/j.specom.2023.04.002_b31
  article-title: Speech decomposition based on a hybrid speech model and optimal segmentation
  contributor:
    fullname: Jaramillo
– ident: 10.1016/j.specom.2023.04.002_b35
– volume: 495
  start-page: 518
  year: 1995
  ident: 10.1016/j.specom.2023.04.002_b60
  article-title: A robust algorithm for pitch tracking (RAPT)
  publication-title: Speech Coding and Synth.
  contributor:
    fullname: Talkin
– volume: 21
  start-page: 793
  issue: 3
  year: 2009
  ident: 10.1016/j.specom.2023.04.002_b13
  article-title: Nonnegative matrix factorization with the Itakura-Saito divergence: With application to music analysis
  publication-title: Neural Comput.
  doi: 10.1162/neco.2008.04-08-771
  contributor:
    fullname: Févotte
– ident: 10.1016/j.specom.2023.04.002_b32
  doi: 10.1109/WASPAA.2019.8937252
– year: 1995
  ident: 10.1016/j.specom.2023.04.002_b46
  article-title: A pitch extraction reference database
  contributor:
    fullname: Plante
– volume: 20
  start-page: 1383
  issue: 4
  year: 2012
  ident: 10.1016/j.specom.2023.04.002_b16
  article-title: Unbiased MMSE-based noise power estimation with low complexity and low tracking delay
  publication-title: IEEE Trans. Audio Speech Lang. Process.
  doi: 10.1109/TASL.2011.2180896
  contributor:
    fullname: Gerkmann
– issue: 1
  year: 2012
  ident: 10.1016/j.specom.2023.04.002_b4
  article-title: Sparse covariance fitting for direction of arrival estimation
  publication-title: EURASIP J. Adv. Signal Process.
  doi: 10.1186/1687-6180-2012-111
  contributor:
    fullname: Blanco
– ident: 10.1016/j.specom.2023.04.002_b30
  doi: 10.1109/ICASSP.2019.8683653
– volume: 27
  start-page: 1737
  issue: 11
  year: 2019
  ident: 10.1016/j.specom.2023.04.002_b51
  article-title: Robust Bayesian pitch tracking based on the harmonic model
  publication-title: IEEE/ACM Trans. Audio Speech Lang. Process.
  doi: 10.1109/TASLP.2019.2930917
  contributor:
    fullname: Shi
– year: 2009
  ident: 10.1016/j.specom.2023.04.002_b7
  contributor:
    fullname: Christensen
– volume: 24
  start-page: 2354
  issue: 12
  year: 2016
  ident: 10.1016/j.specom.2023.04.002_b44
  article-title: Instantaneous fundamental frequency estimation with optimal segmentation for nonstationary voiced speech
  publication-title: IEEE/ACM Trans. Audio Speech Lang. Process.
  doi: 10.1109/TASLP.2016.2608948
  contributor:
    fullname: Nørholm
– volume: 78
  start-page: 65
  issue: 1
  year: 1991
  ident: 10.1016/j.specom.2023.04.002_b49
  article-title: Estimating the frequency of a periodic function
  publication-title: Biometrika
  doi: 10.1093/biomet/78.1.65
  contributor:
    fullname: Quinn
– volume: 21
  start-page: 36
  issue: 4
  year: 2004
  ident: 10.1016/j.specom.2023.04.002_b56
  article-title: Model-order selection: a review of information criterion rules
  publication-title: IEEE Signal Process. Mag.
  doi: 10.1109/MSP.2004.1311138
  contributor:
    fullname: Stoica
– ident: 10.1016/j.specom.2023.04.002_b40
  doi: 10.1109/IWAENC.2014.6953334
SSID ssj0004882
Score 2.4079802
Snippet A common assumption in many speech and acoustic processing methods is that the noise is white and Gaussian (WGN). Although making this assumption results in...
SourceID crossref
elsevier
SourceType Aggregation Database
Publisher
StartPage 9
SubjectTerms Colored
Enhancement
NMF
Pitch
Pre-whitening
TOA
Title An adaptive autoregressive pre-whitener for speech and acoustic signals based on parametric NMF
URI https://dx.doi.org/10.1016/j.specom.2023.04.002
Volume 151
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LT8JAEJ4QvXAxii8UyR6Mtwq02xaOhEhQkYsSuW22-4j4aEnBGC_-dmf6QI2JB0_NNrvJZnf6zbfbb2YATrnva2OlcbTEswlX2nMiFXUd33Q60u91rJZZts9JMJryq5k_q8CgjIUhWWWB_TmmZ2hdvGkVq9lazOetWxLQo3_1kERTSCXhMEdnhDZ9_vEl80ADdcv83tS7DJ_LNF4UzZhQPLrrZQlPi8uVX-7pm8sZbsNWwRVZP5_ODlRMXIODcXHDuGRnbLxOirysQXUNZu81aORxt-zePFuZGuxbvkjSp10Q_ZhJLRcEdkxSHgOTHbypScKQN_q7EJuUIadlOH-jHpiMNUP8zMp_MdJ9oOUy8oKaJTGjHOIvVJ5LscnNcA-mw4u7wcgpai04Cj_xlaMN8oQgUkYFbte6KuDkvhGV8YAV9gIdcORqPDTIp9rWdqxrvcgahQ8pLYKmtw8bcRKbQ2Ch8n2ibVLpkPte0FMyoOqFbqR5l-tuHZxyicUiT6khSq3Zo8i3RNCWiDYXOIk6hOU-iB-mIRD1_xx59O-Rx1ClVq4Ja8DGKn01J8g-VlEzM68mbPYvr0eTTx-b2yk
link.rule.ids 315,786,790,4521,24144,27957,27958,45620,45714
linkProvider Elsevier
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3JTsMwEB2h9kAvCMpWVh8Qt6gksZP0WFVULbS9UAQ3y_Ei1rQqRYi_ZyYLi5A4cIrixJLlcd48O29mAE64EMY6ZT2jcG_CtQm9VKeJJ6zvK9HxnVF5ts9JNLjmF7fidgV6VSwMySpL7C8wPUfrsqVdzmZ7fn_fviIBPfrXEEk0hVQiDte5iH1eg3p3eDmYfIVHJnnNqDzFN3WoIuhymRcFNM4oJD0I85yn5fnKLw_1zev012GtpIusW4xoA1Zs1oSdUXnI-MJO2egzL_JLExqfePbehIMi9Jbd2CenFhbfrRpmi8dNkN2MKaPmhHdMUSoDm--96Za0IW_0gyGzC4a0luH4rb5jKjMMITSvAMZI-oGLl5EjNGyWMUoj_kwVujSbjPtbcN0_n_YGXlluwdP4lS89Y5EqRKm2OgoSF-iIkwdHYMY9VtyJTMSRrvHYIqU6c853gQtTZzVelHKIm-E21LJZZneBxVoIYm5Km5iLMOpoFVEBwyA1POEmaYFXTbGcF1k1ZCU3e5CFSSSZRJ5xiYNoQVzZQf5YHRKB_8-ee__ueQyrg-l4JEfDyeU-NOhJIRE7gNpy8WoPkYws06NysX0AuQ3d3w
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=An+adaptive+autoregressive+pre-whitener+for+speech+and+acoustic+signals+based+on+parametric+NMF&rft.jtitle=Speech+communication&rft.au=Jaramillo%2C+Alfredo+Esquivel&rft.au=Nielsen%2C+Jesper+Kj%C3%A6r&rft.au=Christensen%2C+Mads+Gr%C3%A6sb%C3%B8ll&rft.date=2023-06-01&rft.pub=Elsevier+B.V&rft.issn=0167-6393&rft.eissn=1872-7182&rft.volume=151&rft.spage=9&rft.epage=23&rft_id=info:doi/10.1016%2Fj.specom.2023.04.002&rft.externalDocID=S0167639323000572
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0167-6393&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0167-6393&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0167-6393&client=summon