α-Jacobian environmental adaptation

The robustness of automatic speech recognition systems to noise is still a problem, especially for small footprint systems. This paper addresses the problem of noise robustness using model compensation methods. Such algorithms are already available, but their complexity is usually high. An often-ref...

Full description

Saved in:
Bibliographic Details
Published inSpeech communication Vol. 42; no. 1; pp. 25 - 41
Main Authors Cerisara, Christophe, Rigazio, Luca, Junqua, Jean-Claude
Format Journal Article
LanguageEnglish
Published Elsevier B.V 2004
Elsevier : North-Holland
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The robustness of automatic speech recognition systems to noise is still a problem, especially for small footprint systems. This paper addresses the problem of noise robustness using model compensation methods. Such algorithms are already available, but their complexity is usually high. An often-referenced method for achieving noise robustness is parallel model combination (PMC). Several algorithms have been proposed to develop more computationally efficient methods than PMC. For example, Jacobian adaptation approximates PMC with a linear transformation function in the cepstral domain. However, the Jacobian approximation is valid only for test environments that are close to the training conditions whereas, in real test conditions, the mismatch between the test and training environments is usually large. In this paper, we propose two methods, respectively called static and dynamic α-Jacobian adaptation (or α-JAC), to compute new linear approximations of PMC for realistic test environments. We further extend both algorithms to compensate for additive and convolutional noise and we derive the corresponding non-linear algorithm that is approximated. All these algorithms are experimentally compared in important mismatch conditions. As compared to Jacobian adaptation, improvements are observed with both static and dynamic α-Jacobian adaptation.
ISSN:0167-6393
1872-7182
DOI:10.1016/j.specom.2003.08.003