Nonlinear mapping for feature extraction in automatic speech recognition

The present invention successfully combines neural-net discriminative feature processing with Gaussian-mixture distribution modeling (GMM). By training one or more neural networks to generate subword probability posteriors, then using transformations of these estimates as the base features for a con...

Full description

Saved in:

Bibliographic Details
Main Authors	Hermansky, Hynek, Sharma, Sangita, Ellis, Daniel
Format	Patent
Language	English
Published	07.08.2007
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The present invention successfully combines neural-net discriminative feature processing with Gaussian-mixture distribution modeling (GMM). By training one or more neural networks to generate subword probability posteriors, then using transformations of these estimates as the base features for a conventionally-trained Gaussian-mixture based system, substantial error rate reductions may be achieved. The present invention effectively has two acoustic models in tandem-first a neural net and then a GMM. By using a variety of combination schemes available for connectionist models, various systems based upon multiple features streams can be constructed with even greater error rate reductions.