Complex linear projection for acoustic modeling

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech recognition using complex linear projection are disclosed. In one aspect, a method includes the actions of receiving audio data corresponding to an utterance. The method further includes gen...

Full description

Saved in:
Bibliographic Details
Main Authors Visontai, Mirko, Shafran, Izhak, Bengio, Samuel, Sainath, Tara N, Variani, Ehsan, Thornton, Christopher Walter George, Bacchiani, Michiel A. U
Format Patent
LanguageEnglish
Published 27.11.2018
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech recognition using complex linear projection are disclosed. In one aspect, a method includes the actions of receiving audio data corresponding to an utterance. The method further includes generating frequency domain data using the audio data. The method further includes processing the frequency domain data using complex linear projection. The method further includes providing the processed frequency domain data to a neural network trained as an acoustic model. The method further includes generating a transcription for the utterance that is determined based at least on output that the neural network provides in response to receiving the processed frequency domain data.
Bibliography:Application Number: US201615386979