Deep Convolutional Neural Networks for Large-scale Speech Tasks
Sainath, Tara N., Kingsbury, Brian, Saon, George, Soltau, Hagen, Mohamed, Abdel-rahman, Dahl, George, Ramabhadran, Bhuvana
Published in Neural networks (01.04.2015)
Published in Neural networks (01.04.2015)
Get full text
Journal Article
Analyzing convolutional neural networks for speech activity detection in mismatched acoustic conditions
Thomas, Samuel, Ganapathy, Sriram, Saon, George, Soltau, Hagen
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2014)
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2014)
Get full text
Conference Proceeding
Joint training of convolutional and non-convolutional neural networks
Soltau, Hagen, Saon, George, Sainath, Tara N.
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2014)
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2014)
Get full text
Conference Proceeding
Speaker adaptation of neural network acoustic models using i-vectors
Saon, George, Soltau, Hagen, Nahamoo, David, Picheny, Michael
Published in 2013 IEEE Workshop on Automatic Speech Recognition and Understanding (01.12.2013)
Published in 2013 IEEE Workshop on Automatic Speech Recognition and Understanding (01.12.2013)
Get full text
Conference Proceeding
Monotonic Recurrent Neural Network Transducer and Decoding Strategies
Tripathi, Anshuman, Lu, Han, Sak, Hasim, Soltau, Hagen
Published in 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (01.12.2019)
Published in 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (01.12.2019)
Get full text
Conference Proceeding
Retrieval Augmented End-to-End Spoken Dialog Models
Wang, Mingqiu, Shafran, Izhak, Soltau, Hagen, Han, Wei, Cao, Yuan, Yu, Dian, El Shafey, Laurent
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Get full text
Conference Proceeding
Optimization Techniques to Improve Training Speed of Deep Neural Networks for Large Speech Tasks : LARGE-SCALE OPTIMIZATION FOR AUDIO, SPEECH, AND LANGUAGE PROCESSING
SAINATH, Tara N, KINGSBURY, Brian, SOLTAU, Hagen, RAMABHADRAN, Bhuvana
Published in IEEE transactions on audio, speech, and language processing (2013)
Get full text
Published in IEEE transactions on audio, speech, and language processing (2013)
Journal Article
Exploiting diversity for spoken term detection
Mangu, Lidia, Soltau, Hagen, Hong-Kwang Kuo, Kingsbury, Brian, Saon, George
Published in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (01.05.2013)
Published in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (01.05.2013)
Get full text
Conference Proceeding
Advances in speech transcription at IBM under the DARPA EARS program
Chen, S.F., Kingsbury, B., Lidia Mangu, Povey, D., Saon, G., Soltau, H., Zweig, G.
Published in IEEE transactions on audio, speech, and language processing (01.09.2006)
Published in IEEE transactions on audio, speech, and language processing (01.09.2006)
Get full text
Journal Article
Progress in dynamic network decoding
Nolden, David, Soltau, Hagen, Ney, Hermann
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2014)
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2014)
Get full text
Conference Proceeding
Morpheme-based feature-rich language models using Deep Neural Networks for LVCSR of Egyptian Arabic
El-Desoky Mousa, Amr, Kuo, Hong-Kwang Jeff, Mangu, Lidia, Soltau, Hagen
Published in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (01.05.2013)
Published in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (01.05.2013)
Get full text
Conference Proceeding
Efficient spoken term detection using confusion networks
Mangu, Lidia, Kingsbury, Brian, Soltau, Hagen, Hong-Kwang Kuo, Picheny, Michael
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2014)
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2014)
Get full text
Conference Proceeding
Advances in Arabic Speech Transcription at IBM Under the DARPA GALE Program
Soltau, H., Saon, G., Kingsbury, B., Kuo, H.-K.J., Mangu, L., Povey, D., Emami, A.
Published in IEEE transactions on audio, speech, and language processing (01.07.2009)
Published in IEEE transactions on audio, speech, and language processing (01.07.2009)
Get full text
Journal Article
Improvements to Deep Convolutional Neural Networks for LVCSR
Sainath, Tara N., Kingsbury, Brian, Mohamed, Abdel-rahman, Dahl, George E., Saon, George, Soltau, Hagen, Beran, Tomas, Aravkin, Aleksandr Y., Ramabhadran, Bhuvana
Published in 2013 IEEE Workshop on Automatic Speech Recognition and Understanding (01.12.2013)
Published in 2013 IEEE Workshop on Automatic Speech Recognition and Understanding (01.12.2013)
Get full text
Conference Proceeding
Out-of-vocabulary word detection in a speech-to-speech translation system
Hong-Kwang Kuo, Kislal, Ellen Eide, Mangu, Lidia, Soltau, Hagen, Beran, Tomas
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2014)
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2014)
Get full text
Conference Proceeding
Advances in Arabic Speech Transcription at IBM Under the DARPA GALE Program : Processing morphologically rich languages
SOLTAU, Hagen, SAON, George, KINGSBURY, Brian, KUO, Hong-Kwang Jeff, MANGU, Lidia, POVEY, Daniel, EMAMI, Ahmad
Published in IEEE transactions on audio, speech, and language processing (2009)
Get full text
Published in IEEE transactions on audio, speech, and language processing (2009)
Journal Article