Deep beamforming networks for multi-channel speech recognition
Xiong Xiao, Watanabe, Shinji, Erdogan, Hakan, Liang Lu, Hershey, John, Seltzer, Michael L., Guoguo Chen, Yu Zhang, Mandel, Michael, Dong Yu
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2016)
Conference Proceeding
Deep Neural Networks for Single-Channel Multi-Talker Speech Recognition
Chao Weng, Dong Yu, Seltzer, Michael L., Droppo, Jasha
Published in IEEE/ACM transactions on audio, speech, and language processing (01.10.2015)
Journal Article
Improving speech recognition in reverberation using a room-aware deep neural network and multi-task learning
Giri, Ritwik, Seltzer, Michael L., Droppo, Jasha, Dong Yu
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2015)
Conference Proceeding
A study on data augmentation of reverberant speech for robust speech recognition
Ko, Tom, Peddinti, Vijayaditya, Povey, Daniel, Seltzer, Michael L., Khudanpur, Sanjeev
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2017)
Conference Proceeding
An investigation of deep neural networks for noise robust speech recognition
Seltzer, Michael L., Dong Yu, Yongqiang Wang
Published in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (01.05.2013)
Conference Proceeding
Transformer-Based Acoustic Modeling for Hybrid Speech Recognition
Wang, Yongqiang, Mohamed, Abdelrahman, Le, Duc, Liu, Chunxi, Xiao, Alex, Mahadeokar, Jay, Huang, Hongzhao, Tjandra, Andros, Zhang, Xiaohui, Zhang, Frank, Fuegen, Christian, Zweig, Geoffrey, Seltzer, Michael L.
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Conference Proceeding
Alignment Restricted Streaming Recurrent Neural Network Transducer
Mahadeokar, Jay, Shangguan, Yuan, Le, Duc, Keren, Gil, Su, Hang, Le, Thong, Yeh, Ching-Feng, Fuegen, Christian, Seltzer, Michael L.
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19.01.2021)
Conference Proceeding
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
Kim, Suyoun, Shangguan, Yuan, Mahadeokar, Jay, Bruguier, Antoine, Fuegen, Christian, Seltzer, Michael L., Le, Duc
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Conference Proceeding
AIPNet: Generative Adversarial Pre-Training of Accent-Invariant Networks for End-To-End Speech Recognition
Chen, Yi-Chen, Yang, Zhaojun, Yeh, Ching-Feng, Jain, Mahaveer, Seltzer, Michael L.
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Conference Proceeding
Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers
Le, Duc, Seide, Frank, Wang, Yuhao, Li, Yang, Schubert, Kjell, Kalinli, Ozlem, Seltzer, Michael L.
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Conference Proceeding
Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities
Tjandra, Andros, Singhal, Nayan, Zhang, David, Kalinli, Ozlem, Mohamed, Abdelrahman, Le, Duc, Seltzer, Michael L.
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Conference Proceeding
Memory-Efficient Speech Recognition on Smart Devices
Venkatesh, Ganesh, Valliappan, Alagappan, Mahadeokar, Jay, Shangguan, Yuan, Fuegen, Christian, Seltzer, Michael L., Chandra, Vikas
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Conference Proceeding
Improving fast-slow Encoder based Transducer with Streaming Deliberation
Li, Ke, Mahadeokar, Jay, Guo, Jinxi, Shi, Yangyang, Keren, Gil, Kalinli, Ozlem, Seltzer, Michael L., Le, Duc
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Conference Proceeding
Deep neural network features and semi-supervised training for low resource speech recognition
Thomas, Samuel, Seltzer, Michael L., Church, Kenneth, Hermansky, Hynek
Published in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (01.05.2013)
Conference Proceeding
Neural-FST Class Language Model for End-to-End Speech Recognition
Bruguier, Antoine, Le, Duc, Prabhavalkar, Rohit, Li, Dangna, Liu, Zhe, Wang, Bo, Chang, Eun, Peng, Fuchun, Kalinli, Ozlem, Seltzer, Michael L.
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Conference Proceeding