Phone Synchronous Speech Recognition With CTC Lattices
Chen, Zhehuai, Zhuang, Yimeng, Qian, Yanmin, Yu, Kai
Published in IEEE/ACM transactions on audio, speech, and language processing (01.01.2017)
Published in IEEE/ACM transactions on audio, speech, and language processing (01.01.2017)
Get full text
Journal Article
Improving Speech Recognition Using Consistent Predictions on Synthesized Speech
Wang, Gary, Rosenberg, Andrew, Chen, Zhehuai, Zhang, Yu, Ramabhadran, Bhuvana, Wu, Yonghui, Moreno, Pedro
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Get full text
Conference Proceeding
Confidence measures for CTC-based phone synchronous decoding
Zhehuai Chen, Yimeng Zhuang, Kai Yu
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2017)
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2017)
Get full text
Conference Proceeding
Sequence Modeling in Unsupervised Single-Channel Overlapped Speech Recognition
Chen, Zhehuai, Droppo, Jasha
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Get full text
Conference Proceeding
Directed automatic speech transcription error correction using bidirectional LSTM
Da Zheng, Zhehuai Chen, Yue Wu, Kai Yu
Published in 2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) (01.10.2016)
Published in 2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) (01.10.2016)
Get full text
Conference Proceeding
An investigation of implementation and performance analysis of DNN based speech synthesis system
Zhehuai Chen, Kai Yu
Published in 2014 12th International Conference on Signal Processing (ICSP) (01.10.2014)
Published in 2014 12th International Conference on Signal Processing (ICSP) (01.10.2014)
Get full text
Conference Proceeding
End-to-end Contextual Speech Recognition Using Class Language Models and a Token Passing Decoder
Chen, Zhehuai, Jain, Mahaveer, Wang, Yongqiang, Seltzer, Michael L., Fuegen, Christian
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2019)
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2019)
Get full text
Conference Proceeding
On Modular Training of Neural Acoustics-to-Word Model for LVCSR
Chen, Zhehuai, Liu, Qi, Li, Hao, Yu, Kai
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Get full text
Conference Proceeding
Accelerating RNN-T Training and Inference Using CTC Guidance
Wang, Yongqiang, Chen, Zhehuai, Zheng, Chengjian, Zhang, Yu, Han, Wei, Haghani, Parisa
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Get full text
Conference Proceeding
Understanding Shared Speech-Text Representations
Wang, Gary, Kastner, Kyle, Bapna, Ankur, Chen, Zhehuai, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zhang, Yu
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Get full text
Conference Proceeding
Tts4pretrain 2.0: Advancing the use of Text and Speech in ASR Pretraining with Consistency and Contrastive Losses
Chen, Zhehuai, Zhang, Yu, Rosenberg, Andrew, Ramabhadran, Bhuvana, Moreno, Pedro, Wang, Gary
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Get full text
Conference Proceeding
SALM: Speech-Augmented Language Model with in-Context Learning for Speech Recognition and Translation
Chen, Zhehuai, Huang, He, Andrusenko, Andrei, Hrinchuk, Oleksii, Puvvada, Krishna C., Li, Jason, Ghosh, Subhankar, Balam, Jagadeesh, Ginsburg, Boris
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Get full text
Conference Proceeding
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-to-Speech
Saeki, Takaaki, Zen, Heiga, Chen, Zhehuai, Morioka, Nobuyuki, Wang, Gary, Zhang, Yu, Bapna, Ankur, Rosenberg, Andrew, Ramabhadran, Bhuvana
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Get full text
Conference Proceeding
An Asynchronous WFST-Based Decoder for Automatic Speech Recognition
Lv, Hang, Chen, Zhehuai, Xu, Hainan, Povey, Daniel, Xie, Lei, Khudanpur, Sanjeev
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Get full text
Conference Proceeding