Phone Synchronous Speech Recognition With CTC Lattices
Chen, Zhehuai, Zhuang, Yimeng, Qian, Yanmin, Yu, Kai
Published in IEEE/ACM transactions on audio, speech, and language processing (01.01.2017)
Published in IEEE/ACM transactions on audio, speech, and language processing (01.01.2017)
Get full text
Journal Article
Improving Speech Recognition Using Consistent Predictions on Synthesized Speech
Wang, Gary, Rosenberg, Andrew, Chen, Zhehuai, Zhang, Yu, Ramabhadran, Bhuvana, Wu, Yonghui, Moreno, Pedro
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Get full text
Conference Proceeding
Confidence measures for CTC-based phone synchronous decoding
Zhehuai Chen, Yimeng Zhuang, Kai Yu
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2017)
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2017)
Get full text
Conference Proceeding
Sequence Modeling in Unsupervised Single-Channel Overlapped Speech Recognition
Chen, Zhehuai, Droppo, Jasha
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Get full text
Conference Proceeding
Directed automatic speech transcription error correction using bidirectional LSTM
Da Zheng, Zhehuai Chen, Yue Wu, Kai Yu
Published in 2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) (01.10.2016)
Published in 2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) (01.10.2016)
Get full text
Conference Proceeding
An investigation of implementation and performance analysis of DNN based speech synthesis system
Zhehuai Chen, Kai Yu
Published in 2014 12th International Conference on Signal Processing (ICSP) (01.10.2014)
Published in 2014 12th International Conference on Signal Processing (ICSP) (01.10.2014)
Get full text
Conference Proceeding
End-to-end Contextual Speech Recognition Using Class Language Models and a Token Passing Decoder
Chen, Zhehuai, Jain, Mahaveer, Wang, Yongqiang, Seltzer, Michael L., Fuegen, Christian
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2019)
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2019)
Get full text
Conference Proceeding
SPEECH RECOGNITION USING UNSPOKEN TEXT AND SPEECH SYNTHESIS
PEDRO J MORENO MENGIBAR, CHEN ZHEHUAI, BHUVANA RAMABHADRAN, ANDREW ROSENBERG
Year of Publication 10.04.2024
Get full text
Year of Publication 10.04.2024
Patent
On Modular Training of Neural Acoustics-to-Word Model for LVCSR
Chen, Zhehuai, Liu, Qi, Li, Hao, Yu, Kai
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Get full text
Conference Proceeding
Accelerating RNN-T Training and Inference Using CTC Guidance
Wang, Yongqiang, Chen, Zhehuai, Zheng, Chengjian, Zhang, Yu, Han, Wei, Haghani, Parisa
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Get full text
Conference Proceeding
Understanding Shared Speech-Text Representations
Wang, Gary, Kastner, Kyle, Bapna, Ankur, Chen, Zhehuai, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zhang, Yu
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Get full text
Conference Proceeding
비음성 텍스트 및 스피치 합성을 사용한 스피치 인식
CHEN ZHEHUAI, ROSENBERG ANDREW, MORENO MENGIBAR PEDRO J, RAMABHADRAN BHUVANA
Year of Publication 05.01.2023
Get full text
Year of Publication 05.01.2023
Patent
Tts4pretrain 2.0: Advancing the use of Text and Speech in ASR Pretraining with Consistency and Contrastive Losses
Chen, Zhehuai, Zhang, Yu, Rosenberg, Andrew, Ramabhadran, Bhuvana, Moreno, Pedro, Wang, Gary
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Get full text
Conference Proceeding
스트리밍 시퀀스 모델에 대한 일관성 예측
CHEN ZHEHUAI, ROSENBERG ANDREW, MORENO MENGIBAR PEDRO J, RAMABHADRAN BHUVANA
Year of Publication 04.11.2022
Get full text
Year of Publication 04.11.2022
Patent
셀프 지도 스피치 사전 트레이닝에서 텍스트 삽입하기
CHEN ZHEHUAI, RAMABHADRAN BHUVANA, ROSENBERG ANDREW M, ZHANG YU, MENGIBAR PEDRO J. MORENO
Year of Publication 20.02.2024
Get full text
Year of Publication 20.02.2024
Patent
SALM: Speech-Augmented Language Model with in-Context Learning for Speech Recognition and Translation
Chen, Zhehuai, Huang, He, Andrusenko, Andrei, Hrinchuk, Oleksii, Puvvada, Krishna C., Li, Jason, Ghosh, Subhankar, Balam, Jagadeesh, Ginsburg, Boris
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Get full text
Conference Proceeding
시퀀스들에 걸쳐 대조 손실을 갖는 지도 및 비지도 트레이닝
CHEN ZHEHUAI, ROSENBERG ANDREW, RAMABHADRAN BHUVANA, EMOND JESSE, WANG YUAN, ZHANG YU
Year of Publication 13.11.2023
Get full text
Year of Publication 13.11.2023
Patent