Accelerating RNN-T Training and Inference Using CTC guidance
Wang, Yongqiang, Chen, Zhehuai, Zheng, Chengjian, Zhang, Yu, Han, Wei, Haghani, Parisa
Year of Publication 28.10.2022
Year of Publication 28.10.2022
Get full text
Journal Article
Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR
Chen, Zhehuai, Bapna, Ankur, Rosenberg, Andrew, Zhang, Yu, Ramabhadran, Bhuvana, Moreno, Pedro, Chen, Nanxin
Year of Publication 18.10.2022
Year of Publication 18.10.2022
Get full text
Journal Article
Using Text Injection to Improve Recognition of Personal Identifiers in Speech
Blau, Yochai, Agrawal, Rohan, Madmony, Lior, Wang, Gary, Rosenberg, Andrew, Chen, Zhehuai, Gekhman, Zorik, Beryozkin, Genady, Haghani, Parisa, Ramabhadran, Bhuvana
Year of Publication 14.08.2023
Year of Publication 14.08.2023
Get full text
Journal Article
Less is More: Accurate Speech Recognition & Translation without Web-Scale Data
Puvvada, Krishna C, Żelasko, Piotr, Huang, He, Hrinchuk, Oleksii, Koluguri, Nithin Rao, Dhawan, Kunal, Majumdar, Somshubra, Rastorgueva, Elena, Chen, Zhehuai, Lavrukhin, Vitaly, Balam, Jagadeesh, Ginsburg, Boris
Year of Publication 28.06.2024
Year of Publication 28.06.2024
Get full text
Journal Article
MAESTRO: Matched Speech Text Representations through Modality Matching
Chen, Zhehuai, Zhang, Yu, Rosenberg, Andrew, Ramabhadran, Bhuvana, Moreno, Pedro, Bapna, Ankur, Zen, Heiga
Year of Publication 07.04.2022
Year of Publication 07.04.2022
Get full text
Journal Article
Unsupervised Data Selection via Discrete Speech Representation for ASR
Lu, Zhiyun, Wang, Yongqiang, Zhang, Yu, Han, Wei, Chen, Zhehuai, Haghani, Parisa
Year of Publication 05.04.2022
Year of Publication 05.04.2022
Get full text
Journal Article
High-precision Voice Search Query Correction via Retrievable Speech-text Embedings
Li, Christopher, Wang, Gary, Kastner, Kyle, Su, Heng, Chen, Allen, Rosenberg, Andrew, Chen, Zhehuai, Wu, Zelin, Velikovich, Leonid, Rondon, Pat, Caseiro, Diamantino, Aleksic, Petar
Year of Publication 08.01.2024
Year of Publication 08.01.2024
Get full text
Journal Article
JOIST: A Joint Speech and Text Streaming Model for ASR
Sainath, Tara N., Prabhavalkar, Rohit, Bapna, Ankur, Zhang, Yu, Huo, Zhouyuan, Chen, Zhehuai, Li, Bo, Wang, Weiran, Strohman, Trevor
Published in 2022 IEEE Spoken Language Technology Workshop (SLT) (09.01.2023)
Published in 2022 IEEE Spoken Language Technology Workshop (SLT) (09.01.2023)
Get full text
Conference Proceeding
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech
Saeki, Takaaki, Zen, Heiga, Chen, Zhehuai, Morioka, Nobuyuki, Wang, Gary, Zhang, Yu, Bapna, Ankur, Rosenberg, Andrew, Ramabhadran, Bhuvana
Year of Publication 27.10.2022
Year of Publication 27.10.2022
Get full text
Journal Article
JOIST: A Joint Speech and Text Streaming Model For ASR
Sainath, Tara N, Prabhavalkar, Rohit, Bapna, Ankur, Zhang, Yu, Huo, Zhouyuan, Chen, Zhehuai, Li, Bo, Wang, Weiran, Strohman, Trevor
Year of Publication 13.10.2022
Year of Publication 13.10.2022
Get full text
Journal Article
Consistency prediction on streaming sequence models
Rosenberg, Andrew, Chen, Zhehuai, Ramabhadran, Bhuvana, Moreno Mengibar, Pedro Jose
Year of Publication 12.03.2024
Get full text
Year of Publication 12.03.2024
Patent
CONFORMER-BASED SPEECH CONVERSION MODEL
CHEN, Zhehuai, RAMABHADRAN, Bhuvana, MENGIBAR, Pedro, J. Moreno, BIADSY, Fadi
Year of Publication 03.01.2024
Get full text
Year of Publication 03.01.2024
Patent
Injecting Text in Self-Supervised Speech Pretraining
Chen, Zhehuai, Zhang, Yu, Rosenberg, Andrew, Ramabhadran, Bhuvana, Wang, Gary, Moreno, Pedro
Year of Publication 27.08.2021
Year of Publication 27.08.2021
Get full text
Journal Article
An Asynchronous WFST-Based Decoder For Automatic Speech Recognition
Lv, Hang, Chen, Zhehuai, Xu, Hainan, Povey, Daniel, Xie, Lei, Khudanpur, Sanjeev
Year of Publication 16.03.2021
Year of Publication 16.03.2021
Get full text
Journal Article
Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition
Yang, Chao-Han Huck, Park, Taejin, Gong, Yuan, Li, Yuanchao, Chen, Zhehuai, Lin, Yen-Ting, Chen, Chen, Hu, Yuchen, Dhawan, Kunal, Żelasko, Piotr, Zhang, Chao, Chen, Yun-Nung, Tsao, Yu, Balam, Jagadeesh, Ginsburg, Boris, Siniscalchi, Sabato Marco, Chng, Eng Siong, Bell, Peter, Lai, Catherine, Watanabe, Shinji, Stolcke, Andreas
Year of Publication 15.09.2024
Year of Publication 15.09.2024
Get full text
Journal Article
SPEECH RECOGNITION USING UNSPOKEN TEXT AND SPEECH SYNTHESIS
CHEN, Zhehuai, RAMABHADRAN, Bhuvana, ROSENBERG, Andrew, MORENO MENGIBAR, Pedro J
Year of Publication 18.01.2023
Get full text
Year of Publication 18.01.2023
Patent