Chain-of-Thought Prompting for Speech Translation
Hu, Ke, Chen, Zhehuai, Chao-Han, Huck Yang, Żelasko, Piotr, Hrinchuk, Oleksii, Lavrukhin, Vitaly, Balam, Jagadeesh, Ginsburg, Boris
Published in arXiv.org (17.09.2024)
Get full text
Published in arXiv.org (17.09.2024)
Paper
Understanding Shared Speech-Text Representations
Wang, Gary, Kastner, Kyle, Bapna, Ankur, Chen, Zhehuai, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zhang, Yu
Published in arXiv.org (27.04.2023)
Get full text
Published in arXiv.org (27.04.2023)
Paper
SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation
Chen, Zhehuai, Huang, He, Andrusenko, Andrei, Hrinchuk, Oleksii, Puvvada, Krishna C, Li, Jason, Ghosh, Subhankar, Balam, Jagadeesh, Ginsburg, Boris
Published in arXiv.org (13.10.2023)
Get full text
Published in arXiv.org (13.10.2023)
Paper
Accelerating RNN-T Training and Inference Using CTC guidance
Wang, Yongqiang, Chen, Zhehuai, Zheng, Chengjian, Zhang, Yu, Han, Wei, Haghani, Parisa
Published in arXiv.org (29.10.2022)
Get full text
Published in arXiv.org (29.10.2022)
Paper
Less is More: Accurate Speech Recognition & Translation without Web-Scale Data
Puvvada, Krishna C, Żelasko, Piotr, Huang, He, Hrinchuk, Oleksii, Nithin Rao Koluguri, Dhawan, Kunal, Majumdar, Somshubra, Rastorgueva, Elena, Chen, Zhehuai, Lavrukhin, Vitaly, Balam, Jagadeesh, Ginsburg, Boris
Published in arXiv.org (28.06.2024)
Get full text
Published in arXiv.org (28.06.2024)
Paper
Unsupervised Data Selection via Discrete Speech Representation for ASR
Lu, Zhiyun, Wang, Yongqiang, Zhang, Yu, Han, Wei, Chen, Zhehuai, Haghani, Parisa
Published in arXiv.org (05.04.2022)
Get full text
Published in arXiv.org (05.04.2022)
Paper