SPEECH RECOGNITION USING UNSPOKEN TEXT AND SPEECH SYNTHESIS
PEDRO J MORENO MENGIBAR, CHEN ZHEHUAI, BHUVANA RAMABHADRAN, ANDREW ROSENBERG
Year of Publication 10.04.2024
Patent
Accelerating RNN-T Training and Inference Using CTC Guidance
Wang, Yongqiang, Chen, Zhehuai, Zheng, Chengjian, Zhang, Yu, Han, Wei, Haghani, Parisa
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Conference Proceeding
Understanding Shared Speech-Text Representations
Wang, Gary, Kastner, Kyle, Bapna, Ankur, Chen, Zhehuai, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zhang, Yu
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Conference Proceeding
SPEECH RECOGNITION USING UNSPOKEN TEXT AND SPEECH SYNTHESIS
CHEN ZHEHUAI, ROSENBERG ANDREW, MORENO MENGIBAR PEDRO J, RAMABHADRAN BHUVANA
Year of Publication 05.01.2023
Patent
CONSISTENCY PREDICTION FOR STREAMING SEQUENCE MODELS
CHEN ZHEHUAI, ROSENBERG ANDREW, MORENO MENGIBAR PEDRO J, RAMABHADRAN BHUVANA
Year of Publication 04.11.2022
Patent
INJECTING TEXT IN SELF-SUPERVISED SPEECH PRE-TRAINING
CHEN ZHEHUAI, RAMABHADRAN BHUVANA, ROSENBERG ANDREW M, ZHANG YU, MENGIBAR PEDRO J. MORENO
Year of Publication 20.02.2024
Patent
SALM: Speech-Augmented Language Model with in-Context Learning for Speech Recognition and Translation
Chen, Zhehuai, Huang, He, Andrusenko, Andrei, Hrinchuk, Oleksii, Puvvada, Krishna C., Li, Jason, Ghosh, Subhankar, Balam, Jagadeesh, Ginsburg, Boris
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Conference Proceeding
SUPERVISED AND UNSUPERVISED TRAINING WITH CONTRASTIVE LOSS ACROSS SEQUENCES
CHEN ZHEHUAI, ROSENBERG ANDREW, RAMABHADRAN BHUVANA, EMOND JESSE, WANG YUAN, ZHANG YU
Year of Publication 13.11.2023
Patent
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-to-Speech
Saeki, Takaaki, Zen, Heiga, Chen, Zhehuai, Morioka, Nobuyuki, Wang, Gary, Zhang, Yu, Bapna, Ankur, Rosenberg, Andrew, Ramabhadran, Bhuvana
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Conference Proceeding
IMPROVING CROSS-LINGUAL SPEECH SYNTHESIS USING SPEECH RECOGNITION
CHEN ZHEHUAI, ROSENBERG ANDREW, RAMABHADRAN BHUVANA, ZHANG YU, MENGIBAR PEDRO J. MORENO
Year of Publication 19.06.2023
Patent
Instruction Data Generation and Unsupervised Adaptation for Speech Language Models
Noroozi, Vahid, Chen, Zhehuai, Majumdar, Somshubra, Huang, Steve, Balam, Jagadeesh, Ginsburg, Boris
Year of Publication 18.06.2024
Journal Article
EMMeTT: Efficient Multimodal Machine Translation Training
Żelasko, Piotr, Chen, Zhehuai, Wang, Mengru, Galvez, Daniel, Hrinchuk, Oleksii, Ding, Shuoyang, Hu, Ke, Balam, Jagadeesh, Lavrukhin, Vitaly, Ginsburg, Boris
Year of Publication 20.09.2024
Journal Article
Chain-of-Thought Prompting for Speech Translation
Hu, Ke, Chen, Zhehuai, Yang, Chao-Han Huck, Żelasko, Piotr, Hrinchuk, Oleksii, Lavrukhin, Vitaly, Balam, Jagadeesh, Ginsburg, Boris
Year of Publication 17.09.2024
Journal Article
BESTOW: Efficient and Streamable Speech Language Model with the Best of Two Worlds in GPT and T5
Chen, Zhehuai, Huang, He, Hrinchuk, Oleksii, Puvvada, Krishna C, Koluguri, Nithin Rao, Żelasko, Piotr, Balam, Jagadeesh, Ginsburg, Boris
Year of Publication 28.06.2024
Journal Article
DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment
Lu, Ke-Han, Chen, Zhehuai, Fu, Szu-Wei, Huang, He, Ginsburg, Boris, Wang, Yu-Chiang Frank, Lee, Hung-yi
Year of Publication 26.06.2024
Journal Article
Understanding Shared Speech-Text Representations
Wang, Gary, Kastner, Kyle, Bapna, Ankur, Chen, Zhehuai, Rosenberg, Andrew, Ramabhadran, Bhuvana, Zhang, Yu
Year of Publication 27.04.2023
Journal Article