Search Results - "Chen, Zhehuai" :: K.UTB vyhledávací portál

Accelerating RNN-T Training and Inference Using CTC guidance

by Wang, Yongqiang, Chen, Zhehuai, Zheng, Chengjian, Zhang, Yu, Han, Wei, Haghani, Parisa
Year of Publication 28.10.2022

Get full text

Journal Article

Loading…

Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR

by Chen, Zhehuai, Bapna, Ankur, Rosenberg, Andrew, Zhang, Yu, Ramabhadran, Bhuvana, Moreno, Pedro, Chen, Nanxin
Year of Publication 18.10.2022

Get full text

Journal Article

Loading…

Using Text Injection to Improve Recognition of Personal Identifiers in Speech

by Blau, Yochai, Agrawal, Rohan, Madmony, Lior, Wang, Gary, Rosenberg, Andrew, Chen, Zhehuai, Gekhman, Zorik, Beryozkin, Genady, Haghani, Parisa, Ramabhadran, Bhuvana
Year of Publication 14.08.2023

Get full text

Journal Article

Loading…

Less is More: Accurate Speech Recognition & Translation without Web-Scale Data

by Puvvada, Krishna C, Żelasko, Piotr, Huang, He, Hrinchuk, Oleksii, Koluguri, Nithin Rao, Dhawan, Kunal, Majumdar, Somshubra, Rastorgueva, Elena, Chen, Zhehuai, Lavrukhin, Vitaly, Balam, Jagadeesh, Ginsburg, Boris
Year of Publication 28.06.2024

Get full text

Journal Article

Loading…

MAESTRO: Matched Speech Text Representations through Modality Matching

by Chen, Zhehuai, Zhang, Yu, Rosenberg, Andrew, Ramabhadran, Bhuvana, Moreno, Pedro, Bapna, Ankur, Zen, Heiga
Year of Publication 07.04.2022

Get full text

Journal Article

Loading…

Unsupervised Data Selection via Discrete Speech Representation for ASR

by Lu, Zhiyun, Wang, Yongqiang, Zhang, Yu, Han, Wei, Chen, Zhehuai, Haghani, Parisa
Year of Publication 05.04.2022

Get full text

Journal Article

Loading…

High-precision Voice Search Query Correction via Retrievable Speech-text Embedings

by Li, Christopher, Wang, Gary, Kastner, Kyle, Su, Heng, Chen, Allen, Rosenberg, Andrew, Chen, Zhehuai, Wu, Zelin, Velikovich, Leonid, Rondon, Pat, Caseiro, Diamantino, Aleksic, Petar
Year of Publication 08.01.2024

Get full text

Journal Article

Loading…

JOIST: A Joint Speech and Text Streaming Model for ASR

by Sainath, Tara N., Prabhavalkar, Rohit, Bapna, Ankur, Zhang, Yu, Huo, Zhouyuan, Chen, Zhehuai, Li, Bo, Wang, Weiran, Strohman, Trevor
Published in 2022 IEEE Spoken Language Technology Workshop (SLT) (09.01.2023)

Get full text

Conference Proceeding

Loading…

Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition

by Xu, Hainan, Chen, Zhehuai, Jia, Fei, Ginsburg, Boris
Published in arXiv.org (04.04.2024)

Get full text

Paper

Loading…

Sequence Discriminative Training for Deep Learning based Acoustic Keyword Spotting

by Chen, Zhehuai, Qian, Yanmin, Yu, Kai
Published in arXiv.org (02.08.2018)

Get full text

Paper Journal Article

Loading…

Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech

by Saeki, Takaaki, Zen, Heiga, Chen, Zhehuai, Morioka, Nobuyuki, Wang, Gary, Zhang, Yu, Bapna, Ankur, Rosenberg, Andrew, Ramabhadran, Bhuvana
Year of Publication 27.10.2022

Get full text

Journal Article

Loading…

JOIST: A Joint Speech and Text Streaming Model For ASR

by Sainath, Tara N, Prabhavalkar, Rohit, Bapna, Ankur, Zhang, Yu, Huo, Zhouyuan, Chen, Zhehuai, Li, Bo, Wang, Weiran, Strohman, Trevor
Year of Publication 13.10.2022

Get full text

Journal Article

Loading…

Consistency prediction on streaming sequence models

by Rosenberg, Andrew, Chen, Zhehuai, Ramabhadran, Bhuvana, Moreno Mengibar, Pedro Jose
Year of Publication 12.03.2024

Get full text

Patent

Loading…

Using Aligned Text and Speech Representations to Train Automatic Speech Recognition Models without Transcribed Speech Data

by Rosenberg, Andrew, Bapna, Ankur, Chen, Zhehuai, Zhang, Yu, Ramabhadran, Bhuvana
Year of Publication 25.01.2024

Get full text

Patent

Loading…

USING ALIGNED TEXT AND SPEECH REPRESENTATIONS TO TRAIN AUTOMATIC SPEECH RECOGNITION MODELS WITHOUT TRANSCRIBED SPEECH DATA

by CHEN, Zhehuai, ZHANG, Yu, RAMABHADRAN, Bhuvana, ROSENBERG, Andrew, BAPNA, Ankur
Year of Publication 25.01.2024

Get full text

Patent

Loading…

CONFORMER-BASED SPEECH CONVERSION MODEL

by CHEN, Zhehuai, RAMABHADRAN, Bhuvana, MENGIBAR, Pedro, J. Moreno, BIADSY, Fadi
Year of Publication 03.01.2024

Get full text

Patent

Loading…

Injecting Text in Self-Supervised Speech Pretraining

by Chen, Zhehuai, Zhang, Yu, Rosenberg, Andrew, Ramabhadran, Bhuvana, Wang, Gary, Moreno, Pedro
Year of Publication 27.08.2021

Get full text

Journal Article

Loading…

An Asynchronous WFST-Based Decoder For Automatic Speech Recognition

by Lv, Hang, Chen, Zhehuai, Xu, Hainan, Povey, Daniel, Xie, Lei, Khudanpur, Sanjeev
Year of Publication 16.03.2021

Get full text

Journal Article

Loading…

Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition

by Yang, Chao-Han Huck, Park, Taejin, Gong, Yuan, Li, Yuanchao, Chen, Zhehuai, Lin, Yen-Ting, Chen, Chen, Hu, Yuchen, Dhawan, Kunal, Żelasko, Piotr, Zhang, Chao, Chen, Yun-Nung, Tsao, Yu, Balam, Jagadeesh, Ginsburg, Boris, Siniscalchi, Sabato Marco, Chng, Eng Siong, Bell, Peter, Lai, Catherine, Watanabe, Shinji, Stolcke, Andreas
Year of Publication 15.09.2024

Get full text

Journal Article

Loading…

SPEECH RECOGNITION USING UNSPOKEN TEXT AND SPEECH SYNTHESIS

by CHEN, Zhehuai, RAMABHADRAN, Bhuvana, ROSENBERG, Andrew, MORENO MENGIBAR, Pedro J
Year of Publication 18.01.2023

Get full text

Patent

Refine Results

Format

Subject Area

Topic

Language

Year of Publication

Database