Variable Attention Masking for Configurable Transformer Transducer Speech Recognition
Swietojanski, Pawel, Braun, Stefan, Can, Dogan, da Silva, Thiago Fraga, Ghoshal, Arnab, Hori, Takaaki, Hsiao, Roger, Mason, Henry, McDermott, Erik, Silovsky, Honza, Travadi, Ruchir, Zhuang, Xiaodan
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Conference Proceeding
An investigation into language model data augmentation for low-resourced STT and KWS
Huang, Guangpu, da Silva, Thiago Fraga, Lamel, Lori, Gauvain, Jean-Luc, Gorin, Arseniy, Laurent, Antoine, Lileikyte, Rasa, Messouadi, Abdel
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2017)
Conference Proceeding
Variable Attention Masking for Configurable Transformer Transducer Speech Recognition
Swietojanski, Pawel, Braun, Stefan, Can, Dogan, da Silva, Thiago Fraga, Ghoshal, Arnab, Hori, Takaaki, Hsiao, Roger, Mason, Henry, McDermott, Erik, Silovsky, Honza, Travadi, Ruchir, Zhuang, Xiaodan
Year of Publication 02.11.2022
Journal Article
Optimizing Bilingual Neural Transducer with Synthetic Code-switching Text Generation
Nguyen, Thien, Tran, Nathalie, Deng, Liuhui, da Silva, Thiago Fraga, Radzihovsky, Matthew, Hsiao, Roger, Mason, Henry, Braun, Stefan, McDermott, Erik, Can, Dogan, Swietojanski, Pawel, Verwimp, Lyan, Oyman, Sibel, Arvizo, Tresi, Silovsky, Honza, Ghoshal, Arnab, Martel, Mathieu, Ambati, Bharat Ram, Ali, Mohamed
Year of Publication 21.10.2022
Journal Article
Variable Attention Masking for Configurable Transformer Transducer Speech Recognition
Swietojanski, Pawel, Braun, Stefan, Can, Dogan, da Silva, Thiago Fraga, Ghoshal, Arnab, Hori, Takaaki, Hsiao, Roger, Mason, Henry, McDermott, Erik, Silovsky, Honza, Travadi, Ruchir, Zhuang, Xiaodan
Published in arXiv.org (18.04.2023)
Paper
Optimizing Bilingual Neural Transducer with Synthetic Code-switching Text Generation
Nguyen, Thien, Tran, Nathalie, Deng, Liuhui, da Silva, Thiago Fraga, Radzihovsky, Matthew, Hsiao, Roger, Mason, Henry, Braun, Stefan, McDermott, Erik, Can, Dogan, Swietojanski, Pawel, Verwimp, Lyan, Oyman, Sibel, Arvizo, Tresi, Silovsky, Honza, Ghoshal, Arnab, Martel, Mathieu, Ambati, Bharat Ram, Ali, Mohamed
Published in arXiv.org (21.10.2022)
Paper
SPEECH RECOGNITION FOR MULTIPLE USERS USING SPEECH PROFILE COMBINATION
FRAGA DA SILVA, Thiago, DENG, Yaqiao, JEON, Woojay, LIU, Leo, YOUNG, Mary K, KRISHNAMOORTHY, Mahesh
Year of Publication 30.11.2023
Patent