Online Automatic Speech Recognition With Listen, Attend and Spell Model
Hsiao, Roger, Can, Dogan, Ng, Tim, Travadi, Ruchir, Ghoshal, Arnab
Published in IEEE signal processing letters (2020)
Published in IEEE signal processing letters (2020)
Get full text
Journal Article
Rapid Language Identification
Van Segbroeck, Maarten, Travadi, Ruchir, Narayanan, Shrikanth S.
Published in IEEE/ACM transactions on audio, speech, and language processing (01.07.2015)
Published in IEEE/ACM transactions on audio, speech, and language processing (01.07.2015)
Get full text
Journal Article
Personalization of CTC-Based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization
Lei, Zhihong, Pusateri, Ernest, Han, Shiyi, Liu, Leo, Xu, Mingbin, Ng, Tim, Travadi, Ruchir, Zhang, Youyuan, Hannemann, Mirko, Siu, Man-Hung, Huang, Zhen
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Get full text
Conference Proceeding
Improving Semi-Supervised Classification for Low-Resource Speech Interaction Applications
Kumar, Manoj, Papadopoulos, Pavlos, Travadi, Ruchir, Bone, Daniel, Narayanan, Shrikanth
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Get full text
Conference Proceeding
Variable Attention Masking for Configurable Transformer Transducer Speech Recognition
Swietojanski, Pawel, Braun, Stefan, Can, Dogan, Da Silva, Thiago Fraga, Ghoshal, Arnab, Hori, Takaaki, Hsiao, Roger, Mason, Henry, McDermott, Erik, Silovsky, Honza, Travadi, Ruchir, Zhuang, Xiaodan
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Get full text
Conference Proceeding
Optimizing Byte-level Representation for End-to-end ASR
Hsiao, Roger, Deng, Liuhui, McDermott, Erik, Travadi, Ruchir, Zhuang, Xiaodan
Year of Publication 13.06.2024
Year of Publication 13.06.2024
Get full text
Journal Article
Optimizing Byte-level Representation for End-to-end ASR
Hsiao, Roger, Deng, Liuhui, McDermott, Erik, Travadi, Ruchir, Zhuang, Xiaodan
Published in arXiv.org (04.09.2024)
Get full text
Published in arXiv.org (04.09.2024)
Paper
Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization
Lei, Zhihong, Pusateri, Ernest, Han, Shiyi, Liu, Leo, Xu, Mingbin, Ng, Tim, Travadi, Ruchir, Zhang, Youyuan, Hannemann, Mirko, Siu, Man-Hung, Huang, Zhen
Year of Publication 15.10.2023
Year of Publication 15.10.2023
Get full text
Journal Article
A Computational Tool to Study Vocal Participation of Women in UN-ITU Meetings
Hebbar, Rajat, Somandepalli, Krishna, Peri, Raghuveer, Travadi, Ruchir, Tuplin, Tracy, Rivera, Fernando, Narayanan, Shrikanth
Published in 2021 International Conference on Content-Based Multimedia Indexing (CBMI) (28.06.2021)
Published in 2021 International Conference on Content-Based Multimedia Indexing (CBMI) (28.06.2021)
Get full text
Conference Proceeding
Variable Attention Masking for Configurable Transformer Transducer Speech Recognition
Swietojanski, Pawel, Braun, Stefan, Can, Dogan, da Silva, Thiago Fraga, Ghoshal, Arnab, Hori, Takaaki, Hsiao, Roger, Mason, Henry, McDermott, Erik, Silovsky, Honza, Travadi, Ruchir, Zhuang, Xiaodan
Year of Publication 02.11.2022
Year of Publication 02.11.2022
Get full text
Journal Article
Multimodal Representation Learning using Deep Multiset Canonical Correlation
Somandepalli, Krishna, Kumar, Naveen, Travadi, Ruchir, Narayanan, Shrikanth
Published in arXiv.org (03.04.2019)
Published in arXiv.org (03.04.2019)
Get full text
Paper
Journal Article
Online Automatic Speech Recognition with Listen, Attend and Spell Model
Hsiao, Roger, Dogan Can, Ng, Tim, Travadi, Ruchir, Ghoshal, Arnab
Published in arXiv.org (13.10.2020)
Published in arXiv.org (13.10.2020)
Get full text
Paper
Journal Article
Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization
Lei, Zhihong, Pusateri, Ernest, Han, Shiyi, Liu, Leo, Xu, Mingbin, Ng, Tim, Travadi, Ruchir, Zhang, Youyuan, Hannemann, Mirko, Man-Hung, Siu, Huang, Zhen
Published in arXiv.org (16.10.2023)
Get full text
Published in arXiv.org (16.10.2023)
Paper
Variable Attention Masking for Configurable Transformer Transducer Speech Recognition
Swietojanski, Pawel, Braun, Stefan, Dogan Can, Thiago Fraga da Silva, Ghoshal, Arnab, Hori, Takaaki, Hsiao, Roger, Mason, Henry, McDermott, Erik, Silovsky, Honza, Travadi, Ruchir, Zhuang, Xiaodan
Published in arXiv.org (18.04.2023)
Get full text
Published in arXiv.org (18.04.2023)
Paper