A Multi-View Approach to Audio-Visual Speaker Verification
Sari, Leda, Singh, Kritika, Zhou, Jiatong, Torresani, Lorenzo, Singhal, Nayan, Saraf, Yatharth
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Get full text
Conference Proceeding
Conformer-Based Self-Supervised Learning For Non-Speech Audio Tasks
Srivastava, Sangeeta, Wang, Yun, Tjandra, Andros, Kumar, Anurag, Liu, Chunxi, Singh, Kritika, Saraf, Yatharth
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Get full text
Conference Proceeding
Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions
Liu, Chunxi, Picheny, Michael, Sari, Leda, Chitkara, Pooja, Xiao, Alex, Zhang, Xiaohui, Chou, Mark, Alvarado, Andres, Hazirbas, Caner, Saraf, Yatharth
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Get full text
Conference Proceeding
Improved Language Identification Through Cross-Lingual Self-Supervised Learning
Tjandra, Andros, Choudhury, Diptanu Gon, Zhang, Frank, Singh, Kritika, Conneau, Alexis, Baevski, Alexei, Sela, Assaf, Saraf, Yatharth, Auli, Michael
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Get full text
Conference Proceeding
Training ASR Models By Generation of Contextual Information
Singh, Kritika, Okhonko, Dmytro, Liu, Jun, Wang, Yongqiang, Zhang, Frank, Girshick, Ross, Edunov, Sergey, Peng, Fuchun, Saraf, Yatharth, Zweig, Geoffrey, Mohamed, Abdelrahman
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Get full text
Conference Proceeding
Dual Application of Speech Enhancement for Automatic Speech Recognition
Pandey, Ashutosh, Liu, Chunxi, Wang, Yun, Saraf, Yatharth
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19.01.2021)
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19.01.2021)
Get full text
Conference Proceeding
Pushing the performances of ASR models on English and Spanish accents
Chitkara, Pooja, Riviere, Morgane, Copet, Jade, Zhang, Frank, Saraf, Yatharth
Year of Publication 22.12.2022
Year of Publication 22.12.2022
Get full text
Journal Article
Computing the curve-skeletons of images
Saraf, Yatharth, Raman, Balasubramanian, Krishnan, Swaminathan
Published in International journal of computer mathematics (01.02.2008)
Published in International journal of computer mathematics (01.02.2008)
Get full text
Journal Article
Improving RNN Transducer Based ASR with Auxiliary Tasks
Liu, Chunxi, Zhang, Frank, Le, Duc, Kim, Suyoun, Saraf, Yatharth, Zweig, Geoffrey
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19.01.2021)
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19.01.2021)
Get full text
Conference Proceeding
Kaizen: Continuously Improving Teacher Using Exponential Moving Average for Semi-Supervised Speech Recognition
Manohar, Vimal, Likhomanenko, Tatiana, Xu, Qiantong, Hsu, Wei-Ning, Collobert, Ronan, Saraf, Yatharth, Zweig, Geoffrey, Mohamed, Abdelrahman
Published in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (13.12.2021)
Published in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (13.12.2021)
Get full text
Conference Proceeding
On Lattice-Free Boosted MMI Training of HMM and CTC-Based Full-Context ASR Models
Zhang, Xiaohui, Manohar, Vimal, Zhang, David, Zhang, Frank, Shi, Yangyang, Singhal, Nayan, Chan, Julian, Peng, Fuchun, Saraf, Yatharth, Seltzer, Mike
Published in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (13.12.2021)
Published in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (13.12.2021)
Get full text
Conference Proceeding
Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks
Srivastava, Sangeeta, Wang, Yun, Tjandra, Andros, Kumar, Anurag, Liu, Chunxi, Singh, Kritika, Saraf, Yatharth
Year of Publication 14.10.2021
Year of Publication 14.10.2021
Get full text
Journal Article
Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings
Li, Jialu, Manohar, Vimal, Chitkara, Pooja, Tjandra, Andros, Picheny, Michael, Zhang, Frank, Zhang, Xiaohui, Saraf, Yatharth
Year of Publication 07.10.2021
Year of Publication 07.10.2021
Get full text
Journal Article
Improving Data Driven Inverse Text Normalization using Data Augmentation
Pandey, Laxmi, Paul, Debjyoti, Chitkara, Pooja, Pang, Yutong, Zhang, Xuedong, Schubert, Kjell, Chou, Mark, Liu, Shu, Saraf, Yatharth
Year of Publication 20.07.2022
Year of Publication 20.07.2022
Get full text
Journal Article
Benchmarking LF-MMI, CTC And RNN-T Criteria For Streaming ASR
Zhang, Xiaohui, Zhang, Frank, Liu, Chunxi, Schubert, Kjell, Chan, Julian, Prakash, Pradyot, Liu, Jun, Yeh, Ching-Feng, Peng, Fuchun, Saraf, Yatharth, Zweig, Geoffrey
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19.01.2021)
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19.01.2021)
Get full text
Conference Proceeding
A Multi-View Approach To Audio-Visual Speaker Verification
Sarı, Leda, Singh, Kritika, Zhou, Jiatong, Torresani, Lorenzo, Singhal, Nayan, Saraf, Yatharth
Year of Publication 11.02.2021
Year of Publication 11.02.2021
Get full text
Journal Article
Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions
Liu, Chunxi, Picheny, Michael, Sarı, Leda, Chitkara, Pooja, Xiao, Alex, Zhang, Xiaohui, Chou, Mark, Alvarado, Andres, Hazirbas, Caner, Saraf, Yatharth
Year of Publication 18.11.2021
Year of Publication 18.11.2021
Get full text
Journal Article