Search Results - "Saraf, Yatharth" :: K.UTB search portal

A Multi-View Approach to Audio-Visual Speaker Verification

by Sari, Leda, Singh, Kritika, Zhou, Jiatong, Torresani, Lorenzo, Singhal, Nayan, Saraf, Yatharth
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)

Conference Proceeding

Conformer-Based Self-Supervised Learning For Non-Speech Audio Tasks

by Srivastava, Sangeeta, Wang, Yun, Tjandra, Andros, Kumar, Anurag, Liu, Chunxi, Singh, Kritika, Saraf, Yatharth
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)

Conference Proceeding

Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions

by Liu, Chunxi, Picheny, Michael, Sari, Leda, Chitkara, Pooja, Xiao, Alex, Zhang, Xiaohui, Chou, Mark, Alvarado, Andres, Hazirbas, Caner, Saraf, Yatharth
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)

Conference Proceeding

Improved Language Identification Through Cross-Lingual Self-Supervised Learning

by Tjandra, Andros, Choudhury, Diptanu Gon, Zhang, Frank, Singh, Kritika, Conneau, Alexis, Baevski, Alexei, Sela, Assaf, Saraf, Yatharth, Auli, Michael
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)

Conference Proceeding

Training ASR Models By Generation of Contextual Information

by Singh, Kritika, Okhonko, Dmytro, Liu, Jun, Wang, Yongqiang, Zhang, Frank, Girshick, Ross, Edunov, Sergey, Peng, Fuchun, Saraf, Yatharth, Zweig, Geoffrey, Mohamed, Abdelrahman
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)

Conference Proceeding

Evaluation infrastructure for testing real-time content search

by Saraf, Yatharth
Year of Publication 10.05.2022

Patent

Dual Application of Speech Enhancement for Automatic Speech Recognition

by Pandey, Ashutosh, Liu, Chunxi, Wang, Yun, Saraf, Yatharth
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19.01.2021)

Conference Proceeding

Evaluation infrastructure for testing real-time content search

by Saraf, Yatharth
Year of Publication 26.11.2019

Patent

Pushing the performances of ASR models on English and Spanish accents

by Chitkara, Pooja, Riviere, Morgane, Copet, Jade, Zhang, Frank, Saraf, Yatharth
Year of Publication 22.12.2022

Journal Article

Computing the curve-skeletons of images

by Saraf, Yatharth, Raman, Balasubramanian, Krishnan, Swaminathan
Published in International journal of computer mathematics (01.02.2008)

Journal Article

Improving RNN Transducer Based ASR with Auxiliary Tasks

by Liu, Chunxi, Zhang, Frank, Le, Duc, Kim, Suyoun, Saraf, Yatharth, Zweig, Geoffrey
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19.01.2021)

Conference Proceeding

Dual Application of Speech Enhancement for Automatic Speech Recognition

by Pandey, Ashutosh, Liu, Chunxi, Wang, Yun, Saraf, Yatharth
Year of Publication 07.11.2020

Journal Article

Kaizen: Continuously Improving Teacher Using Exponential Moving Average for Semi-Supervised Speech Recognition

by Manohar, Vimal, Likhomanenko, Tatiana, Xu, Qiantong, Hsu, Wei-Ning, Collobert, Ronan, Saraf, Yatharth, Zweig, Geoffrey, Mohamed, Abdelrahman
Published in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (13.12.2021)

Conference Proceeding

On Lattice-Free Boosted MMI Training of HMM and CTC-Based Full-Context ASR Models

by Zhang, Xiaohui, Manohar, Vimal, Zhang, David, Zhang, Frank, Shi, Yangyang, Singhal, Nayan, Chan, Julian, Peng, Fuchun, Saraf, Yatharth, Seltzer, Mike
Published in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (13.12.2021)

Conference Proceeding

Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks

by Srivastava, Sangeeta, Wang, Yun, Tjandra, Andros, Kumar, Anurag, Liu, Chunxi, Singh, Kritika, Saraf, Yatharth
Year of Publication 14.10.2021

Journal Article

Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings

by Li, Jialu, Manohar, Vimal, Chitkara, Pooja, Tjandra, Andros, Picheny, Michael, Zhang, Frank, Zhang, Xiaohui, Saraf, Yatharth
Year of Publication 07.10.2021

Journal Article

Improving Data Driven Inverse Text Normalization using Data Augmentation

by Pandey, Laxmi, Paul, Debjyoti, Chitkara, Pooja, Pang, Yutong, Zhang, Xuedong, Schubert, Kjell, Chou, Mark, Liu, Shu, Saraf, Yatharth
Year of Publication 20.07.2022

Journal Article

Benchmarking LF-MMI, CTC And RNN-T Criteria For Streaming ASR

by Zhang, Xiaohui, Zhang, Frank, Liu, Chunxi, Schubert, Kjell, Chan, Julian, Prakash, Pradyot, Liu, Jun, Yeh, Ching-Feng, Peng, Fuchun, Saraf, Yatharth, Zweig, Geoffrey
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19.01.2021)

Conference Proceeding

A Multi-View Approach To Audio-Visual Speaker Verification

by Sarı, Leda, Singh, Kritika, Zhou, Jiatong, Torresani, Lorenzo, Singhal, Nayan, Saraf, Yatharth
Year of Publication 11.02.2021

Journal Article

Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions

by Liu, Chunxi, Picheny, Michael, Sarı, Leda, Chitkara, Pooja, Xiao, Alex, Zhang, Xiaohui, Chou, Mark, Alvarado, Andres, Hazirbas, Caner, Saraf, Yatharth
Year of Publication 18.11.2021

Journal Article