Sequence-Based Multi-Lingual Low Resource Speech Recognition
Dalmia, Siddharth, Sanabria, Ramon, Metze, Florian, Black, Alan W.
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Conference Proceeding
Universal Phone Recognition with a Multilingual Allophone System
Li, Xinjian, Dalmia, Siddharth, Li, Juncheng, Lee, Matthew, Littell, Patrick, Yao, Jiali, Anastasopoulos, Antonios, Mortensen, David R., Neubig, Graham, Black, Alan W, Metze, Florian
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Conference Proceeding
ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet
Arora, Siddhant, Dalmia, Siddharth, Denisov, Pavel, Chang, Xuankai, Ueda, Yushi, Peng, Yifan, Zhang, Yuekai, Kumar, Sujay, Ganesan, Karthik, Yan, Brian, Vu, Ngoc Thang, Black, Alan W, Watanabe, Shinji
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Conference Proceeding
Align, Write, Re-Order: Explainable End-to-End Speech Translation via Operation Sequence Generation
Omachi, Motoi, Yan, Brian, Dalmia, Siddharth, Fujita, Yuya, Watanabe, Shinji
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Conference Proceeding
LegoNN: Building Modular Encoder-Decoder Models
Dalmia, Siddharth, Okhonko, Dmytro, Lewis, Mike, Edunov, Sergey, Watanabe, Shinji, Metze, Florian, Zettlemoyer, Luke, Mohamed, Abdelrahman
Published in IEEE/ACM transactions on audio, speech, and language processing (01.01.2023)
Journal Article
Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
Yan, Brian, Zhang, Chunlei, Yu, Meng, Zhang, Shi-Xiong, Dalmia, Siddharth, Berrebbi, Dan, Weng, Chao, Watanabe, Shinji, Yu, Dong
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Conference Proceeding
Multimodal Modeling for Spoken Language Identification
Bharadwaj, Shikhar, Ma, Min, Vashishth, Shikhar, Bapna, Ankur, Ganapathy, Sriram, Axelrod, Vera, Dalmia, Siddharth, Han, Wei, Zhang, Yu, Van Esch, Daan, Ritchie, Sandy, Talukdar, Partha, Riesa, Jason
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Conference Proceeding
FLEURS: Few-Shot Learning Evaluation of Universal Representations of Speech
Conneau, Alexis, Ma, Min, Khanuja, Simran, Zhang, Yu, Axelrod, Vera, Dalmia, Siddharth, Riesa, Jason, Rivera, Clara, Bapna, Ankur
Published in 2022 IEEE Spoken Language Technology Workshop (SLT) (09.01.2023)
Conference Proceeding
Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates
Inaguma, Hirofumi, Dalmia, Siddharth, Yan, Brian, Watanabe, Shinji
Published in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (13.12.2021)
Conference Proceeding
Align, Write, Re-order: Explainable End-to-End Speech Translation via Operation Sequence Generation
Omachi, Motoi, Yan, Brian, Dalmia, Siddharth, Fujita, Yuya, Watanabe, Shinji
Year of Publication 10.11.2022
Journal Article
Transforming LLMs into Cross-modal and Cross-lingual Retrieval Systems
Gomez, Frank Palma, Sanabria, Ramon, Sung, Yun-hsuan, Cer, Daniel, Dalmia, Siddharth, Abrego, Gustavo Hernandez
Year of Publication 01.04.2024
Journal Article
A Study on the Integration of Pre-Trained SSL, ASR, LM and SLU Models for Spoken Language Understanding
Peng, Yifan, Arora, Siddhant, Higuchi, Yosuke, Ueda, Yushi, Kumar, Sujay, Ganesan, Karthik, Dalmia, Siddharth, Chang, Xuankai, Watanabe, Shinji
Published in 2022 IEEE Spoken Language Technology Workshop (SLT) (09.01.2023)
Conference Proceeding
Domain Robust Feature Extraction for Rapid Low Resource ASR Development
Dalmia, Siddharth, Li, Xinjian, Metze, Florian, Black, Alan W.
Published in 2018 IEEE Spoken Language Technology Workshop (SLT) (01.12.2018)
Conference Proceeding
LLM Augmented LLMs: Expanding Capabilities through Composition
Bansal, Rachit, Samanta, Bidisha, Dalmia, Siddharth, Gupta, Nitish, Vashishth, Shikhar, Ganapathy, Sriram, Bapna, Abhishek, Jain, Prateek, Talukdar, Partha
Year of Publication 04.01.2024
Journal Article
Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks
Dalmia, Siddharth, Yan, Brian, Raunak, Vikas, Metze, Florian, Watanabe, Shinji
Year of Publication 02.05.2021
Journal Article