TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings
Boeddeker, Christoph, Subramanian, Aswin Shanmugam, Wichern, Gordon, Haeb-Umbach, Reinhold, Le Roux, Jonathan
Published in IEEE/ACM transactions on audio, speech, and language processing (2024)
Published in IEEE/ACM transactions on audio, speech, and language processing (2024)
Get full text
Journal Article
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks
Petermann, Darius, Wichern, Gordon, Subramanian, Aswin Shanmugam, Wang, Zhong-Qiu, Roux, Jonathan Le
Published in IEEE/ACM transactions on audio, speech, and language processing (2023)
Published in IEEE/ACM transactions on audio, speech, and language processing (2023)
Get full text
Journal Article
End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming
Zhang, Wangyou, Aswin Shanmugam Subramanian, Chang, Xuankai, Watanabe, Shinji, Qian, Yanmin
Published in arXiv.org (27.10.2020)
Published in arXiv.org (27.10.2020)
Get full text
Paper
Journal Article
ESPnet-se: end-to-end speech enhancement and separation toolkit designed for asr integration
Li, Chenda, Shi, Jing, Zhang, Wangyou, Aswin Shanmugam Subramanian, Chang, Xuankai, Kamo, Naoyuki, Hira, Moto, Hayashi, Tomoki, Boeddeker, Christoph, Chen, Zhuo, Watanabe, Shinji
Published in arXiv.org (07.11.2020)
Published in arXiv.org (07.11.2020)
Get full text
Paper
Journal Article
Speech Enhancement Using End-to-End Speech Recognition Objectives
Subramanian, Aswin Shanmugam, Wang, Xiaofei, Baskar, Murali Karthick, Watanabe, Shinji, Taniguchi, Toru, Tran, Dung, Fujita, Yuya
Published in 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (01.10.2019)
Published in 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (01.10.2019)
Get full text
Conference Proceeding
Attention-Based ASR with Lightweight and Dynamic Convolutions
Fujita, Yuya, Subramanian, Aswin Shanmugam, Omachi, Motoi, Watanabe, Shinji
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Get full text
Conference Proceeding
Far-Field Location Guided Target Speech Extraction Using End-to-End Speech Recognition Objectives
Subramanian, Aswin Shanmugam, Weng, Chao, Yu, Meng, Zhang, Shi-Xiong, Xu, Yong, Watanabe, Shinji, Yu, Dong
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Get full text
Conference Proceeding
Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization
Subramanian, Aswin Shanmugam, Weng, Chao, Watanabe, Shinji, Yu, Meng, Xu, Yong, Zhang, Shi-Xiong, Yu, Dong
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Get full text
Conference Proceeding
Generalized Weighted-Prediction-Error Dereverberation with Varying Source Priors For Reverberant Speech Recognition
Taniguchi, Toru, Subramanian, Aswin Shanmugam, Wang, Xiaofei, Tran, Dung, Fujita, Yuya, Watanabe, Shinji
Published in 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (01.10.2019)
Published in 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (01.10.2019)
Get full text
Conference Proceeding
An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
Chang, Xuankai, Maekaku, Takashi, Guo, Pengcheng, Shi, Jing, Lu, Yen-Ju, Subramanian, Aswin Shanmugam, Wang, Tianzi, Yang, Shu-wen, Tsao, Yu, Lee, Hung-yi, Watanabe, Shinji
Published in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (13.12.2021)
Published in 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (13.12.2021)
Get full text
Conference Proceeding
ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration
Li, Chenda, Shi, Jing, Zhang, Wangyou, Subramanian, Aswin Shanmugam, Chang, Xuankai, Kamo, Naoyuki, Hira, Moto, Hayashi, Tomoki, Boeddeker, Christoph, Chen, Zhuo, Watanabe, Shinji
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19.01.2021)
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19.01.2021)
Get full text
Conference Proceeding
Soft Language Identification for Language-Agnostic Many-to-One End-to-End Speech Translation
Wang, Peidong, Xue, Jian, Li, Jinyu, Chen, Junkun, Subramanian, Aswin Shanmugam
Year of Publication 11.06.2024
Year of Publication 11.06.2024
Get full text
Journal Article
The 2020 ESPnet Update: New Features, Broadened Applications, Performance Improvements, and Future Plans
Watanabe, Shinji, Boyer, Florian, Chang, Xuankai, Guo, Pengcheng, Hayashi, Tomoki, Higuchi, Yosuke, Hori, Takaaki, Huang, Wen-Chin, Inaguma, Hirofumi, Kamo, Naoyuki, Karita, Shigeki, Li, Chenda, Shi, Jing, Subramanian, Aswin Shanmugam, Zhang, Wangyou
Published in 2021 IEEE Data Science and Learning Workshop (DSLW) (05.06.2021)
Published in 2021 IEEE Data Science and Learning Workshop (DSLW) (05.06.2021)
Get full text
Conference Proceeding
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings
Boeddeker, Christoph, Subramanian, Aswin Shanmugam, Wichern, Gordon, Haeb-Umbach, Reinhold, Roux, Jonathan Le
Year of Publication 07.03.2023
Year of Publication 07.03.2023
Get full text
Journal Article
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks
Petermann, Darius, Wichern, Gordon, Subramanian, Aswin Shanmugam, Wang, Zhong-Qiu, Roux, Jonathan Le
Year of Publication 14.12.2022
Year of Publication 14.12.2022
Get full text
Journal Article
Reverberation as Supervision for Speech Separation
Aralikatti, Rohith, Boeddeker, Christoph, Wichern, Gordon, Subramanian, Aswin Shanmugam, Roux, Jonathan Le
Year of Publication 15.11.2022
Year of Publication 15.11.2022
Get full text
Journal Article
System and Method for Audio Processing using Time-Invariant Speaker Embeddings
Le Roux, Jonathan, Subramanian, Aswin Shanmugam, Böddeker, Christoph, Wichern, Gordon
Year of Publication 12.09.2024
Get full text
Year of Publication 12.09.2024
Patent
Audio Source Separation using Hyperbolic Embeddings
Le Roux, Jonathan, Subramanian, Aswin Shanmugam, Petermann, Darius, Wichern, Gordon
Year of Publication 13.06.2024
Get full text
Year of Publication 13.06.2024
Patent