N-HANS: A neural network-based toolkit for in-the-wild audio enhancement
Liu, Shuo, Keren, Gil, Parada-Cabaleiro, Emilia, Schuller, Björn
Published in Multimedia tools and applications (01.07.2021)
Journal Article
Improving fast-slow Encoder based Transducer with Streaming Deliberation
Li, Ke, Mahadeokar, Jay, Guo, Jinxi, Shi, Yangyang, Keren, Gil, Kalinli, Ozlem, Seltzer, Michael L., Le, Duc
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Conference Proceeding
Alignment Restricted Streaming Recurrent Neural Network Transducer
Mahadeokar, Jay, Shangguan, Yuan, Le, Duc, Keren, Gil, Su, Hang, Le, Thong, Yeh, Ching-Feng, Fuegen, Christian, Seltzer, Michael L.
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19.01.2021)
Conference Proceeding
A Time-Domain Convolutional Recurrent Network for Packet Loss Concealment
Lin, Ju, Wang, Yun, Kalgaonkar, Kaustubh, Keren, Gil, Zhang, Didi, Fuegen, Christian
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Conference Proceeding
Convolutional RNN: An enhanced model for extracting features from sequential data
Keren, Gil, Schuller, Björn
Published in 2016 International Joint Conference on Neural Networks (IJCNN) (01.07.2016)
Conference Proceeding
Deep Shallow Fusion for RNN-T Personalization
Le, Duc, Keren, Gil, Chan, Julian, Mahadeokar, Jay, Fuegen, Christian, Seltzer, Michael L.
Published in 2021 IEEE Spoken Language Technology Workshop (SLT) (19.01.2021)
Conference Proceeding
Efficient Streaming LLM for Speech Recognition
Jia, Junteng, Keren, Gil, Zhou, Wei, Lakomkin, Egor, Zhang, Xiaohui, Wu, Chunyang, Seide, Frank, Mahadeokar, Jay, Kalinli, Ozlem
Year of Publication 01.10.2024
Journal Article
End-to-end learning for dimensional emotion recognition from physiological signals
Keren, Gil, Kirschstein, Tobias, Marchi, Erik, Ringeval, Fabien, Schuller, Björn
Published in 2017 IEEE International Conference on Multimedia and Expo (ICME) (01.07.2017)
Conference Proceeding
Towards Selection of Text-to-speech Data to Augment ASR Training
Liu, Shuo, Sarı, Leda, Wu, Chunyang, Keren, Gil, Shangguan, Yuan, Mahadeokar, Jay, Kalinli, Ozlem
Year of Publication 30.05.2023
Journal Article
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses
Yang, Yufeng, Raj, Desh, Lin, Ju, Moritz, Niko, Jia, Junteng, Keren, Gil, Lakomkin, Egor, Huang, Yiteng, Donley, Jacob, Mahadeokar, Jay, Kalinli, Ozlem
Year of Publication 17.09.2024
Journal Article