Leveraging Large Language Models for Exploiting ASR Uncertainty
Dighe, Pranay, Su, Yi, Zheng, Shangshang, Liu, Yunshu, Garg, Vineet, Niu, Xiaochuan, Tewfik, Ahmed
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Get full text
Conference Proceeding
Modality Drop-Out for Multimodal Device Directed Speech Detection Using Verbal and Non-Verbal Features
Krishna, Gautam, Dharur, Sameer, Rudovic, Oggi, Dighe, Pranay, Adya, Saurabh, Abdelaziz, Ahmed Hussen, Tewfik, Ahmed H
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Get full text
Conference Proceeding
Audio-to-Intent Using Acoustic-Textual Subword Representations from End-to-End ASR
Dighe, Pranay, Nayak, Prateeth, Rudovic, Oggi, Marchi, Erik, Niu, Xiaochuan, Tewfik, Ahmed
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Get full text
Conference Proceeding
Analyzing Uncertainties in Speech Recognition Using Dropout
Vyas, Apoorv, Dighe, Pranay, Tong, Sibo, Bourlard, Herve
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2019)
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2019)
Get full text
Conference Proceeding
Low-rank and sparse soft targets to learn better DNN acoustic models
Dighe, Pranay, Asaei, Afsaneh, Bourlard, Herve
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2017)
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2017)
Get full text
Conference Proceeding
Exploiting low-dimensional structures to enhance DNN based acoustic modeling in speech recognition
Dighe, Pranay, Luyet, Gil, Asaei, Afsaneh, Bourlard, Herve
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2016)
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2016)
Get full text
Conference Proceeding
Journal Article
Knowledge Transfer for Efficient on-Device False Trigger Mitigation
Dighe, Dighe, Marchi, Erik, Vishnubhotla, Srikanth, Kajarekar, Sachin, Naik, Devang
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Get full text
Conference Proceeding
Streaming on-Device Detection of Device Directed Speech from Voice and Touch-Based Invocation
Rudovic, Ognjen Oggi, Bindal, Akanksha, Garg, Vineet, Simha, Pramod, Dighe, Pranay, Kajarekar, Sachin
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Get full text
Conference Proceeding
Less Is More: A Unified Architecture for Device-Directed Speech Detection with Multiple Invocation Types
Rudovic, Oggi, Chang, Wonil, Garg, Vineet, Dighe, Pranay, Simha, Pramod, Berkowitz, Jack, Abdelaziz, Ahmed H., Kajarekar, Sachin, Marchi, Erik, Adya, Saurabh
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Get full text
Conference Proceeding
Lattice-Based Improvements for Voice Triggering Using Graph Neural Networks
Dighe, Pranay, Adya, Saurabh, Li, Nuoyu, Vishnubhotla, Srikanth, Naik, Devang, Sagar, Adithya, Ma, Ying, Pulman, Stephen, Williams, Jason
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Get full text
Conference Proceeding
Audio event detection from acoustic unit occurrence patterns
Kumar, A., Dighe, P., Singh, R., Chaudhuri, S., Raj, B.
Published in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2012)
Published in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2012)
Get full text
Conference Proceeding
Device-Directed Speech Detection for Follow-up Conversations Using Large Language Models
Ognjen, Rudovic, Dighe, Pranay, Su, Yi, Garg, Vineet, Dharur, Sameer, Niu, Xiaochuan, Abdelaziz, Ahmed H, Adya, Saurabh, Tewfik, Ahmed
Year of Publication 28.10.2024
Year of Publication 28.10.2024
Get full text
Journal Article
Modality Dropout for Multimodal Device Directed Speech Detection using Verbal and Non-Verbal Features
Krishna, Gautam, Dharur, Sameer, Rudovic, Oggi, Dighe, Pranay, Adya, Saurabh, Abdelaziz, Ahmed Hussen, Tewfik, Ahmed H
Year of Publication 23.10.2023
Year of Publication 23.10.2023
Get full text
Journal Article
Leveraging Large Language Models for Exploiting ASR Uncertainty
Dighe, Pranay, Su, Yi, Zheng, Shangshang, Liu, Yunshu, Garg, Vineet, Niu, Xiaochuan, Tewfik, Ahmed
Year of Publication 09.09.2023
Year of Publication 09.09.2023
Get full text
Journal Article
Audio-to-Intent Using Acoustic-Textual Subword Representations from End-to-End ASR
Dighe, Pranay, Nayak, Prateeth, Rudovic, Oggi, Marchi, Erik, Niu, Xiaochuan, Tewfik, Ahmed
Year of Publication 21.10.2022
Year of Publication 21.10.2022
Get full text
Journal Article
Far-Field ASR Using Low-Rank and Sparse Soft Targets from Parallel Data
Dighe, Pranay, Asaei, Afsaneh, Bourlard, Herve
Published in 2018 IEEE Spoken Language Technology Workshop (SLT) (01.12.2018)
Published in 2018 IEEE Spoken Language Technology Workshop (SLT) (01.12.2018)
Get full text
Conference Proceeding
Knowledge Transfer for Efficient On-device False Trigger Mitigation
Dighe, Pranay, Marchi, Erik, Vishnubhotla, Srikanth, Kajarekar, Sachin, Naik, Devang
Year of Publication 20.10.2020
Year of Publication 20.10.2020
Get full text
Journal Article