An End-to-End Neural Network for Polyphonic Piano Music Transcription
Sigtia, Siddharth, Benetos, Emmanouil, Dixon, Simon
Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.05.2016)
Journal Article
Improved music feature learning with deep neural networks
Sigtia, Siddharth, Dixon, Simon
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2014)
Conference Proceeding
Multi-Task Learning for Speaker Verification and Voice Trigger Detection
Sigtia, Siddharth, Marchi, Erik, Kajarekar, Sachin, Naik, Devang, Bridle, John
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Conference Proceeding
Multi-Task Learning for Voice Trigger Detection
Sigtia, Siddharth, Clark, Pascal, Haynes, Rob, Richards, Hywel, Bridle, John
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Conference Proceeding
Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging
Xu, Yong, Huang, Qiang, Wang, Wenwu, Foster, Peter, Sigtia, Siddharth, Jackson, Philip J. B., Plumbley, Mark D.
Published in IEEE/ACM Transactions on Audio, Speech, and Language Processing (01.06.2017)
Journal Article
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models
Wagner, Dominik, Churchill, Alexander, Sigtia, Siddharth, Georgiou, Panayiotis, Mirsamadi, Matt, Mishra, Aarshee, Marchi, Erik
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Conference Proceeding
Progressive Voice Trigger Detection: Accuracy vs Latency
Sigtia, Siddharth, Bridle, John, Richards, Hywel, Clark, Pascal, Marchi, Erik, Garg, Vineet
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Conference Proceeding
Generalised Discriminative Transform via Curriculum Learning for Speaker Recognition
Marchi, Erik, Shum, Stephen, Hwang, Kyuyeon, Kajarekar, Sachin, Sigtia, Siddharth, Richards, Hywel, Haynes, Rob, Kim, Yoon, Bridle, John
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Conference Proceeding
A hybrid recurrent neural network for music transcription
Sigtia, Siddharth, Benetos, Emmanouil, Boulanger-Lewandowski, Nicolas, Weyde, Tillman, d'Avila Garcez, Artur S., Dixon, Simon
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2015)
Conference Proceeding
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models
Wagner, Dominik, Churchill, Alexander, Sigtia, Siddharth, Georgiou, Panayiotis, Mirsamadi, Matt, Mishra, Aarshee, Marchi, Erik
Published in arXiv.org (26.03.2024)
Paper
Journal Article
Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models
Wagner, Dominik, Churchill, Alexander, Sigtia, Siddharth, Georgiou, Panayiotis, Mirsamadi, Matt, Mishra, Aarshee, Marchi, Erik
Year of Publication 06.12.2023
Journal Article
Chime-home: A dataset for sound source recognition in a domestic environment
Foster, Peter, Sigtia, Siddharth, Krstulovic, Sacha, Barker, Jon, Plumbley, Mark D.
Published in 2015 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (01.10.2015)
Conference Proceeding
Hybrid Transformer/CTC Networks for Hardware Efficient Voice Triggering
Adya, Saurabh, Garg, Vineet, Sigtia, Siddharth, Simha, Pramod, Dhir, Chandra
Year of Publication 05.08.2020
Journal Article
Multi-task Learning for Voice Trigger Detection
Sigtia, Siddharth, Clark, Pascal, Haynes, Rob, Richards, Hywel, Bridle, John
Published in arXiv.org (20.04.2020)
Paper
Journal Article
Multi-task Learning for Speaker Verification and Voice Trigger Detection
Sigtia, Siddharth, Marchi, Erik, Kajarekar, Sachin, Naik, Devang, Bridle, John
Published in arXiv.org (26.01.2020)
Paper
Journal Article
Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
Garg, Vineet, Chang, Wonil, Sigtia, Siddharth, Adya, Saurabh, Simha, Pramod, Dighe, Pranay, Dhir, Chandra
Year of Publication 13.05.2021
Journal Article
Progressive Voice Trigger Detection: Accuracy vs Latency
Sigtia, Siddharth, Bridle, John, Richards, Hywel, Clark, Pascal, Marchi, Erik, Garg, Vineet
Year of Publication 29.10.2020
Journal Article
Improving Voice Trigger Detection with Metric Learning
Nayak, Prateeth, Higuchi, Takuya, Gupta, Anmol, Ranjan, Shivesh, Shum, Stephen, Sigtia, Siddharth, Marchi, Erik, Lakshminarasimhan, Varun, Cho, Minsik, Adya, Saurabh, Dhir, Chandra, Tewfik, Ahmed
Year of Publication 05.04.2022
Journal Article