Frequency Domain Multi-channel Acoustic Modeling for Distant Speech Recognition
Wu, Minhua, Kumatani, Kenichi, Sundaram, Shiva, Strom, Nikko, Hoffmeister, Bjorn
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2019)
Conference Proceeding
Time-Delayed Bottleneck Highway Networks Using a DFT Feature for Keyword Spotting
Guo, Jinxi, Kumatani, Kenichi, Sun, Ming, Wu, Minhua, Raju, Anirudh, Strom, Nikko, Mandal, Arindam
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Conference Proceeding
Multi-geometry Spatial Acoustic Modeling for Distant Speech Recognition
Kumatani, Kenichi, Wu, Minhua, Sundaram, Shiva, Strom, Nikko, Hoffmeister, Bjorn
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2019)
Conference Proceeding
Max-pooling loss training of long short-term memory networks for small-footprint keyword spotting
Sun, Ming, Raju, Anirudh, Tucker, George, Panchapagesan, Sankaran, Fu, Gengshen, Mandal, Arindam, Matsoukas, Spyros, Strom, Nikko, Vitaladevuni, Shiv
Published in 2016 IEEE Spoken Language Technology Workshop (SLT) (01.12.2016)
Conference Proceeding
Direct modeling of raw audio with DNNs for wake word detection
Kumatani, Kenichi, Panchapagesan, Sankaran, Wu, Minhua, Kim, Minjae, Strom, Nikko, Tiwari, Gautam, Mandal, Arindam
Published in 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (01.12.2017)
Conference Proceeding
Sub 8-Bit Quantization of Streaming Keyword Spotting Models for Embedded Chipsets
Zeng, Lu, Parthasarathi, Sree Hari Krishnan, Liu, Yuzong, Escott, Alex, Cheekatmalla, Santosh Kumar, Strom, Nikko, Vitaladevuni, Shiv
Year of Publication 13.07.2022
Journal Article
Multi-Geometry Spatial Acoustic Modeling for Distant Speech Recognition
Kumatani, Kenichi, Wu, Minhua, Sundaram, Shiva, Strom, Nikko, Hoffmeister, Bjorn
Published in arXiv.org (28.04.2019)
Paper
Journal Article
Frequency Domain Multi-channel Acoustic Modeling for Distant Speech Recognition
Wu, Minhua, Kumatani, Kenichi, Sundaram, Shiva, Strom, Nikko, Hoffmeister, Bjorn
Published in arXiv.org (28.04.2019)
Paper
Journal Article
Deep multi-channel acoustic modeling using multiple microphone array geometries
Sundaram, Shiva, Hoffmeister, Bjorn, Wu, Minhua, Strom, Nikko, Kumatani, Kenichi
Year of Publication 07.02.2023
Patent
Data Augmentation for Robust Keyword Spotting under Playback Interference
Raju, Anirudh, Panchapagesan, Sankaran, Liu, Xing, Mandal, Arindam, Strom, Nikko
Year of Publication 01.08.2018
Journal Article
Comprehensive Evaluation of Statistical Speech Waveform Synthesis
Merritt, Thomas, Putrycz, Bartosz, Nadolski, Adam, Ye, Tianjun, Korzekwa, Daniel, Dolecki, Wiktor, Drugman, Thomas, Klimkov, Viacheslav, Moinet, Alexis, Breen, Andrew, Kuklinski, Rafal, Strom, Nikko, Barra-Chicote, Roberto
Published in 2018 IEEE Spoken Language Technology Workshop (SLT) (01.12.2018)
Conference Proceeding