Multi-modal modeling for device-directed speech detection using acoustic and linguistic cues
Sato, Hiroshi, Shinohara, Yusuke, Ogawa, Atsunori
Published in Acoustical Science and Technology (01.01.2023)
Journal Article
Semi-supervised End-to-end Speech Recognition Using Text-to-speech and Autoencoders
Karita, Shigeki, Watanabe, Shinji, Iwata, Tomoharu, Delcroix, Marc, Ogawa, Atsunori, Nakatani, Tomohiro
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2019)
Conference Proceeding
Multi-Source Domain Generalization Using Domain Attributes for Recurrent Neural Network Language Models
Tawara, Naohiro, Ogawa, Atsunori, Iwata, Tomoharu, Ashikawa, Hiroto, Kobayashi, Tetsunori, Ogawa, Tetsuji
Published in IEICE Transactions on Information and Systems (01.01.2022)
Journal Article
Low-Latency Real-Time Meeting Recognition and Understanding Using Distant Microphones and Omni-Directional Camera
Hori, T., Araki, S., Yoshioka, T., Fujimoto, M., Watanabe, S., Oba, T., Ogawa, A., Otsuka, K., Mikami, D., Kinoshita, K., Nakatani, T., Nakamura, A., Yamato, J.
Published in IEEE Transactions on Audio, Speech, and Language Processing (01.02.2012)
Journal Article
Spatial correlation model based observation vector clustering and MVDR beamforming for meeting recognition
Araki, Shoko, Okada, Masahiro, Higuchi, Takuya, Ogawa, Atsunori, Nakatani, Tomohiro
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2016)
Conference Proceeding
Context adaptive deep neural networks for fast acoustic model adaptation in noisy conditions
Delcroix, Marc, Kinoshita, Keisuke, Yu, Chengzhu, Ogawa, Atsunori, Yoshioka, Takuya, Nakatani, Tomohiro
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2016)
Conference Proceeding
Online environmental adaptation of CNN-based acoustic models using spatial diffuseness features
Huemmer, Christian, Delcroix, Marc, Ogawa, Atsunori, Kinoshita, Keisuke, Nakatani, Tomohiro, Kellermann, Walter
Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2017)
Conference Proceeding
The NTT CHiME-3 system: Advances in speech enhancement and recognition for mobile multi-microphone devices
Yoshioka, Takuya, Ito, Nobutaka, Delcroix, Marc, Ogawa, Atsunori, Kinoshita, Keisuke, Fujimoto, Masakiyo, Yu, Chengzhu, Fabian, Wojciech J., Espi, Miquel, Higuchi, Takuya, Araki, Shoko, Nakatani, Tomohiro
Published in 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) (01.12.2015)
Conference Proceeding