Semi-supervised End-to-end Speech Recognition Using Text-to-speech and Autoencoders
Karita, Shigeki, Watanabe, Shinji, Iwata, Tomoharu, Delcroix, Marc, Ogawa, Atsunori, Nakatani, Tomohiro
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2019)
Conference Proceeding
DF-Conformer: Integrated Architecture of Conv-Tasnet and Conformer Using Linear Complexity Self-Attention for Speech Enhancement
Koizumi, Yuma, Karita, Shigeki, Wisdom, Scott, Erdogan, Hakan, Hershey, John R., Jones, Llion, Bacchiani, Michiel
Published in 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (01.01.2021)
Conference Proceeding
A Comparative Study on Transformer vs RNN in Speech Applications
Karita, Shigeki, Chen, Nanxin, Hayashi, Tomoki, Hori, Takaaki, Inaguma, Hirofumi, Jiang, Ziyan, Someki, Masao, Soplin, Nelson Enrique Yalta, Yamamoto, Ryuichi, Wang, Xiaofei, Watanabe, Shinji, Yoshimura, Takenori, Zhang, Wangyou
Published in 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (01.12.2019)
Conference Proceeding
Frame-by-Frame Closed-Form Update for Mask-Based Adaptive MVDR Beamforming
Higuchi, Takuya, Kinoshita, Keisuke, Ito, Nobutaka, Karita, Shigeki, Nakatani, Tomohiro
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Conference Proceeding
Knowledge Transfer from Large-Scale Pretrained Language Models to End-To-End Speech Recognizers
Kubo, Yotaro, Karita, Shigeki, Bacchiani, Michiel
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Conference Proceeding
Rescoring N-Best Speech Recognition List Based on One-on-One Hypothesis Comparison Using Encoder-Classifier Model
Ogawa, Atsunori, Delcroix, Marc, Karita, Shigeki, Nakatani, Tomohiro
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Conference Proceeding
SEQUENCE TRAINING OF ENCODER-DECODER MODEL USING POLICY GRADIENT FOR END-TO-END SPEECH RECOGNITION
Karita, Shigeki, Ogawa, Atsunori, Delcroix, Marc, Nakatani, Tomohiro
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Conference Proceeding
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations
Koizumi, Yuma, Zen, Heiga, Karita, Shigeki, Ding, Yifan, Yatabe, Kohei, Morioka, Nobuyuki, Zhang, Yu, Han, Wei, Bapna, Ankur, Bacchiani, Michiel
Published in 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) (22.10.2023)
Conference Proceeding
Owner authentication for mobile devices using motion gestures based on multi-owner template update
Karita, Shigeki, Nakamura, Kumi, Kono, Kazuhiro, Ito, Yoshimichi, Babaguchi, Noboru
Published in 2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW) (01.06.2015)
Conference Proceeding
LEARNING DEVICE, LEARNING METHOD, AND LEARNING PROGRAM
DELCROIX MARC, WATABE SHINJI, OGAWA ATSUNORI, KARITA SHIGEKI, IWATA TOMOHARU
Year of Publication 11.03.2021
Patent
LEARNING DEVICE, LEARNING METHOD, AND LEARNING PROGRAM
DELCROIX MARC, WATABE SHINJI, OGAWA ATSUNORI, KARITA SHIGEKI, IWATA TOMOHARU
Year of Publication 11.03.2021
Patent
LEARNING DEVICE, LEARNING METHOD, AND LEARNING PROGRAM
DELCROIX MARC, WATABE SHINJI, OGAWA ATSUNORI, KARITA SHIGEKI, IWATA TOMOHARU
Year of Publication 11.03.2021
Patent
SPEECH RECOGNITION DEVICE, SPEECH RECOGNITION METHOD AND SPEECH RECOGNITION PROGRAM
DELCROIX MARC, WATABE SHINJI, OGAWA ATSUNORI, NAKATANI TOMOHIRO, KARITA SHIGEKI
Year of Publication 11.03.2021
Patent