DuAGNet: an unrestricted multimodal speech recognition framework using dual adaptive gating fusion

Speech recognition is a major communication channel for human-machine interaction with outstanding breakthroughs. However, the practicality of single-modal speech recognition is not satisfactory in high-noise or silent communication applications. Integrating multiple modalities can effectively addre...

Full description

Saved in:
Bibliographic Details
Published inApplied intelligence (Dordrecht, Netherlands) Vol. 55; no. 3; p. 224
Main Authors Wu, Jinghan, Zhang, Yakun, Zhang, Meishan, Zheng, Changyan, Zhang, Xingyu, Xie, Liang, An, Xingwei, Yin, Erwei
Format Journal Article
LanguageEnglish
Published Boston Springer Nature B.V 01.02.2025
Subjects
Online AccessGet full text

Cover

Loading…