The Role of Speech in Multimodal Human-Computer Interaction Towards Reliable Rejection of Non-keyword Input

Natural audio-visual interface between human user and machine requires understanding of user’s audio-visual commands. This does not necessarily require full speech and image recognition. It does require, just as the interaction with any working animal does, that the machine is capable of reacting to...

Full description

Saved in:

Bibliographic Details
Published in	Text, Speech and Dialogue pp. 2 - 8
Main Authors	Hermansky, Hynek, Fousek, Petr, Lehtonen, Mikko
Format	Book Chapter
Language	English
Published	Berlin, Heidelberg Springer Berlin Heidelberg
Series	Lecture Notes in Computer Science
Subjects	Automatic Speech Recognition Critical Band Impulse Response Rare Word Target Word
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Natural audio-visual interface between human user and machine requires understanding of user’s audio-visual commands. This does not necessarily require full speech and image recognition. It does require, just as the interaction with any working animal does, that the machine is capable of reacting to certain particular sounds and/or gestures while ignoring the rest. Towards this end, we are working on sound identification and classification approaches that would ignore most of the acoustic input and react only to a particular sound (keyword).
ISBN:	9783540287896 3540287892
ISSN:	0302-9743 1611-3349
DOI:	10.1007/11551874_2