Correcting phoneme recognition errors in learning word pronunciation through speech interaction


Bibliographic Details
Published in: Speech Communication, Vol. 55, no. 1, pp. 190-203
Main Authors: Zuo, Xiang; Sumii, Taisuke; Iwahashi, Naoto; Nakano, Mikio; Funakoshi, Kotaro; Oka, Natsuki
Format: Journal Article
Language: English
Published: Elsevier B.V., 01.01.2013
Summary:
► We propose a method for learning the phoneme sequences of out-of-vocabulary words.
► Users correct mis-recognized phoneme sequences through speech interaction.
► Word segments can be used for locating the mis-recognized phonemes.
► Historical information during the interaction is used to make the learning efficient.
► The method outperforms a previously proposed baseline method.

This paper presents a method called Interactive Phoneme Update (IPU) that enables users to teach systems the pronunciation (phoneme sequences) of words in the course of speech interaction. With this method, users correct mis-recognized phoneme sequences by repeatedly making correction utterances in response to the system's replies. The method has two novel features: (1) word-segment-based correction, which lets users utter word segments to locate mis-recognized phonemes, based on open-begin-end dynamic programming matching and generalized posterior probability; and (2) history-based correction, which uses the phoneme sequences that were recognized and corrected earlier in the interactive learning of each word. Experimental results show that the proposed IPU method reduces the error rate by a factor of three compared with a previously proposed maximum-likelihood-based method.
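
The summary names open-begin-end dynamic programming matching as the way a spoken word segment is located inside the phoneme sequence of the whole word. The record does not give the paper's cost function or how generalized posterior probability enters the decision, so the following is only a minimal sketch of the general technique, assuming unit edit costs and a hypothetical function name `locate_segment`. It aligns a short segment against a longer sequence while letting the match begin and end anywhere in that sequence.

```python
def locate_segment(segment, sequence):
    """Find the span of `sequence` that best matches `segment`.

    Returns (cost, start, end) so that sequence[start:end] is the matched span.
    Unit costs for substitution, insertion, and deletion are an assumption,
    not the cost model of the paper.
    """
    n = len(sequence)
    # Row 0: an empty segment matches an empty span ending anywhere at cost 0.
    # This is what makes the beginning of the match "open".
    prev_cost = [0] * (n + 1)
    prev_start = list(range(n + 1))

    for i in range(1, len(segment) + 1):
        # Column 0: all i segment symbols are unmatched.
        cur_cost = [i] + [0] * n
        cur_start = [0] * (n + 1)
        for j in range(1, n + 1):
            sub = prev_cost[j - 1] + (segment[i - 1] != sequence[j - 1])
            skip_seg = prev_cost[j] + 1      # segment symbol has no partner
            skip_seq = cur_cost[j - 1] + 1   # sequence symbol has no partner
            best = min(sub, skip_seg, skip_seq)
            cur_cost[j] = best
            # Carry the start index along the chosen path.
            if best == sub:
                cur_start[j] = prev_start[j - 1]
            elif best == skip_seg:
                cur_start[j] = prev_start[j]
            else:
                cur_start[j] = cur_start[j - 1]
        prev_cost, prev_start = cur_cost, cur_start

    # The end is "open" too: take the cheapest end position in the last row.
    end = min(range(n + 1), key=lambda j: prev_cost[j])
    return prev_cost[end], prev_start[end], end
```

For instance, `locate_segment("cde", "abcdefg")` reports a zero-cost match on the span from index 2 to index 5. The inputs can equally be lists of phoneme symbols, since only indexing and equality are used.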
ISSN: 0167-6393, 1872-7182
DOI: 10.1016/j.specom.2012.08.008