Multi-band automatic speech recognition

This paper presents a new architecture for automatic speech recognition systems which is characterized by the division of the spectral domain of the speech signal into several independent frequency bands. This model is based on the psycho-acoustic work of Fletcher (1953) who proposed a similar princ...

Full description

Saved in:

Bibliographic Details
Published in	Computer speech & language Vol. 15; no. 2; pp. 151 - 174
Main Authors	Cerisara, Christophe, Fohr, Dominique
Format	Journal Article
Language	English
Published	Oxford Elsevier Ltd 01.04.2001 Elsevier
Subjects	Applied linguistics Computational linguistics Computer Science Linguistics Other Psychoacoustics Applied linguistics Speech recognition Computational linguistics Algorithm Speech processing System reconnaissance de la parole speech recognition multi-band multi-bandes
Online Access	Get full text

Cover

Loading…

More Information
Summary:	This paper presents a new architecture for automatic speech recognition systems which is characterized by the division of the spectral domain of the speech signal into several independent frequency bands. This model is based on the psycho-acoustic work of Fletcher (1953) who proposed a similar principle for the human auditory system. Jont B. Allen published a paper in 1994 in which he summarized the work of Fletcher and also proposed to adapt the multi-band paradigm to automatic speech recognition (ASR) (Allen, 1994). Many researchers have then studied this principle and built such ASR systems. The goal of this paper is to analyse some of the most important issues in the design of a multi-band ASR system in order to determine which architecture it should have in which environment. Two other major problems are then considered: how to train multi-band systems and how to use them for continuous ASR.
Bibliography:	ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 ObjectType-Article-1 ObjectType-Feature-2
ISSN:	0885-2308 1095-8363
DOI:	10.1006/csla.2001.0163