Formant tracking based on phoneme information

Bibliographic Details
Main Authors: LEE MINKYU, OLIVE JOSEPH PHILIP, MOEBIUS BERND, VAN SANTEN JAN PIETER
Format: Patent
Language: English
Published: 09.09.2003
Edition: 7
Summary: A method and system for selecting formant trajectories based on input speech and corresponding text data. The input speech is analyzed to obtain formant candidates for each time frame. The text data corresponding to the input speech is converted into a sequence of phonemes, which are then time-aligned so that each phoneme is labeled with its corresponding segment of the input speech. Nominal formant frequencies are assigned to the center timing point of each phoneme, and target formant trajectories are generated for each time frame by interpolating the nominal formant frequencies between adjacent phonemes. For each time frame, at least one formant candidate closest to the corresponding target formant trajectories is selected according to a minimum cost factor. The selected formant candidates are output for storage or further processing in subsequent speech applications.
Bibliography: Application Number: US19990386037
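
The steps in the summary (nominal formants at phoneme centers, interpolated per-frame targets, minimum-cost candidate selection) can be sketched roughly as follows. This is an illustration only, not the patented implementation. The nominal formant table, the function names `target_trajectories` and `select_candidates`, the use of linear interpolation, and the squared-distance cost are all assumptions; the record does not specify the interpolation scheme or the cost factor.

```python
from bisect import bisect_right

# Hypothetical nominal F1-F3 values in Hz, for illustration only.
NOMINAL_FORMANTS = {
    "iy": (270.0, 2290.0, 3010.0),
    "aa": (730.0, 1090.0, 2440.0),
    "uw": (300.0, 870.0, 2240.0),
}


def target_trajectories(segments, frame_times):
    """Interpolate nominal formants between adjacent phoneme centers.

    segments: list of (phoneme, start_s, end_s) from the time alignment,
              in temporal order with positive durations.
    frame_times: frame center times in seconds.
    Returns one (F1, F2, F3) target tuple per frame.
    """
    centers = [(start + end) / 2.0 for _, start, end in segments]
    nominals = [NOMINAL_FORMANTS[name] for name, _, _ in segments]
    targets = []
    for t in frame_times:
        i = bisect_right(centers, t)
        if i == 0:
            # Before the first phoneme center: hold the first nominal values.
            targets.append(nominals[0])
        elif i == len(centers):
            # At or after the last phoneme center: hold the last nominal values.
            targets.append(nominals[-1])
        else:
            # Linear interpolation (assumed) between the two nearest centers.
            w = (t - centers[i - 1]) / (centers[i] - centers[i - 1])
            targets.append(tuple(
                (1.0 - w) * a + w * b
                for a, b in zip(nominals[i - 1], nominals[i])
            ))
    return targets


def select_candidates(candidates, targets):
    """Pick, for each frame, the candidate set with the lowest cost.

    candidates: per frame, a list of (F1, F2, F3) candidate tuples.
    The cost here is a plain squared distance to the target, which is
    an assumed stand-in for the patent's cost factor.
    """
    def cost(candidate, target):
        return sum((c - t) ** 2 for c, t in zip(candidate, target))

    return [
        min(frame_candidates, key=lambda c: cost(c, target))
        for frame_candidates, target in zip(candidates, targets)
    ]
```

A caller would pass the aligned phoneme segments and the frame times to `target_trajectories`, then pass the per-frame candidate lists from the formant analysis together with those targets to `select_candidates`. A fuller cost function might also penalize frame-to-frame discontinuity, but the summary does not say whether the patent does so.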