A stochastic model of intonation for text-to-speech synthesis

This paper presents a stochastic model of intonation contours for use in text-to-speech synthesis. The model has two modules, a linguistic module that generates abstract prosodic labels from text, and a phonetic module that generates an F 0 curve from the abstract prosodic labels. This model differs...

Full description

Saved in:
Bibliographic Details
Published inSpeech communication Vol. 26; no. 4; pp. 233 - 244
Main Authors Véronis, Jean, Di Cristo, Philippe, Courtois, Fabienne, Chaumette, Cédric
Format Journal Article
LanguageEnglish
Published Amsterdam Elsevier B.V 01.12.1998
Elsevier
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:This paper presents a stochastic model of intonation contours for use in text-to-speech synthesis. The model has two modules, a linguistic module that generates abstract prosodic labels from text, and a phonetic module that generates an F 0 curve from the abstract prosodic labels. This model differs from previous work in the abstract prosodic labels used, which can be automatically derived from the training corpus. This feature makes it possible to use large corpora or several corpora of different speech styles, in addition to making it easy to adapt to new languages. The present paper focuses on the linguistic module, which does not require full syntactic analysis of the text but simply relies on part-of-speech tagging. The results were validated on French by means of a perception test. Listeners did not perceive a significant difference in quality between the sentences synthesised using the phonetic module only, with prosodic labels derived from original recordings as input, and those synthesised directly from the text using the linguistic module followed by the phonetic module. The proposed model thus appears to capture most of the grammatical information needed to generate F 0.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0167-6393
1872-7182
DOI:10.1016/S0167-6393(98)00063-6