Prosody Generation Using Syllable-Centered Polynomial Representation of Pitch Contours

Bibliographic Details
Main Author CHEN CHENGJUN JULIAN
Format Patent
Language English
Published 10.07.2014
Summary: The present invention discloses a parametric representation of prosody based on polynomial expansion coefficients of the pitch contour near the center of each syllable. These syllable pitch expansion coefficients are generated from a recorded speech database consisting of a number of sentences read by a reference speaker. A correlation database is formed by correlating the stress level and context information of each syllable in the text with the polynomial expansion coefficients of the corresponding spoken syllable. To generate prosody for an input text, the stress level and context information of each syllable in the text are identified, and the correlation database is used to find the best set of pitch parameters for each syllable. The complete pitch contour for the input text is then generated by adding the syllable pitch parameters to global pitch contours and applying interpolation formulas. Duration and intensity profiles are generated using a similar procedure.
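As an illustration of the first step in the summary, the following is a minimal sketch of how polynomial expansion coefficients of a pitch contour near a syllable center might be computed. It is not taken from the patent: the least-squares fit, the default polynomial degree of 2, the window half-width, and all function and parameter names (solve_linear, syllable_pitch_coefficients, half_width) are assumptions chosen for illustration only.

```python
from typing import List, Sequence


def solve_linear(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a small dense system a @ x = b by Gaussian elimination
    with partial pivoting (hypothetical helper, not from the patent)."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        # Swap in the row with the largest pivot for numerical stability.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution.
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def syllable_pitch_coefficients(
    times: Sequence[float],
    pitch: Sequence[float],
    center: float,
    half_width: float,
    degree: int = 2,
) -> List[float]:
    """Fit pitch(t) ~ sum_k c_k * (t - center)**k by least squares, using
    only samples within half_width of the syllable center.
    Returns [c_0, c_1, ..., c_degree]. The degree and the window are
    assumed values, not specified by the source."""
    pts = [(t - center, p) for t, p in zip(times, pitch)
           if abs(t - center) <= half_width]
    if len(pts) <= degree:
        raise ValueError("not enough pitch samples near the syllable center")
    n = degree + 1
    # Normal equations (A^T A) c = A^T p for the monomial basis.
    ata = [[sum(x ** (i + j) for x, _ in pts) for j in range(n)]
           for i in range(n)]
    atp = [sum((x ** i) * p for x, p in pts) for i in range(n)]
    return solve_linear(ata, atp)


# Usage with a synthetic contour (values in Hz and seconds are made up).
times = [i * 0.01 for i in range(21)]
pitch = [120.0 + 50.0 * (t - 0.10) - 400.0 * (t - 0.10) ** 2 for t in times]
coeffs = syllable_pitch_coefficients(times, pitch, center=0.10, half_width=0.05)
print([round(c, 3) for c in coeffs])
```

In this sketch the coefficients describe the local pitch level, slope, and curvature at the syllable center. Coefficient vectors of this kind could then be stored alongside each syllable's stress level and context to form the correlation database described above.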
Bibliography: Application Number: US201414216611