SYSTEM AND METHOD FOR DATA-DRIVEN INTONATION GENERATION

Systems, methods, and computer-readable storage media for text-to-speech processing having an improved intonation. The system first receives text to be converted to speech, the text having a first segment and a second segment. The system then compares the text to a database of stored utterances, ide...

Full description

Saved in:
Bibliographic Details
Main Authors CONKIE ALISTAIR D, KIM YEON-JUN, BEUTNAGEL MARK CHARLES, MISHRA TANIYA
Format Patent
LanguageEnglish
Published 28.05.2015
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Systems, methods, and computer-readable storage media for text-to-speech processing having an improved intonation. The system first receives text to be converted to speech, the text having a first segment and a second segment. The system then compares the text to a database of stored utterances, identifying in the database a first utterance corresponding to the first segment and determining an intonation of the first utterance. When the database does not contain a second utterance corresponding to the second segment, the system generates the speech corresponding to the text by combining the first utterance with a generated second utterance corresponding to the second segment, the generated second utterance having the intonation matching, or based on, the first utterance. These actions lead to an improved, smoother, more human-like synthetic speech output from the system.
Bibliography:Application Number: US201314087840