HMM-Based Vietnamese Speech Synthesis

In this paper, improving naturalness HMM-based speech synthesis for Vietnamese language is described. By this synthesis method, trajectories of speech parameters are generated from the trained Hidden Markov models. A final speech waveform is synthesized from those speech parameters. The main objecti...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of software innovation Vol. 3; no. 4; pp. 33 - 47
Main Authors Trinh, Son, Hoang, Kiem
Format Journal Article
LanguageEnglish
Published Mount Pleasant IGI Global 01.10.2015
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In this paper, improving naturalness HMM-based speech synthesis for Vietnamese language is described. By this synthesis method, trajectories of speech parameters are generated from the trained Hidden Markov models. A final speech waveform is synthesized from those speech parameters. The main objective for the development is to achieve maximum naturalness in output speech through key points. Firstly, system uses a high quality recorded Vietnamese speech database appropriate for training, especially in statistical parametric model approach. Secondly, prosodic informations such as tone, POS (part of speech) and features based on characteristics of Vietnamese language are added to ensure the quality of synthetic speech. Third, system uses STRAIGHT which showed its ability to produce high-quality voice manipulation and was successfully incorporated into HMM-based speech synthesis. The results collected show that the speech produced by our system has the best result when being compared with the other Vietnamese TTS systems trained from the same speech data.
ISSN:2166-7160
2166-7179
DOI:10.4018/IJSI.2015100103