HMM-Based Vietnamese Speech Synthesis

Bibliographic Details
Published in	International Journal of Software Innovation, Vol. 3, no. 4, pp. 33-47
Main Authors	Trinh, Son; Hoang, Kiem
Format	Journal Article
Language	English
Published	Mount Pleasant: IGI Global, 01.10.2015
Subjects	Linguistics; Markov chains; Mathematical models; Parameters; Speech; Speech recognition; Waveforms
Summary:	This paper describes work on improving the naturalness of HMM-based speech synthesis for the Vietnamese language. In this synthesis method, trajectories of speech parameters are generated from trained hidden Markov models (HMMs), and the final speech waveform is synthesized from those parameters. The main objective of the development is to achieve maximum naturalness in the output speech, which is pursued through three key points. Firstly, the system uses a high-quality recorded Vietnamese speech database that is appropriate for training, especially in the statistical parametric modeling approach. Secondly, prosodic information such as tone and part of speech (POS), together with features based on the characteristics of the Vietnamese language, is added to ensure the quality of the synthetic speech. Thirdly, the system uses STRAIGHT, which has shown its ability to produce high-quality voice manipulation and has been successfully incorporated into HMM-based speech synthesis. The results show that the speech produced by our system achieves the best results among the Vietnamese TTS systems compared, all of which were trained on the same speech data.
ISSN:	2166-7160, 2166-7179
DOI:	10.4018/IJSI.2015100103