Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder

Nowadays, neural vocoders can generate very high-fidelity speech when a bunch of training data is available. Although a speaker-dependent (SD) vocoder usually outperforms a speaker-independent (SI) vocoder, it is impractical to collect a large amount of data of a specific target speaker for most rea...

Full description

Saved in:
Bibliographic Details
Main Authors Wu, Yi-Chiao, Hu, Cheng-Hung, Lee, Hung-Shin, Peng, Yu-Huai, Huang, Wen-Chin, Tsao, Yu, Wang, Hsin-Min, Toda, Tomoki
Format Journal Article
LanguageEnglish
Published 10.06.2021
Online AccessGet full text

Cover

Loading…