Voice Conversion Using Conditional CycleGAN

Voice conversion (VC) modifies characteristics of speech, such as gender and speaker identities. The VC can be applied to various tasks including speaking assistance and speaker anonymization. Generally, such VC techniques require parallel speech data for training, which is very expensive. Recently,...

Full description

Saved in:
Bibliographic Details
Published in2018 International Conference on Computational Science and Computational Intelligence (CSCI) pp. 1460 - 1461
Main Authors Yook, Dongsuk, Yoo, In-Chul, Yoo, Seungho
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.12.2018
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Voice conversion (VC) modifies characteristics of speech, such as gender and speaker identities. The VC can be applied to various tasks including speaking assistance and speaker anonymization. Generally, such VC techniques require parallel speech data for training, which is very expensive. Recently, voice conversion has been accomplished using CycleGAN, which does not require parallel speech data. In this paper, we further extend the idea of using CycleGAN to convert multiple speakers' voices by conditioning the CycleGAN using speaker identity information.
DOI:10.1109/CSCI46756.2018.00290