Voice Conversion Using Conditional CycleGAN

Voice conversion (VC) modifies characteristics of speech, such as gender and speaker identities. The VC can be applied to various tasks including speaking assistance and speaker anonymization. Generally, such VC techniques require parallel speech data for training, which is very expensive. Recently,...

Full description

Saved in:

Bibliographic Details
Published in	2018 International Conference on Computational Science and Computational Intelligence (CSCI) pp. 1460 - 1461
Main Authors	Yook, Dongsuk, Yoo, In-Chul, Yoo, Seungho
Format	Conference Proceeding
Language	English
Published	IEEE 01.12.2018
Subjects	Conditional CycleGAN (CC-GAN) CycleGAN Gallium nitride Generative adversarial networks generative adversarial networks (GAN) Generators Linguistics Logic gates Task analysis Training voice conversion
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Voice conversion (VC) modifies characteristics of speech, such as gender and speaker identities. The VC can be applied to various tasks including speaking assistance and speaker anonymization. Generally, such VC techniques require parallel speech data for training, which is very expensive. Recently, voice conversion has been accomplished using CycleGAN, which does not require parallel speech data. In this paper, we further extend the idea of using CycleGAN to convert multiple speakers' voices by conditioning the CycleGAN using speaker identity information.
DOI:	10.1109/CSCI46756.2018.00290