On the Study of Generative Adversarial Networks for Cross-Lingual Voice Conversion
Sisman, Berrak, Zhang, Mingyang, Dong, Minghui, Li, Haizhou
Published in 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) (01.12.2019)
Conference Proceeding
Seen and Unseen Emotional Style Transfer for Voice Conversion with A New Emotional Speech Dataset
Zhou, Kun, Sisman, Berrak, Liu, Rui, Li, Haizhou
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Conference Proceeding
Transformation of prosody in voice conversion
Sisman, Berrak, Li, Haizhou, Tan, Kay Chen
Published in 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) (01.12.2017)
Conference Proceeding
DeepConversion: Voice conversion with limited parallel training data
Zhang, Mingyang, Sisman, Berrak, Zhao, Li, Li, Haizhou
Published in Speech communication (01.09.2020)
Journal Article
Emotion Intensity and its Control for Emotional Voice Conversion
Zhou, Kun, Sisman, Berrak, Rana, Rajib, Schuller, Bjorn W., Li, Haizhou
Published in IEEE transactions on affective computing (01.01.2023)
Journal Article
Error Reduction Network for DBLSTM-based Voice Conversion
Zhang, Mingyang, Sisman, Berrak, Rallabandi, Sai Sirisha, Li, Haizhou, Zhao, Li
Published in 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) (01.11.2018)
Conference Proceeding
Modeling Prosodic Phrasing with Multi-Task Learning in Tacotron-based TTS
Liu, Rui, Sisman, Berrak, Bao, Feilong, Gao, Guang Lai, Li, Haizhou
Published in IEEE signal processing letters (01.01.2020)
Journal Article
Versatile Audio-Visual Learning for Emotion Recognition
Goncalves, Lucas, Leem, Seong-Gyun, Lin, Wei-Cheng, Sisman, Berrak, Busso, Carlos
Published in IEEE transactions on affective computing (24.07.2024)
Journal Article
Revealing Emotional Clusters in Speaker Embeddings: A Contrastive Learning Strategy for Speech Emotion Recognition
Ulgen, Ismail Rasim, Du, Zongyang, Busso, Carlos, Sisman, Berrak
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Conference Proceeding
VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over
Lu, Junchen, Sisman, Berrak, Liu, Rui, Zhang, Mingyang, Li, Haizhou
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Conference Proceeding
Speech Synthesis with Mixed Emotions
Zhou, Kun, Sisman, Berrak, Rana, Rajib, Schuller, Bjorn W., Li, Haizhou
Published in IEEE transactions on affective computing (01.10.2023)
Journal Article