SECap: Speech Emotion Captioning with Large Language Model
Xu, Yaoxun, Chen, Hangting, Yu, Jianwei, Huang, Qiaochu, Wu, Zhiyong, Zhang, Shixiong, Li, Guangzhi, Luo, Yi, Gu, Rongzhi
Published in arXiv.org (23.12.2023)
Paper
Automatic Prosody Annotation with Pre-Trained Text-Speech Model
Dai, Ziqian, Yu, Jianwei, Wang, Yan, Chen, Nuo, Bian, Yanyao, Li, Guangzhi, Cai, Deng, Yu, Dong
Published in arXiv.org (16.06.2022)
Paper
Maximizing Mutual Information for Tacotron
Liu, Peng, Wu, Xixin, Kang, Shiyin, Li, Guangzhi, Su, Dan, Yu, Dong
Published in arXiv.org (18.11.2019)
Paper