Audio-Visual Recognition of Overlapped Speech for the LRS2 Dataset
Yu, Jianwei, Zhang, Shi-Xiong, Wu, Jian, Ghorbani, Shahram, Wu, Bo, Kang, Shiyin, Liu, Shansong, Liu, Xunying, Meng, Helen, Yu, Dong
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Get full text
Conference Proceeding
Multi-distribution deep belief network for speech synthesis
Shiyin Kang, Xiaojun Qian, Meng, Helen
Published in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (01.05.2013)
Published in 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (01.05.2013)
Get full text
Conference Proceeding
A deep recurrent approach for acoustic-to-articulatory inversion
Peng Liu, Quanjie Yu, Zhiyong Wu, Shiyin Kang, Meng, Helen, Lianhong Cai
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2015)
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2015)
Get full text
Conference Proceeding
A Compact Framework for Voice Conversion Using Wavenet Conditioned on Phonetic Posteriorgrams
Lu, Hui, Wu, Zhiyong, Li, Runnan, Kang, Shiyin, Jia, Jia, Meng, Helen
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2019)
Published in ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2019)
Get full text
Conference Proceeding
End-To-End Accent Conversion Without Using Native Utterances
Liu, Songxiang, Wang, Disong, Cao, Yuewen, Sun, Lifa, Wu, Xixin, Kang, Shiyin, Wu, Zhiyong, Liu, Xunying, Su, Dan, Yu, Dong, Meng, Helen
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Get full text
Conference Proceeding
Phonetic posteriorgrams for many-to-one voice conversion without parallel data training
Lifa Sun, Kun Li, Hao Wang, Shiyin Kang, Meng, Helen
Published in 2016 IEEE International Conference on Multimedia and Expo (ICME) (01.07.2016)
Published in 2016 IEEE International Conference on Multimedia and Expo (ICME) (01.07.2016)
Get full text
Conference Proceeding
Journal Article
Comparison of Syllable/Phone HMM Based Mandarin TTS
Quansheng Duan, Shiyin Kang, Zhiyong Wu, Lianhong Cai, Zhiwei Shuang, Yong Qin
Published in 2010 20th International Conference on Pattern Recognition (01.08.2010)
Published in 2010 20th International Conference on Pattern Recognition (01.08.2010)
Get full text
Conference Proceeding
On the localness modeling for the self-attention based end-to-end speech synthesis
Yang, Shan, Lu, Heng, Kang, Shiyin, Xue, Liumeng, Xiao, Jinba, Su, Dan, Xie, Lei, Yu, Dong
Published in Neural networks (01.05.2020)
Published in Neural networks (01.05.2020)
Get full text
Journal Article
Neural Network Language Modeling with Letter-Based Features and Importance Sampling
Xu, Hainan, Li, Ke, Wang, Yiming, Wang, Jian, Kang, Shiyin, Chen, Xie, Povey, Daniel, Khudanpur, Sanjeev
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Published in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2018)
Get full text
Conference Proceeding
FullSubNet+: Channel Attention Fullsubnet with Complex Spectrograms for Speech Enhancement
Chen, Jun, Wang, Zilin, Tuo, Deyi, Wu, Zhiyong, Kang, Shiyin, Meng, Helen
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Get full text
Conference Proceeding
PEG/RGD-modified magnetic polymeric liposomes for controlled drug release and tumor cell targeting
Su, Wenya, Wang, Hanjie, Wang, Sheng, Liao, Zhenyu, Kang, Shiyin, Peng, Yao, Han, Lei, Chang, Jin
Published in International journal of pharmaceutics (15.04.2012)
Published in International journal of pharmaceutics (15.04.2012)
Get full text
Journal Article
SCNet: Sparse Compression Network for Music Source Separation
Tong, Weinan, Zhu, Jiaxu, Chen, Jun, Kang, Shiyin, Jiang, Tao, Li, Yang, Wu, Zhiyong, Meng, Helen
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Published in ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (14.04.2024)
Get full text
Conference Proceeding
Disentangling Content and Fine-Grained Prosody Information Via Hybrid ASR Bottleneck Features for Voice Conversion
Zhao, Xintao, Liu, Feng, Song, Changhe, Wu, Zhiyong, Kang, Shiyin, Tuo, Deyi, Meng, Helen
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Get full text
Conference Proceeding
GTN-Bailando: Genre Consistent long-Term 3D Dance Generation Based on Pre-Trained Genre Token Network
Zhuang, Haolin, Lei, Shun, Xiao, Long, Li, Weiqin, Chen, Liyang, Yang, Sicheng, Wu, Zhiyong, Kang, Shiyin, Meng, Helen
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Get full text
Conference Proceeding
MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech Synthesis
Lei, Shun, Zhou, Yixuan, Chen, Liyang, Wu, Zhiyong, Wu, Xixin, Kang, Shiyin, Meng, Helen
Published in IEEE/ACM transactions on audio, speech, and language processing (2023)
Published in IEEE/ACM transactions on audio, speech, and language processing (2023)
Get full text
Journal Article
CB-Conformer: Contextual Biasing Conformer for Biased Word Recognition
Xu, Yaoxun, Liu, Baiji, Huang, Qiaochu, Song, Xingchen, Wu, Zhiyong, Kang, Shiyin, Meng, Helen
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Get full text
Conference Proceeding
Context-Aware Coherent Speaking Style Prediction with Hierarchical Transformers for Audiobook Speech Synthesis
Lei, Shun, Zhou, Yixuan, Chen, Liyang, Wu, Zhiyong, Kang, Shiyin, Meng, Helen
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Get full text
Conference Proceeding
Towards Expressive Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis
Lei, Shun, Zhou, Yixuan, Chen, Liyang, Wu, Zhiyong, Kang, Shiyin, Meng, Helen
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)
Get full text
Conference Proceeding
TFCnet: Time-Frequency Domain Corrector for Speech Separation
Tong, Weinan, Zhu, Jiaxu, Chen, Jun, Wu, Zhiyong, Kang, Shiyin, Meng, Helen
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)
Get full text
Conference Proceeding