Voice Activity Detection Based on an Unsupervised Learning Framework
Dongwen Ying, Yonghong Yan, Jianwu Dang, Soong, F. K.
Published in IEEE transactions on audio, speech, and language processing (01.11.2011)
Journal Article
A deep bidirectional LSTM approach for video-realistic talking head
Fan, Bo, Xie, Lei, Yang, Shan, Wang, Lijuan, Soong, Frank K.
Published in Multimedia tools and applications (01.05.2016)
Journal Article
A Unified Trajectory Tiling Approach to High Quality Speech Rendering
Yao Qian, Soong, F. K., Zhi-Jie Yan
Published in IEEE transactions on audio, speech, and language processing (01.02.2013)
Journal Article
A Cross-Language State Sharing and Mapping Approach to Bilingual (Mandarin-English) TTS
Yao Qian, Hui Liang, Soong, F.K.
Published in IEEE transactions on audio, speech, and language processing (01.08.2009)
Journal Article
Word embedding for recurrent neural network based TTS synthesis
Peilu Wang, Yao Qian, Soong, Frank K., Lei He, Hai Zhao
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2015)
Conference Proceeding
Speaker and language factorization in DNN-based TTS synthesis
Yuchen Fan, Yao Qian, Soong, Frank K., Lei He
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2016)
Conference Proceeding
Cycle consistent network for end-to-end style transfer TTS training
Xue, Liumeng, Pan, Shifeng, He, Lei, Xie, Lei, Soong, Frank K.
Published in Neural networks (01.08.2021)
Journal Article
Improving Prosody with Linguistic and Bert Derived Features in Multi-Speaker Based Mandarin Chinese Neural TTS
Xiao, Yujia, He, Lei, Ming, Huaiping, Soong, Frank K.
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)
Conference Proceeding
MSMC-TTS: Multi-Stage Multi-Codebook VQ-VAE based Neural TTS
Guo, Haohan, Xie, Fenglong, Wu, Xixin, Soong, Frank K., Meng, Helen
Published in IEEE/ACM transactions on audio, speech, and language processing (01.01.2023)
Journal Article
Speech Bert Embedding for Improving Prosody in Neural TTS
Chen, Liping, Deng, Yan, Wang, Xi, Soong, Frank K., He, Lei
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)
Conference Proceeding
On the training aspects of Deep Neural Network (DNN) for parametric TTS synthesis
Yao Qian, Yuchen Fan, Wenping Hu, Soong, Frank K.
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2014)
Conference Proceeding
Photo-real talking head with deep bidirectional LSTM
Bo Fan, Lijuan Wang, Soong, Frank K., Lei Xie
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2015)
Conference Proceeding
Rich-context Unit Selection (RUS) approach to high quality TTS
Zhi-Jie Yan, Yao Qian, Soong, Frank K.
Published in 2010 IEEE International Conference on Acoustics, Speech and Signal Processing (01.01.2010)
Conference Proceeding