Search Results - "Soong, Frank K." :: K.UTB vyhledávací portál

Voice Activity Detection Based on an Unsupervised Learning Framework

by Dongwen Ying, Yonghong Yan, Jianwu Dang, Soong, F. K.
Published in IEEE transactions on audio, speech, and language processing (01.11.2011)

Get full text

Journal Article

Loading…

A deep bidirectional LSTM approach for video-realistic talking head

by Fan, Bo, Xie, Lei, Yang, Shan, Wang, Lijuan, Soong, Frank K.
Published in Multimedia tools and applications (01.05.2016)

Get full text

Journal Article

Loading…

HMM trajectory-guided sample selection for photo-realistic talking head

by Wang, Lijuan, Soong, Frank K.
Published in Multimedia tools and applications (01.11.2015)

Get full text

Journal Article

Loading…

A Unified Trajectory Tiling Approach to High Quality Speech Rendering

by Yao Qian, Soong, F. K., Zhi-Jie Yan
Published in IEEE transactions on audio, speech, and language processing (01.02.2013)

Get full text

Journal Article

Loading…

A Cross-Language State Sharing and Mapping Approach to Bilingual (Mandarin-English) TTS

by Yao Qian, Hui Liang, Soong, F.K.
Published in IEEE transactions on audio, speech, and language processing (01.08.2009)

Get full text

Journal Article

Loading…

Word embedding for recurrent neural network based TTS synthesis

by Peilu Wang, Yao Qian, Soong, Frank K., Lei He, Hai Zhao
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2015)

Get full text

Conference Proceeding

Loading…

Speaker and language factorization in DNN-based TTS synthesis

by Yuchen Fan, Yao Qian, Soong, Frank K., Lei He
Published in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.03.2016)

Get full text

Conference Proceeding Journal Article

Loading…

Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers

by Hu, Wenping, Qian, Yao, Soong, Frank K., Wang, Yong
Published in Speech communication (01.03.2015)

Get full text

Journal Article

Loading…

Cycle consistent network for end-to-end style transfer TTS training

by Xue, Liumeng, Pan, Shifeng, He, Lei, Xie, Lei, Soong, Frank K.
Published in Neural networks (01.08.2021)

Get full text

Journal Article

Loading…

Effective and direct control of neural TTS prosody by removing interactions between different attributes

by An, Xiaochun, Soong, Frank K., Yang, Shan, Xie, Lei
Published in Neural networks (01.11.2021)

Get full text

Journal Article

Loading…

Disentangling Style and Speaker Attributes for TTS Style Transfer

by An, Xiaochun, Soong, Frank K., Xie, Lei
Published in IEEE/ACM transactions on audio, speech, and language processing (2022)

Get full text

Journal Article

Loading…

Improving Prosody with Linguistic and Bert Derived Features in Multi-Speaker Based Mandarin Chinese Neural TTS

by Xiao, Yujia, He, Lei, Ming, Huaiping, Soong, Frank K.
Published in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2020)

Get full text

Conference Proceeding

Loading…

MSMC-TTS: Multi-Stage Multi-Codebook VQ-VAE based Neural TTS

by Guo, Haohan, Xie, Fenglong, Wu, Xixin, Soong, Frank K., MengFellow, Helen
Published in IEEE/ACM transactions on audio, speech, and language processing (01.01.2023)

Get full text

Journal Article

Loading…

Voice conversion with SI-DNN and KL divergence based mapping without parallel training data

by Xie, Feng-Long, Soong, Frank K., Li, Haifeng
Published in Speech communication (01.01.2019)

Get full text

Journal Article

Loading…

ParaTTS: Learning Linguistic and Prosodic Cross-Sentence Information in Paragraph-Based TTS

by Xue, Liumeng, Soong, Frank K., Zhang, Shaofei, Xie, Lei
Published in IEEE/ACM transactions on audio, speech, and language processing (2022)

Get full text

Journal Article

Loading…

Speech Bert Embedding for Improving Prosody in Neural TTS

by Chen, Liping, Deng, Yan, Wang, Xi, Soong, Frank K., He, Lei
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)

Get full text

Conference Proceeding

Loading…

On the training aspects of Deep Neural Network (DNN) for parametric TTS synthesis

by Yao Qian, Yuchen Fan, Wenping Hu, Soong, Frank K.
Published in 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.05.2014)

Get full text

Conference Proceeding

Loading…

Photo-real talking head with deep bidirectional LSTM

by Bo Fan, Lijuan Wang, Soong, Frank K., Lei Xie
Published in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (01.04.2015)

Get full text

Conference Proceeding

Loading…

Improving Fastspeech TTS with Efficient Self-Attention and Compact Feed-Forward Network

by Xiao, Yujia, Wang, Xi, He, Lei, Soong, Frank K.
Published in ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (23.05.2022)

Get full text

Conference Proceeding

Loading…

RIch-context Unit Selection (RUS) approach to high quality TTS

by Zhi-Jie Yan, Yao Qian, Soong, Frank K
Published in 2010 IEEE International Conference on Acoustics, Speech and Signal Processing (01.01.2010)

Get full text

Conference Proceeding

Refine Results

Format

Subject Area

Topic

Language

Year of Publication

Database