Search Results - "Xin, Detai" :: K.UTB vyhledávací portál

Improving Speech Prosody of Audiobook Text-To-Speech Synthesis with Acoustic and Textual Contexts

by Xin, Detai, Adavanne, Sharath, Ang, Federico, Kulkarni, Ashish, Takamichi, Shinnosuke, Saruwatari, Hiroshi
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)

Get full text

Conference Proceeding

Loading…

Duration-Aware Pause Insertion Using Pre-Trained Language Model for Multi-Speaker Text-To-Speech

by Yang, Dong, Koriyama, Tomoki, Saito, Yuki, Saeki, Takaaki, Xin, Detai, Saruwatari, Hiroshi
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)

Get full text

Conference Proceeding

Loading…

MID-Attribute Speaker Generation Using Optimal-Transport-Based Interpolation of Gaussian Mixture Models

by Watanabe, Aya, Takamichi, Shinnosuke, Saito, Yuki, Xin, Detai, Saruwatari, Hiroshi
Published in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (04.06.2023)

Get full text

Conference Proceeding

Loading…

JVNV: A Corpus of Japanese Emotional Speech With Verbal Content and Nonverbal Expressions

by Xin, Detai, Jiang, Junfeng, Takamichi, Shinnosuke, Saito, Yuki, Aizawa, Akiko, Saruwatari, Hiroshi
Published in IEEE access (2024)

Get full text

Journal Article

Loading…

JVNV: A Corpus of Japanese Emotional Speech with Verbal Content and Nonverbal Expressions

by Detai Xin, Jiang, Junfeng, Takamichi, Shinnosuke, Saito, Yuki, Aizawa, Akiko, Saruwatari, Hiroshi
Published in arXiv.org (09.10.2023)

Get full text

Paper Journal Article

Loading…

JNV corpus: A corpus of Japanese nonverbal vocalizations with diverse phrases and emotions

by Xin, Detai, Takamichi, Shinnosuke, Saruwatari, Hiroshi
Published in Speech communication (01.01.2024)

Get full text

Journal Article

Loading…

Disentangled Speaker and Language Representations Using Mutual Information Minimization and Domain Adaptation for Cross-Lingual TTS

by Xin, Detai, Komatsu, Tatsuya, Takamichi, Shinnosuke, Saruwatari, Hiroshi
Published in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (06.06.2021)

Get full text

Conference Proceeding

Loading…

JNV Corpus: A Corpus of Japanese Nonverbal Vocalizations with Diverse Phrases and Emotions

by Xin, Detai, Takamichi, Shinnosuke, Saruwatari, Hiroshi
Year of Publication 21.05.2023

Get full text

Journal Article

Loading…

BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec

by Xin, Detai, Tan, Xu, Takamichi, Shinnosuke, Saruwatari, Hiroshi
Year of Publication 09.09.2024

Get full text

Journal Article

Loading…

Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations

by Xin, Detai, Takamichi, Shinnosuke, Saruwatari, Hiroshi
Year of Publication 21.06.2022

Get full text

Journal Article

Loading…

Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus

by Xin, Detai, Takamichi, Shinnosuke, Morimatsu, Ai, Saruwatari, Hiroshi
Year of Publication 21.05.2023

Get full text

Journal Article

Loading…

Mid-attribute speaker generation using optimal-transport-based interpolation of Gaussian mixture models

by Watanabe, Aya, Takamichi, Shinnosuke, Saito, Yuki, Xin, Detai, Saruwatari, Hiroshi
Year of Publication 18.10.2022

Get full text

Journal Article

Loading…

Building speech corpus with diverse voice characteristics for its prompt-based representation

by Watanabe, Aya, Takamichi, Shinnosuke, Saito, Yuki, Nakata, Wataru, Xin, Detai, Saruwatari, Hiroshi
Year of Publication 20.03.2024

Get full text

Journal Article

Loading…

Speaking-Rate-Controllable HiFi-GAN Using Feature Interpolation

by Xin, Detai, Takamichi, Shinnosuke, Okamoto, Takuma, Kawai, Hisashi, Saruwatari, Hiroshi
Year of Publication 22.04.2022

Get full text

Journal Article

Loading…

Coco-Nut: Corpus of Japanese Utterance and Voice Characteristics Description for Prompt-based Control

by Watanabe, Aya, Takamichi, Shinnosuke, Saito, Yuki, Nakata, Wataru, Xin, Detai, Saruwatari, Hiroshi
Year of Publication 23.09.2023

Get full text

Journal Article

Loading…

How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics

by Park, Joonyong, Takamichi, Shinnosuke, Nakamura, Tomohiko, Seki, Kentaro, Xin, Detai, Saruwatari, Hiroshi
Year of Publication 01.06.2023

Get full text

Journal Article

Loading…

Duration-aware pause insertion using pre-trained language model for multi-speaker text-to-speech

by Yang, Dong, Koriyama, Tomoki, Saito, Yuki, Saeki, Takaaki, Xin, Detai, Saruwatari, Hiroshi
Year of Publication 27.02.2023

Get full text

Journal Article

Loading…

Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts

by Xin, Detai, Adavanne, Sharath, Ang, Federico, Kulkarni, Ashish, Takamichi, Shinnosuke, Saruwatari, Hiroshi
Year of Publication 04.11.2022

Get full text

Journal Article

Loading…

JNV Corpus: A Corpus of Japanese Nonverbal Vocalizations with Diverse Phrases and Emotions

by Detai Xin, Takamichi, Shinnosuke, Saruwatari, Hiroshi
Published in arXiv.org (21.05.2023)

Get full text

Paper

Loading…

RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis

by Xin, Detai, Tan, Xu, Shen, Kai, Ju, Zeqian, Yang, Dongchao, Wang, Yuancheng, Takamichi, Shinnosuke, Saruwatari, Hiroshi, Liu, Shujie, Li, Jinyu, Zhao, Sheng
Year of Publication 04.04.2024

Get full text

Journal Article

Refine Results

Format

Subject Area

Topic

Language

Year of Publication

Database