GENERATING A VOICE MODEL FOR A USER

Disclosed herein a system, a method and a device for generating a voice model for a user. A device can include an encoder and a decoder to generate a voice model for converting text to an audio output that resembles a voice of the person sending respective text. The encoder can includes a neural net...

Full description

Saved in:
Bibliographic Details
Main Authors ZVI, Tali, WOLF, Lior, VAZQUEZ, David, PARK, Hyunbin, TAIGMAN, Yaniv Nechemia, POLYAK, Adam
Format Patent
LanguageEnglish
French
German
Published 21.09.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Disclosed herein a system, a method and a device for generating a voice model for a user. A device can include an encoder and a decoder to generate a voice model for converting text to an audio output that resembles a voice of the person sending respective text. The encoder can includes a neural network and can receive a plurality of audio samples from a user. The encoder can generate a sequence of values and provide the sequence of values to the decoder. The decoder can establish, using the sequence of values and one or more speaker embeddings of the user, a voice model corresponding to the plurality of audio samples of the user.
Bibliography:Application Number: EP20200801099