GENERATING A VOICE MODEL FOR A USER
Main Authors | , , , , , |
---|---|
Format | Patent |
Language | English, French, German |
Published | 21.09.2022 |
Subjects | |
Summary: | Disclosed herein are a system, a method, and a device for generating a voice model for a user. A device can include an encoder and a decoder to generate a voice model for converting text to an audio output that resembles the voice of the person sending the respective text. The encoder can include a neural network and can receive a plurality of audio samples from a user. The encoder can generate a sequence of values and provide the sequence of values to the decoder. The decoder can establish, using the sequence of values and one or more speaker embeddings of the user, a voice model corresponding to the plurality of audio samples of the user. |
---|---|
Bibliography: | Application Number: EP20200801099 |
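
The data flow described in the summary (audio samples into an encoder, a sequence of values passed to a decoder, and a voice model established from that sequence plus speaker embeddings) can be illustrated with a toy sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the names `Encoder`, `Decoder`, `VoiceModel`, `frame_size`, and `hidden_size` are hypothetical, a single random `tanh` layer stands in for the encoder's neural network, and averaging and concatenating the speaker embeddings is only one simple way to condition on them.

```python
import math
import random
from dataclasses import dataclass
from typing import List, Sequence

Vector = List[float]


class Encoder:
    """Toy stand-in for the encoder: one random tanh layer (an assumption)."""

    def __init__(self, frame_size: int, hidden_size: int, seed: int = 0):
        rng = random.Random(seed)
        self.frame_size = frame_size
        # One weight row per output value; untrained, for illustration only.
        self.weights = [
            [rng.uniform(-0.1, 0.1) for _ in range(frame_size)]
            for _ in range(hidden_size)
        ]

    def encode(self, audio_samples: Sequence[Sequence[float]]) -> List[Vector]:
        """Map a plurality of audio samples to a sequence of values."""
        sequence: List[Vector] = []
        for sample in audio_samples:
            # Split into fixed-size frames; a trailing partial frame is dropped.
            last_start = len(sample) - self.frame_size
            for start in range(0, last_start + 1, self.frame_size):
                frame = sample[start:start + self.frame_size]
                sequence.append([
                    math.tanh(sum(w * x for w, x in zip(row, frame)))
                    for row in self.weights
                ])
        return sequence


@dataclass
class VoiceModel:
    """Hypothetical container for what the decoder establishes."""
    conditioned_sequence: List[Vector]
    speaker_embedding: Vector


class Decoder:
    def establish_voice_model(
        self,
        sequence: List[Vector],
        speaker_embeddings: Sequence[Vector],
    ) -> VoiceModel:
        """Combine the encoder's sequence with the user's speaker embeddings."""
        dim = len(speaker_embeddings[0])
        count = len(speaker_embeddings)
        # Average the embeddings into one vector (illustrative choice).
        mean = [sum(e[i] for e in speaker_embeddings) / count for i in range(dim)]
        # Condition every step by appending the averaged embedding.
        conditioned = [values + mean for values in sequence]
        return VoiceModel(conditioned, mean)


# Synthetic signals stand in for a user's recorded audio samples.
audio_samples = [
    [math.sin(0.1 * n) for n in range(64)],
    [math.cos(0.2 * n) for n in range(48)],
]
speaker_embeddings = [[0.2, 0.4], [0.4, 0.8]]

encoder = Encoder(frame_size=16, hidden_size=4)
sequence = encoder.encode(audio_samples)
model = Decoder().establish_voice_model(sequence, speaker_embeddings)
```

In this sketch the encoder and decoder are kept as separate objects so that the only thing passed between them is the sequence of values, mirroring the hand-off the summary describes. A real system would train both components and would produce a model capable of synthesizing audio from text, which this sketch does not attempt.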