Reactive speech synthesis
A method of responding to an interruption in a speech synthesis system (e.g. conversational agents, AI therapists, virtual personal assistants). In response to an interruption event (e.g. when a user has tried to interrupt the dialogue), the system takes an input of the reaction type and earliest ti...
Saved in:
Main Authors | , , , |
---|---|
Format | Patent |
Language | English |
Published |
20.02.2019
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | A method of responding to an interruption in a speech synthesis system (e.g. conversational agents, AI therapists, virtual personal assistants). In response to an interruption event (e.g. when a user has tried to interrupt the dialogue), the system takes an input of the reaction type and earliest time. A new audio is then created which is suitable for splicing. The system then modifies the audio (e.g. selecting different units, applying DSP modification, stopping before the end of the text) based on the type of modification, to introduce different speaking styles to provide a natural response at a speech interface. The interruption may be configured within a particular region such as a phonetic region, word boundary region, and phase level region. Appropriate settings may also be chosen such as switch to Lombard speech or tailing off in a polite manner. |
---|---|
Bibliography: | Application Number: GB20170013273 |