Dialog management for multiple users

A system that is capable of resolving anaphora using timing data received by a local device. A local device outputs audio representing a list of entries. The audio may represent synthesized speech of the list of entries. A user can interrupt the device to select an entry in the list, such as by sayi...

Full description

Saved in:

Bibliographic Details
Main Authors	Krishnan, Prakash, Strom, Nikko, Gupta, Nishtha, Auvray, Vincent, Shen, Minmin, Mandal, Arindam, Shi, Ying, Rastrow, Ariya, Challenner, Aaron, Zheng, Bonan, Jonnalagadda, Siddhartha Reddy, Tang, David Chi-Wai, Metallinou, Angeliki
Format	Patent
Language	English
Published	20.02.2024
Subjects	ACOUSTICS CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online Access	Get full text

Cover

Loading…

More Information
Summary:	A system that is capable of resolving anaphora using timing data received by a local device. A local device outputs audio representing a list of entries. The audio may represent synthesized speech of the list of entries. A user can interrupt the device to select an entry in the list, such as by saying "that one." The local device can determine an offset time representing the time between when audio playback began and when the user interrupted. The local device sends the offset time and audio data representing the utterance to a speech processing system which can then use the offset time and stored data to identify which entry on the list was most recently output by the local device when the user interrupted. The system can then resolve anaphora to match that entry and can perform additional processing based on the referred to item.
Bibliography:	Application Number: US202017112520