Optimizing dialogue policy decisions for digital assistants using implicit feedback

Bibliographic Details
Main Authors Silvia Frias Delgaro, Thomas David Voice, David J. Vandyke, Thomas Gunter, Gennaro Frazzingaro, Thorvaldur Pall Helgason, Blaise Thomson
Format Patent
Language English
Published 20.12.2018

Summary: Systems and processes for optimizing dialogue policy decisions for digital assistants using implicit feedback are provided. In an example process, a user utterance is received. Based on a text representation of the user utterance, one or more user intents corresponding to the user utterance are determined. A policy action is selected from a plurality of candidate policy actions based on a belief state for the one or more user intents and a policy model. The policy action is performed, including outputting results of the policy action for presentation. A success score for the policy action is determined based on whether one or more predetermined types of implicit user feedback are detected after performing the policy action. A set of parameter values of the policy model is modified using the determined success score.
Bibliography: Application Number: DK2017PA70431
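
The loop described in the summary (select a policy action from a belief state, score it from implicit feedback, then adjust the policy model's parameters) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the action names, the feedback signal names, the linear softmax policy, the +1/-1 scoring rule, and the REINFORCE-style gradient update. The summary does not say which model or update rule is used.

```python
import math
import random

# Hypothetical candidate policy actions (not named in the summary).
ACTIONS = ["execute_top_intent", "ask_confirmation", "ask_disambiguation"]

# Hypothetical "predetermined types" of implicit feedback that suggest failure.
NEGATIVE_FEEDBACK = {"user_repeats_request", "user_cancels", "user_rephrases"}


def softmax(scores):
    """Convert raw scores into a probability distribution."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class PolicyModel:
    """Assumed linear softmax policy: one weight vector per candidate action."""

    def __init__(self, n_features, n_actions, learning_rate=0.1):
        self.weights = [[0.0] * n_features for _ in range(n_actions)]
        self.learning_rate = learning_rate

    def action_probabilities(self, belief_state):
        scores = [sum(w * b for w, b in zip(row, belief_state))
                  for row in self.weights]
        return softmax(scores)

    def select_action(self, belief_state, rng=random):
        """Sample an action index according to the current policy."""
        probs = self.action_probabilities(belief_state)
        return rng.choices(range(len(probs)), weights=probs, k=1)[0]

    def update(self, belief_state, action, success_score):
        """REINFORCE-style step: scale the log-probability gradient of the
        chosen action by the success score (an assumed update rule)."""
        probs = self.action_probabilities(belief_state)
        for a, row in enumerate(self.weights):
            indicator = 1.0 if a == action else 0.0
            for i, b in enumerate(belief_state):
                row[i] += (self.learning_rate * success_score
                           * (indicator - probs[a]) * b)


def success_score(observed_feedback):
    """Assumed scoring: -1.0 if any negative signal was detected, else +1.0."""
    return -1.0 if NEGATIVE_FEEDBACK & set(observed_feedback) else 1.0


# One turn of the loop with a made-up belief state over two user intents.
policy = PolicyModel(n_features=2, n_actions=len(ACTIONS))
belief_state = [0.9, 0.1]
action = policy.select_action(belief_state)
score = success_score([])  # no negative implicit feedback was observed
policy.update(belief_state, action, score)
```

In this sketch a positive score raises the probability of the chosen action for similar belief states and a negative score lowers it, which is one simple way to let implicit feedback shape later policy decisions.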