Random Action Replay for Reinforcement Learning
Main Authors | , , , |
---|---|
Format | Patent |
Language | English |
Published | 30.12.2021 |
Summary: | An artificial intelligence (AI) platform supports random action replay for natural language (NL) learning. An NL conversation is explored to train a neural network. One or more tuples are leveraged for the training, with each tuple representing an input action, a vector, an output action, and a reward value. An action is sampled from the vector, and the sampling includes assessment of a corresponding first gradient. The first gradient is applied to selectively adjust the neural network. When NL input is received and applied to the selectively adjusted neural network, an output corresponding to the NL input is identified and a corresponding action is executed. |
---|---|
Bibliography: | Application Number: US202016946586 |
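
The training loop outlined in the summary can be illustrated with a minimal sketch. This is not the patented method. It assumes a REINFORCE-style policy gradient on a small linear softmax policy in place of the neural network, and it assumes that a sampled action differing from the stored output action receives zero reward. All names (`ReplayTuple`, `sample_action`, `replay_gradient`, `apply_gradient`, `train`) are hypothetical and chosen for illustration only.

```python
import math
import random
from typing import List, NamedTuple


class ReplayTuple(NamedTuple):
    """One stored tuple, following the four fields named in the summary."""
    input_action: List[float]  # assumed: numeric feature encoding of the input action
    vector: List[float]        # assumed: action probabilities recorded for that input
    output_action: int         # index of the action that was actually taken
    reward: float              # reward value observed for that action


def softmax(logits: List[float]) -> List[float]:
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def policy(weights: List[List[float]], features: List[float]) -> List[float]:
    """Linear softmax policy standing in for the neural network."""
    logits = [sum(w * f for w, f in zip(row, features)) for row in weights]
    return softmax(logits)


def sample_action(vector: List[float], rng: random.Random) -> int:
    """Sample an action index in proportion to the stored vector."""
    return rng.choices(range(len(vector)), weights=vector, k=1)[0]


def replay_gradient(weights, features, action, reward):
    """Gradient of reward * log pi(action | features) for the linear softmax policy."""
    probs = policy(weights, features)
    return [
        [reward * ((1.0 if a == action else 0.0) - probs[a]) * f for f in features]
        for a in range(len(weights))
    ]


def apply_gradient(weights, grad, lr=0.1):
    """Gradient ascent step; returns new weights and leaves the input unchanged."""
    return [[w + lr * g for w, g in zip(w_row, g_row)]
            for w_row, g_row in zip(weights, grad)]


def train(weights, buffer, steps, rng):
    for _ in range(steps):
        t = rng.choice(buffer)                 # replay a stored tuple at random
        action = sample_action(t.vector, rng)  # sample an action from its vector
        # Assumption: the reward is only known for the stored output action.
        reward = t.reward if action == t.output_action else 0.0
        grad = replay_gradient(weights, t.input_action, action, reward)
        weights = apply_gradient(weights, grad)
    return weights


if __name__ == "__main__":
    rng = random.Random(0)
    buffer = [
        ReplayTuple([1.0, 0.0], [0.5, 0.5], 0, 1.0),
        ReplayTuple([0.0, 1.0], [0.5, 0.5], 1, 1.0),
    ]
    trained = train([[0.0, 0.0], [0.0, 0.0]], buffer, 200, rng)
    print(policy(trained, [1.0, 0.0]))
```

In this sketch a zero reward yields a zero gradient, so only replays that happen to sample the stored output action adjust the weights. That is one possible reading of "selectively adjust" and is an assumption, not a statement of what the patent claims.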