Policy Sharing Using Aggregation Trees for -Learning in a Continuous State and Action Spaces
Q-learning is a generic approach that uses a finite discrete state and an action domain to estimate action values using tabular or function approximation methods. An intelligent agent eventually learns policies from continuous sensory inputs and encodes these environmental inputs onto a discrete sta...
Saved in:
Published in | IEEE transactions on cognitive and developmental systems Vol. 12; no. 3; pp. 474 - 485 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
Piscataway
IEEE
01.09.2020
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Be the first to leave a comment!