Policy Sharing Using Aggregation Trees for -Learning in a Continuous State and Action Spaces

Q-learning is a generic approach that uses a finite discrete state and an action domain to estimate action values using tabular or function approximation methods. An intelligent agent eventually learns policies from continuous sensory inputs and encodes these environmental inputs onto a discrete sta...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on cognitive and developmental systems Vol. 12; no. 3; pp. 474 - 485
Main Authors Chen, Yu-Jen, Jiang, Wei-Cheng, Ju, Ming-Yi, Hwang, Kao-Shing
Format Journal Article
LanguageEnglish
Published Piscataway IEEE 01.09.2020
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text

Cover

Loading…