Q-learning solution for optimal consensus control of discrete-time multiagent systems using reinforcement learning

This paper investigates a Q-learning scheme for the optimal consensus control of discrete-time multiagent systems. The Q-learning algorithm is conducted by reinforcement learning (RL) using system data instead of system dynamics information. In the multiagent systems, the agents are interacted with...

Full description

Saved in:
Bibliographic Details
Published inJournal of the Franklin Institute Vol. 356; no. 13; pp. 6946 - 6967
Main Authors Mu, Chaoxu, Zhao, Qian, Gao, Zhongke, Sun, Changyin
Format Journal Article
LanguageEnglish
Published Elmsford Elsevier Ltd 01.09.2019
Elsevier Science Ltd
Subjects
Online AccessGet full text

Cover

Loading…