FoX: Formation-aware exploration in multi-agent reinforcement learning

Recently, deep multi-agent reinforcement learning (MARL) has gained significant popularity due to its success in various cooperative multi-agent tasks. However, exploration still remains a challenging problem in MARL due to the partial observability of the agents and the exploration space that can g...

Full description

Saved in:

Bibliographic Details
Main Authors	Jo, Yonghyeon, Lee, Sunwoo, Yeom, Junghyuk, Han, Seungyul
Format	Journal Article
Language	English
Published	22.08.2023
Subjects	Computer Science - Learning
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Recently, deep multi-agent reinforcement learning (MARL) has gained significant popularity due to its success in various cooperative multi-agent tasks. However, exploration still remains a challenging problem in MARL due to the partial observability of the agents and the exploration space that can grow exponentially as the number of agents increases. Firstly, in order to address the scalability issue of the exploration space, we define a formation-based equivalence relation on the exploration space and aim to reduce the search space by exploring only meaningful states in different formations. Then, we propose a novel formation-aware exploration (FoX) framework that encourages partially observable agents to visit the states in diverse formations by guiding them to be well aware of their current formation solely based on their own observations. Numerical results show that the proposed FoX framework significantly outperforms the state-of-the-art MARL algorithms on Google Research Football (GRF) and sparse Starcraft II multi-agent challenge (SMAC) tasks.
DOI:	10.48550/arxiv.2308.11272