Online heterogeneous multiagent learning under limited communication with applications to forest fire management

Many robotic missions require online estimation of the unknown state transition models associated with uncertainty that stems from mission dynamics. The learning problem is usually distributed among agents in multiagent scenarios, either due to the absence of a centralized processing unit or because...

Full description

Saved in:

Bibliographic Details
Published in	2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) pp. 5181 - 5188
Main Authors	Ure, N. Kemal, Omidshafiei, Shayegan, Lopez, Brett Thomas, Agha-Mohammadi, Ali-akbar, How, Jonathan P., Vian, John
Format	Conference Proceeding
Language	English
Published	IEEE 01.09.2015
Subjects	Approximation algorithms Computational modeling Convergence Data models Function approximation Heuristic algorithms Vehicle dynamics
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Many robotic missions require online estimation of the unknown state transition models associated with uncertainty that stems from mission dynamics. The learning problem is usually distributed among agents in multiagent scenarios, either due to the absence of a centralized processing unit or because of the large size of the joint learning problem. This paper addresses the problem of multiagent learning in the likely scenario that agents estimate different models from their measured data, but they can share information by communicating model parameters. Previous approaches either consider homogeneous scenarios or perform model transfer in an open-loop manner, which hinders the convergence rate. We develop a closed-loop multiagent learning algorithm, Collaborative Filtering-Decentralized Incremental Feature Dependency Discovery (CF-Dec-iFDD), which enables agents to learn and share models in heterogeneous scenarios. Each agent learns a linear function approximation of the actual model, and the number of features is increased incrementally to adjust model complexity based on the observed data. The agents obtain feedback from other agents on the model error reduction associated with the communicated features. Although this increases the communication cost of exchanging features, it improves the quality/utility of what is being exchanged, leading to improved convergence rate. The approach is demonstrated in indoor hardware flight tests on a forest fire management scenario for which agents must learn the transition model of the fire spread depending on external factors such as wind and vegetation. It is shown that CF-Dec-iFDD has superior convergence rate compared to the alternative approaches.
DOI:	10.1109/IROS.2015.7354107