Large-Scale Online Multitask Learning and Decision Making for Flexible Manufacturing
Large-scale machine coordination is a primary approach for flexible manufacturing, enabling large-scale autonomous machines to dynamically coordinate their actions in pursuit of a custom task. One of the key challenges for such large-scale systems is finding high-dimensional coordination decision-ma...
Saved in:
Published in | IEEE transactions on industrial informatics Vol. 12; no. 6; pp. 2139 - 2147 |
---|---|
Main Authors | , , , , , |
Format | Journal Article |
Language | English |
Published |
Piscataway
IEEE
01.12.2016
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Large-scale machine coordination is a primary approach for flexible manufacturing, enabling large-scale autonomous machines to dynamically coordinate their actions in pursuit of a custom task. One of the key challenges for such large-scale systems is finding high-dimensional coordination decision-making policies. Multitask policy gradient algorithms can be used in search of high-dimensional policies, particularly in collaborative decision support systems and distributed control systems. However, it is difficult for these algorithms to learn online high-dimensional coordination control policies (CCP) from large-scale custom manufacturing tasks. This paper proposes a large-scale online multitask learning and decision-making approach, which can consecutively learn high-dimensional CCP in order to quickly coordinate machine actions online for large-scale custom manufacturing task. A large-scale online multitask leaning algorithm is developed, which is able to learn large-scale high-dimensional CCP in a flexible manufacturing scenario. An online stochastic planning algorithm is proposed, which online optimizes the Markov network structure in order to avoid expensive global search for the optimal policy. Experiments have been undertaken using a professional flexible manufacturing testbed deployed within a smart factory of Weichai Power in China. Results show the proposed approach to be more efficient when compared with previous works. |
---|---|
ISSN: | 1551-3203 1941-0050 |
DOI: | 10.1109/TII.2016.2549919 |