Large-Scale Online Multitask Learning and Decision Making for Flexible Manufacturing

Bibliographic Details
Published in IEEE Transactions on Industrial Informatics Vol. 12; no. 6; pp. 2139-2147
Main Authors Wang, JunPing; Sun, YunChuan; Zhang, WenSheng; Thomas, Ian; Duan, ShiHui; Shi, YouKang
Format Journal Article
Language English
Published Piscataway IEEE 01.12.2016
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Summary: Large-scale machine coordination is a primary approach for flexible manufacturing, enabling large-scale autonomous machines to dynamically coordinate their actions in pursuit of a custom task. One of the key challenges for such large-scale systems is finding high-dimensional coordination decision-making policies. Multitask policy gradient algorithms can be used to search for high-dimensional policies, particularly in collaborative decision support systems and distributed control systems. However, it is difficult for these algorithms to learn high-dimensional coordination control policies (CCP) online from large-scale custom manufacturing tasks. This paper proposes a large-scale online multitask learning and decision-making approach that can consecutively learn high-dimensional CCP in order to quickly coordinate machine actions online for large-scale custom manufacturing tasks. A large-scale online multitask learning algorithm is developed, which is able to learn large-scale high-dimensional CCP in a flexible manufacturing scenario. An online stochastic planning algorithm is proposed, which optimizes the Markov network structure online in order to avoid an expensive global search for the optimal policy. Experiments were conducted on a professional flexible manufacturing testbed deployed within a smart factory of Weichai Power in China. Results show the proposed approach to be more efficient than previous work.
ISSN: 1551-3203, 1941-0050
DOI: 10.1109/TII.2016.2549919