FLOW SHOP SCHEDULING WITH REINFORCEMENT LEARNING

Reinforcement learning (RL) is a state or action value based machine learning method which solves large-scale multi-stage decision problems such as Markov Decision Process (MDP) and Semi-Markov Decision Process (SMDP) problems. We minimize the makespan of flow shop scheduling problems with an RL alg...

Full description

Saved in:

Bibliographic Details
Published in	Asia-Pacific journal of operational research Vol. 30; no. 5; p. 1350014
Main Authors	ZHANG, ZHICONG, WANG, WEIPING, ZHONG, SHOUYAN, HU, KAISHUN
Format	Journal Article
Language	English
Published	Singapore World Scientific Publishing Co. & Operational Research Society of Singapore 01.10.2013 World Scientific Publishing Co. Pte., Ltd
Subjects	Algorithms Computation Job shop scheduling Job shops Machine learning Markov analysis Markov processes Operations research Production scheduling Scheduling flow shop TD(λ) Scheduling reinforcement learning
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Reinforcement learning (RL) is a state or action value based machine learning method which solves large-scale multi-stage decision problems such as Markov Decision Process (MDP) and Semi-Markov Decision Process (SMDP) problems. We minimize the makespan of flow shop scheduling problems with an RL algorithm. We convert flow shop scheduling problems into SMDPs by constructing elaborate state features, actions and the reward function. Minimizing the accumulated reward is equivalent to minimizing the schedule objective function. We apply on-line TD(λ) algorithm with linear gradient-descent function approximation to solve the SMDPs. To examine the performance of the proposed RL algorithm, computational experiments are conducted on benchmarking problems in comparison with other scheduling methods. The experimental results support the efficiency of the proposed algorithm and illustrate that the RL approach is a promising computational approach for flow shop scheduling problems worthy of further investigation.
ISSN:	0217-5959 1793-7019 0217-5959
DOI:	10.1142/S0217595913500140