Linear programming formulation for non-stationary, finite-horizon Markov decision process models
Linear programming (LP) formulations are often employed to solve stationary, infinite-horizon Markov decision process (MDP) models. We present an LP approach to solving non-stationary, finite-horizon MDP models that can potentially overcome the computational challenges of standard MDP solution proce...
Saved in:
Published in | Operations research letters Vol. 45; no. 6; pp. 570 - 574 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
Elsevier B.V
01.11.2017
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Linear programming (LP) formulations are often employed to solve stationary, infinite-horizon Markov decision process (MDP) models. We present an LP approach to solving non-stationary, finite-horizon MDP models that can potentially overcome the computational challenges of standard MDP solution procedures. Specifically, we establish the existence of an LP formulation for risk-neutral MDP models whose states and transition probabilities are temporally heterogeneous. This formulation can be recast as an approximate linear programming formulation with significantly fewer decision variables. |
---|---|
ISSN: | 0167-6377 1872-7468 |
DOI: | 10.1016/j.orl.2017.09.001 |