An approximate dynamic programming approach for comparing firing policies in a networked air defense environment

•The threat of theater ballistic missiles remains an important concern for many nations.•This research examines a networked, defense-in-depth air and missile defense problem.•Two approximate dynamic programming (ADP) algorithms provide firing policy solutions.•Designed experiments yield insights abo...

Full description

Saved in:

Bibliographic Details
Published in	Computers & operations research Vol. 117; pp. 104890 - 15
Main Authors	Summers, Daniel S., Robbins, Matthew J., Lunday, Brian J.
Format	Journal Article
Language	English
Published	New York Elsevier Ltd 01.05.2020 Pergamon Press Inc
Subjects	Air and missile defense Air defense Algorithms Approximate dynamic programming Ballistic missiles Computer simulation Defense programs Dynamic programming Dynamic weapon target assignment problem Firing Interceptors Least squares Markov decision processes Markov processes Military Operations research Policies Weapons Air and missile defense Approximate dynamic programming Markov decision processes Dynamic weapon target assignment problem Military
Online Access	Get full text

Cover

Loading…

More Information
Summary:	•The threat of theater ballistic missiles remains an important concern for many nations.•This research examines a networked, defense-in-depth air and missile defense problem.•Two approximate dynamic programming (ADP) algorithms provide firing policy solutions.•Designed experiments yield insights about algorithm performance and problem features.•ADP policies outperform current U.S. policy of firing two interceptors at each missile. An objective for effective air defense is to identify the firing policy for interceptor allocation to incoming missiles that minimizes the expected total damage to defended assets over a sequence of engagements. We formulate this dynamic weapon target assignment problem as a Markov decision process and utilize a simulation-based, approximate dynamic programming (ADP) approach to solve problem instances based on a representative scenario. Least squares policy evaluation and least squares temporal differences algorithms are developed to determine approximate solutions. A designed experiment investigates problem features such as conflict duration, attacker and defender weapon sophistication, and defended asset values. An empirical comparison of the ADP policies and two baseline policies (i.e., firing either one or two interceptors at each incoming theater ballistic missile (TBM)) yields several insights: the ADP policies outperform both baseline polices when conflict duration is short and attacker weapons are sophisticated; firing one interceptor at each TBM (regardless of inventory status) outperforms the tested ADP policies when conflict duration is long and attacker weapons are less sophisticated; and firing two interceptors at each TBM (regardless of inventory status), which is the United States Army’s currently implemented policy, is never the superlative policy for the test instances investigated.
ISSN:	0305-0548 1873-765X 0305-0548
DOI:	10.1016/j.cor.2020.104890