Where’s the Reward? A Review of Reinforcement Learning for Instructional Sequencing

Since the 1960s, researchers have been trying to optimize the sequencing of instructional activities using the tools of reinforcement learning (RL) and sequential decision making under uncertainty. Many researchers have realized that reinforcement learning provides a natural framework for optimal in...

Full description

Saved in:

Bibliographic Details
Published in	International journal of artificial intelligence in education Vol. 29; no. 4; pp. 568 - 620
Main Authors	Doroudi, Shayan, Aleven, Vincent, Brunskill, Emma
Format	Journal Article
Language	English
Published	New York Springer New York 01.12.2019 Springer
Subjects	Artificial Intelligence Best Practices Cognitive Psychology Computer Science Computers and Education Decision Making Educational Technology Intelligent Tutoring Systems Learning Processes Reinforcement Sequential Learning User Interfaces and Human Computer Interaction Adaptive instruction Instructional sequencing Reinforcement learning History of artificial intelligence in education
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Since the 1960s, researchers have been trying to optimize the sequencing of instructional activities using the tools of reinforcement learning (RL) and sequential decision making under uncertainty. Many researchers have realized that reinforcement learning provides a natural framework for optimal instructional sequencing given a particular model of student learning, and excitement towards this area of research is as alive now as it was over fifty years ago. But does RL actually help students learn? If so, when and where might we expect it to be most helpful? To help answer these questions, we review the variety of attempts to use RL for instructional sequencing. First, we present a historical narrative of this research area. We identify three waves of research, which gives us a sense of the various communities of researchers that have been interested in this problem and where the field is going. Second, we review all of the empirical research that has compared RL-induced instructional policies to baseline methods of sequencing. We find that over half of the studies found that RL-induced policies significantly outperform baselines. Moreover, we identify five clusters of studies with different characteristics and varying levels of success in using RL to help students learn. We find that reinforcement learning has been most successful in cases where it has been constrained with ideas and theories from cognitive psychology and the learning sciences. However, given that our theories and models are limited, we also find that it has been useful to complement this approach with running more robust offline analyses that do not rely heavily on the assumptions of one particular model. Given that many researchers are turning to deep reinforcement learning and big data to tackle instructional sequencing, we believe keeping these best practices in mind can help guide the way to the reward in using RL for instructional sequencing.
ISSN:	1560-4292 1560-4306
DOI:	10.1007/s40593-019-00187-x