Measuring Convergence Inertia: Online Learning in Self-adaptive Systems with Context Shifts

Bibliographic Details
Published in	Leveraging Applications of Formal Methods, Verification and Validation. Adaptation and Learning, Vol. 13703, pp. 231-248
Main Authors	Alberts, Elvin; Gerostathopoulos, Ilias
Format	Book Chapter
Language	English
Published	Switzerland: Springer (Springer Nature Switzerland), 2022
Series	Lecture Notes in Computer Science
Subjects	Convergence inertia; Non-stationary; Online learning; Self-adaptive systems
ISBN	3031197585 9783031197581
ISSN	0302-9743 1611-3349
DOI	10.1007/978-3-031-19759-8_15

More Information
Summary:	To deal with situations they were not specifically designed for (unknown-unknowns), self-adaptive systems need to learn the best, or at least a good enough, action to perform in each context faced during operation. An established solution is online learning. The complexity of online learning, however, increases in the presence of context shifts, which are typical in self-adaptive systems. In this paper, we (i) propose a new metric, convergence inertia, to assess the robustness of reinforcement learning policies against context shifts, and (ii) use it to assess the robustness of different policies within the family of multi-armed bandits (MAB) to context shifts. Through an experiment with a self-adaptation exemplar of a web server, we demonstrate that inertia and the accompanying interpretation of the unknown-unknowns problem are a viable way to inform the selection of online learning policies for self-adaptive systems, since they bring the influence of context shifts to the forefront. In our experiment, we found that non-stationary MAB policies are better suited to handling context shifts in terms of inertia, although stationary policies tend to perform well in terms of overall convergence.
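
The summary's contrast between stationary and non-stationary MAB policies can be illustrated with a minimal sketch. It is not taken from the chapter: the two update rules are the textbook sample-average and constant step-size value estimates, and `steps_to_recover` is a hypothetical stand-in used only for illustration, not the chapter's definition of convergence inertia. The reward sequence, `alpha`, and `tol` are likewise assumed values.

```python
def sample_average(rewards):
    """Stationary-style estimate: running mean over all rewards seen so far."""
    estimate, history = 0.0, []
    for n, reward in enumerate(rewards, start=1):
        estimate += (reward - estimate) / n
        history.append(estimate)
    return history


def constant_step(rewards, alpha=0.1):
    """Non-stationary-style estimate: constant step size, so recent rewards weigh more."""
    estimate, history = 0.0, []
    for reward in rewards:
        estimate += alpha * (reward - estimate)
        history.append(estimate)
    return history


def steps_to_recover(history, shift_at, target, tol=0.1):
    """Hypothetical measure (assumption, not the chapter's metric): number of
    steps after the shift until the estimate is within tol of the new mean.
    Returns None if that never happens within the recorded history."""
    for i in range(shift_at, len(history)):
        if abs(history[i] - target) <= tol:
            return i - shift_at + 1
    return None


# One arm whose reward drops from 1.0 to 0.0 at step 200 (an assumed context shift).
rewards = [1.0] * 200 + [0.0] * 200

slow = steps_to_recover(sample_average(rewards), shift_at=200, target=0.0)
fast = steps_to_recover(constant_step(rewards), shift_at=200, target=0.0)
print("sample-average recovery steps:", slow)
print("constant-step recovery steps:", fast)
```

In this toy setting the sample-average estimate stays anchored to the pre-shift rewards for the whole remaining run, while the constant step-size estimate catches up within a few dozen steps. That is the qualitative behavior the summary attributes to stationary versus non-stationary policies.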