Modelling continual learning in humans with Hebbian context gating and exponentially decaying task signals

Humans can learn several tasks in succession with minimal mutual interference but perform more poorly when trained on multiple tasks at once. The opposite is true for standard deep neural networks. Here, we propose novel computational constraints for artificial neural networks, inspired by earlier w...

Full description

Saved in:

Bibliographic Details
Published in	PLoS computational biology Vol. 19; no. 1; p. e1010808
Main Authors	Flesch, Timo, Nagy, David G, Saxe, Andrew, Summerfield, Christopher
Format	Journal Article
Language	English
Published	United States Public Library of Science 01.01.2023 Public Library of Science (PLoS)
Subjects	Analysis Animals Architecture Artificial neural networks Biology and Life Sciences Brain Computer and Information Sciences Computer applications Curricula Curriculum Deep learning Gating Geometry Humans Interference Interference (Perception) Learning Machine Learning Medicine and Health Sciences Neural networks Neural Networks, Computer Neurosciences Prefrontal Cortex Psychological research Representations Social Sciences Stochasticity System theory Training United Kingdom
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Humans can learn several tasks in succession with minimal mutual interference but perform more poorly when trained on multiple tasks at once. The opposite is true for standard deep neural networks. Here, we propose novel computational constraints for artificial neural networks, inspired by earlier work on gating in the primate prefrontal cortex, that capture the cost of interleaved training and allow the network to learn two tasks in sequence without forgetting. We augment standard stochastic gradient descent with two algorithmic motifs, so-called "sluggish" task units and a Hebbian training step that strengthens connections between task units and hidden units that encode task-relevant information. We found that the "sluggish" units introduce a switch-cost during training, which biases representations under interleaved training towards a joint representation that ignores the contextual cue, while the Hebbian step promotes the formation of a gating scheme from task units to the hidden layer that produces orthogonal representations which are perfectly guarded against interference. Validating the model on previously published human behavioural data revealed that it matches performance of participants who had been trained on blocked or interleaved curricula, and that these performance differences were driven by misestimation of the true category boundary.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 The authors have declared that no competing interests exist.
ISSN:	1553-7358 1553-734X 1553-7358
DOI:	10.1371/journal.pcbi.1010808