Sufficient reliability of the behavioral and computational readouts of a probabilistic reversal learning task

Bibliographic Details
Published in: Behavior Research Methods, Vol. 54, No. 6, pp. 2993-3014
Main Authors: Waltmann, Maria; Schlagenhauf, Florian; Deserno, Lorenz
Format: Journal Article
Language: English
Published: New York: Springer US (Springer Nature B.V.), 01.12.2022

Summary: Task-based measures that capture neurocognitive processes can help bridge the gap between brain and behavior. To transfer tasks to clinical application, reliability is a crucial benchmark because it imposes an upper bound on potential correlations with other variables (e.g., symptom or brain data). However, the reliability of many task readouts is low. In this study, we scrutinized the retest reliability of a probabilistic reversal learning task (PRLT) that is frequently used to characterize cognitive flexibility in psychiatric populations. We analyzed data from N = 40 healthy subjects, who completed the PRLT twice. We focused on how individual metrics are derived, i.e., whether data were partially pooled across participants and whether priors were used to inform estimates. We compared the reliability of the resulting indices across sessions, as well as the internal consistency of a selection of indices. We found good to excellent reliability for behavioral indices as derived from mixed-effects models that included data from both sessions. The internal consistency was good to excellent. For indices derived from computational modeling, we found excellent reliability when using hierarchical estimation with empirical priors and including data from both sessions. Our results indicate that the PRLT is well equipped to measure individual differences in cognitive flexibility in reinforcement learning. However, this depends heavily on hierarchical modeling of the longitudinal data (whether sessions are modeled separately or jointly), on estimation methods, and on the combination of parameters included in computational models. We discuss implications for the applicability of PRLT indices in psychiatric research and as diagnostic tools.
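The abstract's claim that reliability "imposes an upper bound on potential correlations" is the classical attenuation bound from test theory: the observable correlation between two measures cannot exceed the square root of the product of their reliabilities. A minimal sketch (the function name is ours, not from the article):

```python
import math

def max_observable_correlation(reliability_a: float, reliability_b: float) -> float:
    """Classical attenuation bound: the correlation between two observed
    measures is capped at sqrt(rel_a * rel_b), so low task reliability
    limits any brain-behavior or symptom correlation that uses it."""
    return math.sqrt(reliability_a * reliability_b)

# Example: a task index with retest reliability 0.60, correlated with a
# symptom score of reliability 0.80, can reach at most about 0.69.
print(round(max_observable_correlation(0.60, 0.80), 2))
```

This is why the article treats retest reliability as a benchmark for clinical transfer: even a perfect underlying association is attenuated toward this ceiling when the task readout is noisy.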
ISSN: 1554-351X; 1554-3528
DOI: 10.3758/s13428-021-01739-7