Influence of learning strategy on response time during complex value-based learning and choice

Measurements of response time (RT) have long been used to infer neural processes underlying various cognitive functions such as working memory, attention, and decision making. However, it is currently unknown if RT is also informative about various stages of value-based choice, particularly how rewa...

Full description

Saved in:

Bibliographic Details
Published in	PloS one Vol. 13; no. 5; p. e0197263
Main Authors	Farashahi, Shiva, Rowe, Katherine, Aslami, Zohra, Gobbini, Maria Ida, Soltani, Alireza
Format	Journal Article
Language	English
Published	United States Public Library of Science 22.05.2018 Public Library of Science (PLoS)
Subjects	Anticipation, Psychological Biology and Life Sciences Choice Behavior Choice learning Cognitive ability Decision making Experiments Feedback Feedback, Psychological Female Humans Learning Male Models, Psychological Neural circuitry Observations Psychological research Reaction Time Reaction time (Psychology) Reinforcement Response time Reward Shape memory Short term memory Social Sciences Testing United States > US
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Measurements of response time (RT) have long been used to infer neural processes underlying various cognitive functions such as working memory, attention, and decision making. However, it is currently unknown if RT is also informative about various stages of value-based choice, particularly how reward values are constructed. To investigate these questions, we analyzed the pattern of RT during a set of multi-dimensional learning and decision-making tasks that can prompt subjects to adopt different learning strategies. In our experiments, subjects could use reward feedback to directly learn reward values associated with possible choice options (object-based learning). Alternatively, they could learn reward values of options' features (e.g. color, shape) and combine these values to estimate reward values for individual options (feature-based learning). We found that RT was slower when the difference between subjects' estimates of reward probabilities for the two alternative objects on a given trial was smaller. Moreover, RT was overall faster when the preceding trial was rewarded or when the previously selected object was present. These effects, however, were mediated by an interaction between these factors such that subjects were faster when the previously selected object was present rather than absent but only after unrewarded trials. Finally, RT reflected the learning strategy (i.e. object-based or feature-based approach) adopted by the subject on a trial-by-trial basis, indicating an overall faster construction of reward value and/or value comparison during object-based learning. Altogether, these results demonstrate that the pattern of RT can be informative about how reward values are learned and constructed during complex value-based learning and decision making.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 Competing Interests: The authors have declared that no competing interests exist.
ISSN:	1932-6203 1932-6203
DOI:	10.1371/journal.pone.0197263