Decoding the Enigma: Benchmarking Humans and AIs on the Many Facets of Working Memory

Working memory (WM), a fundamental cognitive process facilitating the temporary storage, integration, manipulation, and retrieval of information, plays a vital role in reasoning and decision-making tasks. Robust benchmark datasets that capture the multifaceted nature of WM are crucial for the effect...

Full description

Saved in:
Bibliographic Details
Main Authors Sikarwar, Ankur, Zhang, Mengmi
Format Journal Article
LanguageEnglish
Published 20.07.2023
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Working memory (WM), a fundamental cognitive process facilitating the temporary storage, integration, manipulation, and retrieval of information, plays a vital role in reasoning and decision-making tasks. Robust benchmark datasets that capture the multifaceted nature of WM are crucial for the effective development and evaluation of AI WM models. Here, we introduce a comprehensive Working Memory (WorM) benchmark dataset for this purpose. WorM comprises 10 tasks and a total of 1 million trials, assessing 4 functionalities, 3 domains, and 11 behavioral and neural characteristics of WM. We jointly trained and tested state-of-the-art recurrent neural networks and transformers on all these tasks. We also include human behavioral benchmarks as an upper bound for comparison. Our results suggest that AI models replicate some characteristics of WM in the brain, most notably primacy and recency effects, and neural clusters and correlates specialized for different domains and functionalities of WM. In the experiments, we also reveal some limitations in existing models to approximate human behavior. This dataset serves as a valuable resource for communities in cognitive psychology, neuroscience, and AI, offering a standardized framework to compare and enhance WM models, investigate WM's neural underpinnings, and develop WM models with human-like capabilities. Our source code and data are available at https://github.com/ZhangLab-DeepNeuroCogLab/WorM.
AbstractList Working memory (WM), a fundamental cognitive process facilitating the temporary storage, integration, manipulation, and retrieval of information, plays a vital role in reasoning and decision-making tasks. Robust benchmark datasets that capture the multifaceted nature of WM are crucial for the effective development and evaluation of AI WM models. Here, we introduce a comprehensive Working Memory (WorM) benchmark dataset for this purpose. WorM comprises 10 tasks and a total of 1 million trials, assessing 4 functionalities, 3 domains, and 11 behavioral and neural characteristics of WM. We jointly trained and tested state-of-the-art recurrent neural networks and transformers on all these tasks. We also include human behavioral benchmarks as an upper bound for comparison. Our results suggest that AI models replicate some characteristics of WM in the brain, most notably primacy and recency effects, and neural clusters and correlates specialized for different domains and functionalities of WM. In the experiments, we also reveal some limitations in existing models to approximate human behavior. This dataset serves as a valuable resource for communities in cognitive psychology, neuroscience, and AI, offering a standardized framework to compare and enhance WM models, investigate WM's neural underpinnings, and develop WM models with human-like capabilities. Our source code and data are available at https://github.com/ZhangLab-DeepNeuroCogLab/WorM.
Author Zhang, Mengmi
Sikarwar, Ankur
Author_xml – sequence: 1
  givenname: Ankur
  surname: Sikarwar
  fullname: Sikarwar, Ankur
– sequence: 2
  givenname: Mengmi
  surname: Zhang
  fullname: Zhang, Mengmi
BackLink https://doi.org/10.48550/arXiv.2307.10768$$DView paper in arXiv
BookMark eNotj81OwzAQhH2AAxQegBN-gQTHJvGmt1L6J7XiUsQx2sSbNipZo6Qg8vZN055WszMazXcvbtgzCfEUqfAV4li9YPNf_YXaKBtGyiZwJz7fqfCu4p087knOuNrVOJZvxMW-xuZwNpa_NXIrkZ2crFrpeYhukDs5x4KO_auUX_4S3lDtm-5B3Jb43dLj9Y7Edj7bTpfB-mOxmk7WASYWAqtBGQ1ARjtnqYhB99qhpVzlpSKg1MYao8g4hN5NCXNwZZIYUHmKaEbi-VI7cGU_TdVv7rIzXzbwmRO8sUv_
ContentType Journal Article
Copyright http://creativecommons.org/licenses/by/4.0
Copyright_xml – notice: http://creativecommons.org/licenses/by/4.0
DBID AKY
ALC
GOX
DOI 10.48550/arxiv.2307.10768
DatabaseName arXiv Computer Science
arXiv Quantitative Biology
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2307_10768
GroupedDBID AKY
ALC
GOX
ID FETCH-LOGICAL-a678-72803288e32dd7ec582032da7eb0bf0e8e9752a113da8c589eab8df66380b9aa3
IEDL.DBID GOX
IngestDate Mon Jan 08 05:46:42 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a678-72803288e32dd7ec582032da7eb0bf0e8e9752a113da8c589eab8df66380b9aa3
OpenAccessLink https://arxiv.org/abs/2307.10768
ParticipantIDs arxiv_primary_2307_10768
PublicationCentury 2000
PublicationDate 2023-07-20
PublicationDateYYYYMMDD 2023-07-20
PublicationDate_xml – month: 07
  year: 2023
  text: 2023-07-20
  day: 20
PublicationDecade 2020
PublicationYear 2023
Score 1.8869404
SecondaryResourceType preprint
Snippet Working memory (WM), a fundamental cognitive process facilitating the temporary storage, integration, manipulation, and retrieval of information, plays a vital...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Artificial Intelligence
Computer Science - Computer Vision and Pattern Recognition
Computer Science - Learning
Quantitative Biology - Neurons and Cognition
Title Decoding the Enigma: Benchmarking Humans and AIs on the Many Facets of Working Memory
URI https://arxiv.org/abs/2307.10768
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV07T8MwELbaTiwIBKg8dQNrRGInjc1WoKUgFZZW6ladExs6NEVtQfDvuXOCYGE933S2dd-9vhPiMkcubikbKUQfpSkXCX3CF2JkL0tLGxc84Dx-6o2m6eMsm7UE_MzC4Ppz8VHzA9vNFXcpU3xJkLgt2lJyy9b986wuTgYqrkb_V48wZhD9cRLDPbHboDvo19exL1quOhDTOwrx2EUAgS0YVIuXJV7DDT2P1yWGVDWEVPoGKKqH_sMGVlVQHdNHhSEWbksiD01eG8bcHPt1KCbDweR2FDXbDCIkhxCFNVBSa6dkWeauyDTvLi8xdza2PnbamTyTmCSqRE2nxqHVpSdAoGNrENWR6FSrynUFxAYpBPZGFQmmPZVaQgEFeqZiY6pBcyy6wQbzt5qwYs7mmQfznPx_dCp2eJU65y1lfCY62_W7OyeHu7UXwerfffx_HA
link.rule.ids 228,230,783,888
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Decoding+the+Enigma%3A+Benchmarking+Humans+and+AIs+on+the+Many+Facets+of+Working+Memory&rft.au=Sikarwar%2C+Ankur&rft.au=Zhang%2C+Mengmi&rft.date=2023-07-20&rft_id=info:doi/10.48550%2Farxiv.2307.10768&rft.externalDocID=2307_10768