Decoding the Enigma: Benchmarking Humans and AIs on the Many Facets of Working Memory

Bibliographic Details
Main Authors	Sikarwar, Ankur; Zhang, Mengmi
Format	Journal Article
Language	English
Published	20.07.2023
Subjects	Computer Science - Artificial Intelligence; Computer Science - Computer Vision and Pattern Recognition; Computer Science - Learning; Quantitative Biology - Neurons and Cognition
Online Access	https://arxiv.org/abs/2307.10768


Abstract	Working memory (WM), a fundamental cognitive process facilitating the temporary storage, integration, manipulation, and retrieval of information, plays a vital role in reasoning and decision-making tasks. Robust benchmark datasets that capture the multifaceted nature of WM are crucial for the effective development and evaluation of AI WM models. Here, we introduce a comprehensive Working Memory (WorM) benchmark dataset for this purpose. WorM comprises 10 tasks and a total of 1 million trials, assessing 4 functionalities, 3 domains, and 11 behavioral and neural characteristics of WM. We jointly trained and tested state-of-the-art recurrent neural networks and transformers on all these tasks. We also include human behavioral benchmarks as an upper bound for comparison. Our results suggest that AI models replicate some characteristics of WM in the brain, most notably primacy and recency effects, and neural clusters and correlates specialized for different domains and functionalities of WM. In the experiments, we also reveal some limitations in existing models to approximate human behavior. This dataset serves as a valuable resource for communities in cognitive psychology, neuroscience, and AI, offering a standardized framework to compare and enhance WM models, investigate WM's neural underpinnings, and develop WM models with human-like capabilities. Our source code and data are available at https://github.com/ZhangLab-DeepNeuroCogLab/WorM.
Copyright	http://creativecommons.org/licenses/by/4.0
DOI	10.48550/arxiv.2307.10768
OpenAccessLink	https://arxiv.org/abs/2307.10768
SecondaryResourceType	preprint