Testing the Predictions of Surprisal Theory in 11 Languages

Surprisal theory posits that less-predictable words should take more time to process, with word predictability quantified as surprisal, i.e., negative log probability in context. While evidence supporting the predictions of surprisal theory has been replicated widely, much of it has focused on a ver...

Full description

Saved in:
Bibliographic Details
Published inTransactions of the Association for Computational Linguistics Vol. 11; pp. 1451 - 1470
Main Authors Wilcox, Ethan G., Pimentel, Tiago, Meister, Clara, Cotterell, Ryan, Levy, Roger P.
Format Journal Article
LanguageEnglish
Published Cambridge MIT Press Journals, The 14.12.2023
The MIT Press
Subjects
Online AccessGet full text
ISSN2307-387X
2307-387X
DOI10.1162/tacl_a_00612

Cover

Loading…
Abstract Surprisal theory posits that less-predictable words should take more time to process, with word predictability quantified as surprisal, i.e., negative log probability in context. While evidence supporting the predictions of surprisal theory has been replicated widely, much of it has focused on a very narrow slice of data: native English speakers reading English texts. Indeed, no comprehensive multilingual analysis exists. We address this gap in the current literature by investigating the relationship between surprisal and reading times in eleven different languages, distributed across five language families. Deriving estimates from language models trained on monolingual and multilingual corpora, we test three predictions associated with surprisal theory: (i) whether surprisal is predictive of reading times, (ii) whether expected surprisal, i.e., contextual entropy, is predictive of reading times, and (iii) whether the linking function between surprisal and reading times is linear. We find that all three predictions are borne out crosslinguistically. By focusing on a more diverse set of languages, we argue that these results offer the most robust link to date between information theory and incremental language processing across languages.
AbstractList Surprisal theory posits that less-predictable words should take more time to process, with word predictability quantified as surprisal, i.e., negative log probability in context. While evidence supporting the predictions of surprisal theory has been replicated widely, much of it has focused on a very narrow slice of data: native English speakers reading English texts. Indeed, no comprehensive multilingual analysis exists. We address this gap in the current literature by investigating the relationship between surprisal and reading times in eleven different languages, distributed across five language families. Deriving estimates from language models trained on monolingual and multilingual corpora, we test three predictions associated with surprisal theory: (i) whether surprisal is predictive of reading times, (ii) whether expected surprisal, i.e., contextual entropy, is predictive of reading times, and (iii) whether the linking function between surprisal and reading times is linear. We find that all three predictions are borne out crosslinguistically. By focusing on a more diverse set of languages, we argue that these results offer the most robust link to date between information theory and incremental language processing across languages.
Author Meister, Clara
Pimentel, Tiago
Levy, Roger P.
Wilcox, Ethan G.
Cotterell, Ryan
Author_xml – sequence: 1
  givenname: Ethan G.
  surname: Wilcox
  fullname: Wilcox, Ethan G.
– sequence: 2
  givenname: Tiago
  surname: Pimentel
  fullname: Pimentel, Tiago
– sequence: 3
  givenname: Clara
  surname: Meister
  fullname: Meister, Clara
– sequence: 4
  givenname: Ryan
  surname: Cotterell
  fullname: Cotterell, Ryan
– sequence: 5
  givenname: Roger P.
  surname: Levy
  fullname: Levy, Roger P.
BookMark eNptkE9LAzEQxYMoWGtvfoAFr1YzSTbJ4knEf1BQsIK3MMlm25R1U5PtwW_vahWKeJpheO_xm3dE9rvYeUJOgJ4DSHbRo2sNGkolsD0yYpyqKdfqdX9nPySTnFeUUtCgqWQjcjn3uQ_douiXvnhKvg6uD7HLRWyK501ap5CxLeZLH9NHEboCoJhht9jgwudjctBgm_3kZ47Jy-3N_Pp-Onu8e7i-mk0dl6qfogYUloPQFr0CUZXCUSWsrGqvqaWAlbbOY61QNpYxaQX1dcltXVFOOfIxedjm1hFXZkB6w_RhIgbzfYhpYTD1wbXeyEpr8MKikkw45FUJ6KUSpWiw9FwPWafbrHWK75vhd7OKm9QN-IZVAAKYVGxQsa3KpZhz8o1xocevYvqEoTVAzVfnZrfzwXT2x_SL-q_8Ewv5hGY
CitedBy_id crossref_primary_10_1016_j_jml_2023_104497
crossref_primary_10_1016_j_cognition_2024_105765
crossref_primary_10_1080_0907676X_2024_2418016
crossref_primary_10_1162_opmi_a_00119
crossref_primary_10_3758_s13428_023_02261_8
crossref_primary_10_1038_s41597_025_04771_w
crossref_primary_10_1162_opmi_a_00150
crossref_primary_10_3758_s13423_024_02588_z
crossref_primary_10_1515_lingvan_2023_0187
crossref_primary_10_3758_s13428_024_02523_z
crossref_primary_10_1073_pnas_2307876121
crossref_primary_10_1162_tacl_a_00714
crossref_primary_10_1146_annurev_linguistics_011724_121517
crossref_primary_10_1016_j_jml_2024_104603
crossref_primary_10_1111_cogs_13501
crossref_primary_10_1111_cogs_13478
crossref_primary_10_1016_j_jml_2024_104534
crossref_primary_10_3758_s13428_024_02561_7
Cites_doi 10.3115/1073336.1073357
10.3115/1699510.1699553
10.18653/v1/2021.acl-long.405
10.3758/s13414-011-0219-2
10.1016/j.cognition.2007.05.006
10.18653/v1/2021.emnlp-main.74
10.3758/s13428-021-01772-6
10.1111/tops.12025
10.18653/v1/2020.acl-main.747
10.18653/v1/N19-4009
10.1098/rsos.211837
10.18653/v1/2022.emnlp-main.712
10.1163/9780585492230
10.1016/j.jml.2020.104174
10.18653/v1/W18-0102
10.1111/cogs.12274
10.1162/tacl_a_00603
10.18653/v1/2021.acl-long.90
10.1016/B978-008044980-7/50017-3
10.31234/osf.io/qjnpv
10.1037/0096-3445.111.2.228
10.1126/sciadv.aaw2594
10.2307/1912791
10.4324/9780203123430
10.3758/s13428-017-0908-4
10.18653/v1/2021.emnlp-main.73
10.3758/BRM.41.1.163
10.31234/osf.io/4hyna
10.1207/s15516709cog0000_64
10.1177/0956797611409589
10.18653/v1/N19-1413
10.1016/j.jml.2019.104082
10.18653/v1/2021.naacl-main.10
10.1016/0010-0285(75)90005-5
10.18653/v1/N18-2085
10.1023/A:1022492123056
10.1016/j.cognition.2013.02.013
10.1037/0033-2909.124.3.372
10.1002/j.1538-7305.1948.tb01338.x
10.18653/v1/P18-1007
10.18653/v1/2021.acl-long.288
10.7551/mitpress/7503.003.0111
10.1353/lan.2011.0057
10.1016/j.cognition.2008.07.008
10.1016/j.jml.2012.11.001
10.18653/v1/P19-1491
10.1162/tacl_a_00548
10.18653/v1/N19-1423
ContentType Journal Article
Copyright 2023. This work is published under https://creativecommons.org/licenses/by/4.0/legalcode (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Copyright_xml – notice: 2023. This work is published under https://creativecommons.org/licenses/by/4.0/legalcode (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
DBID AAYXX
CITATION
7T9
8FE
8FG
ABUWG
AFKRA
ALSLI
ARAPS
AZQEC
BENPR
BGLVJ
CCPQU
CPGLG
CRLPW
DWQXO
GNUQQ
HCIFZ
JQ2
K7-
P5Z
P62
PHGZM
PHGZT
PIMPY
PKEHL
PQEST
PQGLB
PQQKQ
PQUKI
PRQQA
DOA
DOI 10.1162/tacl_a_00612
DatabaseName CrossRef
Linguistics and Language Behavior Abstracts (LLBA)
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Central (Alumni)
ProQuest Central UK/Ireland
Social Science Premium Collection
Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Central
Technology Collection
ProQuest One Community College
Linguistics Collection
Linguistics Database
ProQuest Central Korea
ProQuest Central Student
SciTech Premium Collection
ProQuest Computer Science Collection
Computer Science Database
ProQuest advanced technologies & aerospace journals
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Premium
ProQuest One Academic
Publicly Available Content Database
ProQuest One Academic Middle East (New)
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest One Social Sciences
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
Publicly Available Content Database
Computer Science Database
ProQuest Central Student
Technology Collection
ProQuest One Academic Middle East (New)
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Computer Science Collection
ProQuest Central (Alumni Edition)
SciTech Premium Collection
ProQuest One Community College
ProQuest Central
ProQuest One Applied & Life Sciences
Linguistics Collection
ProQuest Central Korea
ProQuest Central (New)
Advanced Technologies & Aerospace Collection
Social Science Premium Collection
ProQuest One Social Sciences
ProQuest One Academic Eastern Edition
Linguistics and Language Behavior Abstracts (LLBA)
ProQuest Technology Collection
ProQuest SciTech Collection
Advanced Technologies & Aerospace Database
ProQuest One Academic UKI Edition
Linguistics Database
ProQuest One Academic
ProQuest One Academic (New)
DatabaseTitleList CrossRef
Publicly Available Content Database
Database_xml – sequence: 1
  dbid: DOA
  name: Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: 8FG
  name: ProQuest Technology Collection
  url: https://search.proquest.com/technologycollection1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
EISSN 2307-387X
EndPage 1470
ExternalDocumentID oai_doaj_org_article_69881e4ba7624ca3951ae67454fa5e38
10_1162_tacl_a_00612
GroupedDBID AAFWJ
AAYXX
ABUWG
AFKRA
AFPKN
ALMA_UNASSIGNED_HOLDINGS
ALSLI
ARAPS
BENPR
BGLVJ
CCPQU
CITATION
CPGLG
CRLPW
DWQXO
EBS
GROUPED_DOAJ
HCIFZ
JMNJE
K7-
M~E
OJV
OK1
PHGZM
PHGZT
PIMPY
RMI
7T9
8FE
8FG
AZQEC
GNUQQ
JQ2
P62
PKEHL
PQEST
PQGLB
PQQKQ
PQUKI
PRQQA
PUEGO
ID FETCH-LOGICAL-c367t-a81a4b3148bae714954c074b69de80b01a98bcead7a6fb226b40ed53bd90303a3
IEDL.DBID DOA
ISSN 2307-387X
IngestDate Wed Aug 27 01:20:25 EDT 2025
Sat Jul 26 00:01:17 EDT 2025
Tue Jul 01 03:28:36 EDT 2025
Thu Apr 24 23:02:27 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c367t-a81a4b3148bae714954c074b69de80b01a98bcead7a6fb226b40ed53bd90303a3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
OpenAccessLink https://doaj.org/article/69881e4ba7624ca3951ae67454fa5e38
PQID 2911412672
PQPubID 6535866
PageCount 20
ParticipantIDs doaj_primary_oai_doaj_org_article_69881e4ba7624ca3951ae67454fa5e38
proquest_journals_2911412672
crossref_citationtrail_10_1162_tacl_a_00612
crossref_primary_10_1162_tacl_a_00612
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2023-12-14
PublicationDateYYYYMMDD 2023-12-14
PublicationDate_xml – month: 12
  year: 2023
  text: 2023-12-14
  day: 14
PublicationDecade 2020
PublicationPlace Cambridge
PublicationPlace_xml – name: Cambridge
PublicationTitle Transactions of the Association for Computational Linguistics
PublicationYear 2023
Publisher MIT Press Journals, The
The MIT Press
Publisher_xml – name: MIT Press Journals, The
– name: The MIT Press
References Jegerski (2023121518583196000_bib32) 2013
Frank (2023121518583196000_bib17) 2008
de Varda (2023121518583196000_bib11) 2022
Conneau (2023121518583196000_bib8) 2020
Linzen (2023121518583196000_bib41) 2016; 40
Luke (2023121518583196000_bib42) 2018; 50
Boyce (2023121518583196000_bib3) 2020; 111
Hale (2023121518583196000_bib25) 2003; 32
Wilcox (2023121518583196000_bib66) 2020
Rayner (2023121518583196000_bib51) 1975; 7
Cotterell (2023121518583196000_bib9) 2018
Shannon (2023121518583196000_bib59) 1948; 27
Boyce (2023121518583196000_bib4) 2020
Just (2023121518583196000_bib33) 1982; 111
Schotter (2023121518583196000_bib55) 2012; 74
Siegelman (2023121518583196000_bib61) 2022; 54
Frank (2023121518583196000_bib18) 2010
Shain (2023121518583196000_bib58) 2022
Shliazhko (2023121518583196000_bib60) 2022
Haspelmath (2023121518583196000_bib28) 2005
Demberg (2023121518583196000_bib12) 2008; 109
Hillert (2023121518583196000_bib29) 1998
Mielke (2023121518583196000_bib44) 2019
Cevoli (2023121518583196000_bib6) 2022; 9
Coupé (2023121518583196000_bib10) 2019; 5
Hart (2023121518583196000_bib27) 1995
Byung-Doh (2023121518583196000_bib45) 2023; 11
Meister (2023121518583196000_bib43) 2021
Brothers (2023121518583196000_bib5) 2021; 116
Granger (2023121518583196000_bib22) 1969; 37
Kingma (2023121518583196000_bib35) 2015
Kuribayashi (2023121518583196000_bib37) 2022
Pimentel (2023121518583196000_bib49) 2023
Fossum (2023121518583196000_bib16) 2012
Roark (2023121518583196000_bib53) 2009
Shain (2023121518583196000_bib57) 2021
Kennedy (2023121518583196000_bib34) 2003
Ott (2023121518583196000_bib46) 2019
Speer (2023121518583196000_bib63) 2022
Hollenstein (2023121518583196000_bib30) 2021
Raffel (2023121518583196000_bib50) 2020; 21
Clifton (2023121518583196000_bib7) 2007
Guo (2023121518583196000_bib23) 2020
van Schijndel (2023121518583196000_bib64) 2017
Hale (2023121518583196000_bib26) 2006; 30
Rayner (2023121518583196000_bib52) 1998; 124
Levy (2023121518583196000_bib39) 2008; 106
Levy (2023121518583196000_bib40) 2006; 19
Rönnqvist (2023121518583196000_bib54) 2019
Agerri (2023121518583196000_bib1) 2020
Forster (2023121518583196000_bib15) 2009; 41
Kudo (2023121518583196000_bib36) 2018
Pimentel (2023121518583196000_bib48) 2021
Zhang (2023121518583196000_bib67) 2021
Doddapaneni (2023121518583196000_bib14) 2021
Frank (2023121518583196000_bib19) 2013; 5
Goodkind (2023121518583196000_bib21) 2018
Hale (2023121518583196000_bib24) 2001
Kuribayashi (2023121518583196000_bib38) 2021
Smith (2023121518583196000_bib62) 2013; 128
Devlin (2023121518583196000_bib13) 2019
Hoover (2023121518583196000_bib31) 2022
Frank (2023121518583196000_bib20) 2011; 22
Virtanen (2023121518583196000_bib65) 2019
Shain (2023121518583196000_bib56) 2019
Pellegrino (2023121518583196000_bib47) 2011; 87
Barr (2023121518583196000_bib2) 2013; 68
References_xml – volume-title: Second Meeting of the North American Chapter of the Association for Computational Linguistics
  year: 2001
  ident: 2023121518583196000_bib24
  article-title: A probabilistic Earley parser as a psycholinguistic model
  doi: 10.3115/1073336.1073357
– year: 2022
  ident: 2023121518583196000_bib60
  article-title: mGPT: Few-shot learners go multilingual
  publication-title: arXiv preprint arXiv:2204.07580
– volume-title: International Conference on Learning Representations
  year: 2015
  ident: 2023121518583196000_bib35
  article-title: Adam: A method for stochastic optimization
– start-page: 324
  volume-title: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing
  year: 2009
  ident: 2023121518583196000_bib53
  article-title: Deriving lexical and syntactic expectation-based measures for psycholinguistic modeling via incremental top-down parsing
  doi: 10.3115/1699510.1699553
– start-page: 5203
  volume-title: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
  year: 2021
  ident: 2023121518583196000_bib38
  article-title: Lower perplexity is not always human-like
  doi: 10.18653/v1/2021.acl-long.405
– volume: 74
  start-page: 5
  issue: 1
  year: 2012
  ident: 2023121518583196000_bib55
  article-title: Parafoveal processing in reading
  publication-title: Attention, Perception, & Psychophysics
  doi: 10.3758/s13414-011-0219-2
– volume: 106
  start-page: 1126
  issue: 3
  year: 2008
  ident: 2023121518583196000_bib39
  article-title: Expectation-based syntactic comprehension
  publication-title: Cognition
  doi: 10.1016/j.cognition.2007.05.006
– start-page: 963
  volume-title: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
  year: 2021
  ident: 2023121518583196000_bib43
  article-title: Revisiting the Uniform Information Density hypothesis
  doi: 10.18653/v1/2021.emnlp-main.74
– start-page: 29
  volume-title: Proceedings of the First NLPL Workshop on Deep Learning for Natural Language Processing
  year: 2019
  ident: 2023121518583196000_bib54
  article-title: Is multilingual BERT fluent in language generation?
– volume: 54
  start-page: 2843
  issue: 6
  year: 2022
  ident: 2023121518583196000_bib61
  article-title: Expanding horizons of cross-linguistic research on reading: The multilingual eye-movement corpus (MECO)
  publication-title: Behavior Research Methods
  doi: 10.3758/s13428-021-01772-6
– volume: 5
  start-page: 475
  issue: 3
  year: 2013
  ident: 2023121518583196000_bib19
  article-title: Uncertainty reduction as a measure of cognitive load in sentence comprehension
  publication-title: Topics in Cognitive Science
  doi: 10.1111/tops.12025
– start-page: 8440
  volume-title: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
  year: 2020
  ident: 2023121518583196000_bib8
  article-title: Unsupervised cross-lingual representation learning at scale
  doi: 10.18653/v1/2020.acl-main.747
– year: 2021
  ident: 2023121518583196000_bib14
  article-title: A primer on pretrained multilingual language models
  publication-title: arXiv preprint arXiv:2107.00676
– start-page: 48
  volume-title: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (Demonstrations)
  year: 2019
  ident: 2023121518583196000_bib46
  article-title: fairseq: A fast, extensible toolkit for sequence modeling
  doi: 10.18653/v1/N19-4009
– start-page: 2440
  volume-title: Proceedings of the Twelfth Language Resources and Evaluation Conference
  year: 2020
  ident: 2023121518583196000_bib23
  article-title: Wiki-40B: Multilingual language model dataset
– volume: 9
  start-page: 211837
  issue: 6
  year: 2022
  ident: 2023121518583196000_bib6
  article-title: Prediction as a basis for skilled reading: Insights from modern language models
  publication-title: Royal Society Open Science
  doi: 10.1098/rsos.211837
– start-page: 10421
  volume-title: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
  year: 2022
  ident: 2023121518583196000_bib37
  article-title: Context limitations make neural language models more human-like
  doi: 10.18653/v1/2022.emnlp-main.712
– volume-title: Sentence Processing: A Crosslinguistic Perspective
  year: 1998
  ident: 2023121518583196000_bib29
  doi: 10.1163/9780585492230
– volume: 116
  start-page: 104174
  year: 2021
  ident: 2023121518583196000_bib5
  article-title: Word predictability effects are linear, not logarithmic: Implications for probabilistic models of sentence comprehension
  publication-title: Journal of Memory and Language
  doi: 10.1016/j.jml.2020.104174
– volume-title: Proceedings of the Annual Meeting of the Cognitive Science Society
  year: 2008
  ident: 2023121518583196000_bib17
  article-title: Speaking rationally: Uniform information density as an optimal strategy for language production
– start-page: 4781
  volume-title: Proceedings of the Twelfth Language Resources and Evaluation Conference
  year: 2020
  ident: 2023121518583196000_bib1
  article-title: Give your text representation models some love: The case for Basque
– start-page: 10
  volume-title: Proceedings of the 8th Workshop on Cognitive Modeling and Computational Linguistics (CMCL 2018)
  year: 2018
  ident: 2023121518583196000_bib21
  article-title: Predictive power of word surprisal for reading times is a linear function of language model quality
  doi: 10.18653/v1/W18-0102
– volume: 40
  start-page: 1382
  issue: 6
  year: 2016
  ident: 2023121518583196000_bib41
  article-title: Uncertainty and expectation in sentence processing: Evidence from subcategorization distributions
  publication-title: Cognitive Science
  doi: 10.1111/cogs.12274
– year: 2023
  ident: 2023121518583196000_bib49
  article-title: On the effect of anticipation on reading times
  publication-title: Transactions of the Association for Computational Linguistics
  doi: 10.1162/tacl_a_00603
– volume-title: The World Atlas of Language Structures
  year: 2005
  ident: 2023121518583196000_bib28
– start-page: 1112
  volume-title: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
  year: 2021
  ident: 2023121518583196000_bib67
  article-title: When do you need billions of words of pretraining data?
  doi: 10.18653/v1/2021.acl-long.90
– start-page: 341
  year: 2007
  ident: 2023121518583196000_bib7
  article-title: Eye movements in reading words and sentences
  publication-title: Eye Movements
  doi: 10.1016/B978-008044980-7/50017-3
– year: 2022
  ident: 2023121518583196000_bib31
  article-title: The plausibility of sampling as an algorithmic theory of sentence processing
  publication-title: PsyArXiv preprint
  doi: 10.31234/osf.io/qjnpv
– year: 2019
  ident: 2023121518583196000_bib65
  article-title: Multilingual is not enough: BERT for Finnish
  publication-title: arXiv preprint arXiv:1912.07076
– volume: 111
  start-page: 228
  issue: 2
  year: 1982
  ident: 2023121518583196000_bib33
  article-title: Paradigms and processes in reading comprehension
  publication-title: Journal of Experimental Psychology: General
  doi: 10.1037/0096-3445.111.2.228
– volume: 5
  start-page: 1
  issue: 9
  year: 2019
  ident: 2023121518583196000_bib10
  article-title: Different languages, similar encoding efficiency: Comparable information rates across the human communicative niche
  publication-title: Science Advances
  doi: 10.1126/sciadv.aaw2594
– volume-title: Meaningful Differences in the Everyday Experience of Young American Children
  year: 1995
  ident: 2023121518583196000_bib27
– volume: 37
  start-page: 424
  issue: 3
  year: 1969
  ident: 2023121518583196000_bib22
  article-title: Investigating causal relations by econometric models and cross-spectral methods
  publication-title: Econometrica
  doi: 10.2307/1912791
– start-page: 36
  volume-title: Research Methods in Second Language Psycholinguistics
  year: 2013
  ident: 2023121518583196000_bib32
  article-title: Self-paced reading
  doi: 10.4324/9780203123430
– volume: 50
  start-page: 826
  year: 2018
  ident: 2023121518583196000_bib42
  article-title: The Provo corpus: A large eye-tracking corpus with predictability norms
  publication-title: Behavior Research Methods
  doi: 10.3758/s13428-017-0908-4
– start-page: 949
  volume-title: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
  year: 2021
  ident: 2023121518583196000_bib48
  article-title: A surprisal–duration trade-off across and within the world’s languages
  doi: 10.18653/v1/2021.emnlp-main.73
– year: 2022
  ident: 2023121518583196000_bib63
– volume: 41
  start-page: 163
  issue: 1
  year: 2009
  ident: 2023121518583196000_bib15
  article-title: The maze task: Measuring forced incremental sentence processing time
  publication-title: Behavior Research Methods
  doi: 10.3758/BRM.41.1.163
– year: 2022
  ident: 2023121518583196000_bib58
  article-title: Large- scale evidence for logarithmic effects of word predictability on reading time
  publication-title: PsyArXiv preprint
  doi: 10.31234/osf.io/4hyna
– volume: 30
  issue: 4
  year: 2006
  ident: 2023121518583196000_bib26
  article-title: Uncertainty about the rest of the sentence.
  publication-title: Cognitive Science
  doi: 10.1207/s15516709cog0000_64
– volume: 22
  start-page: 829
  issue: 6
  year: 2011
  ident: 2023121518583196000_bib20
  article-title: Insensitivity of the human sentence-processing system to hierarchical structure
  publication-title: Psychological Science
  doi: 10.1177/0956797611409589
– start-page: 4086
  volume-title: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)
  year: 2019
  ident: 2023121518583196000_bib56
  article-title: A large-scale study of the effects of word frequency and predictability in naturalistic reading
  doi: 10.18653/v1/N19-1413
– volume: 111
  start-page: 104082
  year: 2020
  ident: 2023121518583196000_bib3
  article-title: Maze made easy: Better and easier measurement of incremental processing difficulty
  publication-title: Journal of Memory and Language
  doi: 10.1016/j.jml.2019.104082
– start-page: 106
  volume-title: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
  year: 2021
  ident: 2023121518583196000_bib30
  article-title: Multilingual language models predict human reading behavior
  doi: 10.18653/v1/2021.naacl-main.10
– volume: 7
  start-page: 65
  issue: 1
  year: 1975
  ident: 2023121518583196000_bib51
  article-title: The perceptual span and peripheral cues in reading
  publication-title: Cognitive Psychology
  doi: 10.1016/0010-0285(75)90005-5
– start-page: 536
  volume-title: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
  year: 2018
  ident: 2023121518583196000_bib9
  article-title: Are all languages equally hard to language-model?
  doi: 10.18653/v1/N18-2085
– volume: 32
  start-page: 101
  year: 2003
  ident: 2023121518583196000_bib25
  article-title: The information conveyed by words in sentences
  publication-title: Journal of Psycholinguistic Research
  doi: 10.1023/A:1022492123056
– volume: 128
  start-page: 302
  issue: 3
  year: 2013
  ident: 2023121518583196000_bib62
  article-title: The effect of word predictability on reading time is logarithmic
  publication-title: Cognition
  doi: 10.1016/j.cognition.2013.02.013
– start-page: 1260
  volume-title: Proceedings of the Cognitive Science Society
  year: 2017
  ident: 2023121518583196000_bib64
  article-title: Approximations of predictive entropy correlate with reading times
– volume: 124
  start-page: 372
  issue: 3
  year: 1998
  ident: 2023121518583196000_bib52
  article-title: Eye movements in reading and information processing: 20 years of research
  publication-title: Psychological Bulletin
  doi: 10.1037/0033-2909.124.3.372
– start-page: 1707
  volume-title: Proceedings of the 2020 Meeting of the Cognitive Science Society
  year: 2020
  ident: 2023121518583196000_bib66
  article-title: On the predictive power of neural language models for human real-time comprehension behavior
– volume: 21
  start-page: 1
  issue: 1
  year: 2020
  ident: 2023121518583196000_bib50
  article-title: Exploring the limits of transfer learning with a unified text-to-text transformer
  publication-title: Journal of Machine Learning Research
– volume: 27
  start-page: 379
  issue: 3
  year: 1948
  ident: 2023121518583196000_bib59
  article-title: A mathematical theory of communication
  publication-title: The Bell System Technical Journal
  doi: 10.1002/j.1538-7305.1948.tb01338.x
– start-page: 66
  volume-title: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
  year: 2018
  ident: 2023121518583196000_bib36
  article-title: Subword regularization: Improving neural network translation models with multiple subword candidates
  doi: 10.18653/v1/P18-1007
– start-page: 3718
  volume-title: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
  year: 2021
  ident: 2023121518583196000_bib57
  article-title: CDRNN: Discovering complex dynamics in human language processing
  doi: 10.18653/v1/2021.acl-long.288
– start-page: 81
  volume-title: Proceedings of the 2010 Workshop on Cognitive Modeling and Computational Linguistics
  year: 2010
  ident: 2023121518583196000_bib18
  article-title: Uncertainty reduction as a measure of cognitive processing effort
– volume: 19
  year: 2006
  ident: 2023121518583196000_bib40
  article-title: Speakers optimize information density through syntactic reduction
  publication-title: Advances in Neural Information Processing Systems
  doi: 10.7551/mitpress/7503.003.0111
– volume: 87
  start-page: 539
  issue: 3
  year: 2011
  ident: 2023121518583196000_bib47
  article-title: A cross-language perspective on speech information rate
  publication-title: Language
  doi: 10.1353/lan.2011.0057
– start-page: 138
  volume-title: Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022
  year: 2022
  ident: 2023121518583196000_bib11
  article-title: The effects of surprisal across languages: Results from native and non-native reading
– volume: 109
  start-page: 193
  issue: 2
  year: 2008
  ident: 2023121518583196000_bib12
  article-title: Data from eye-tracking corpora as evidence for theories of syntactic processing complexity
  publication-title: Cognition
  doi: 10.1016/j.cognition.2008.07.008
– volume-title: Proceedings of the 12th European Conference on Eye Movements
  year: 2003
  ident: 2023121518583196000_bib34
  article-title: The Dundee corpus
– volume: 68
  start-page: 255
  issue: 3
  year: 2013
  ident: 2023121518583196000_bib2
  article-title: Random effects structure for confirmatory hypothesis testing: Keep it maximal
  publication-title: Journal of Memory and Language
  doi: 10.1016/j.jml.2012.11.001
– start-page: 61
  volume-title: Proceedings of the 3rd Workshop on Cognitive Modeling and Computational Linguistics (CMCL 2012)
  year: 2012
  ident: 2023121518583196000_bib16
  article-title: Sequential vs. hierarchical syntactic models of human incremental sentence processing
– start-page: 4975
  volume-title: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
  year: 2019
  ident: 2023121518583196000_bib44
  article-title: What kind of language is hard to language-model?
  doi: 10.18653/v1/P19-1491
– volume: 11
  start-page: 336
  year: 2023
  ident: 2023121518583196000_bib45
  article-title: Why does surprisal from larger transformer-based language models provide a poorer fit to human reading times?
  publication-title: Transactions of the Association for Computational Linguistics
  doi: 10.1162/tacl_a_00548
– start-page: 4171
  volume-title: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)
  year: 2019
  ident: 2023121518583196000_bib13
  article-title: BERT: Pre-training of deep bidirectional transformers for language understanding
  doi: 10.18653/v1/N19-1423
– volume-title: Talk at 26th Architectures and Mechanisms for Language Processing conference (AMLaP 26)
  year: 2020
  ident: 2023121518583196000_bib4
  article-title: A-maze of natural stories: Texts are comprehensible using the maze task
SSID ssj0001818062
Score 2.4076583
Snippet Surprisal theory posits that less-predictable words should take more time to process, with word predictability quantified as surprisal, i.e., negative log...
SourceID doaj
proquest
crossref
SourceType Open Website
Aggregation Database
Enrichment Source
Index Database
StartPage 1451
SubjectTerms English language
Information theory
Language modeling
Language processing
Multilingualism
Natural language processing
Predictions
Reading
Theoretical linguistics
SummonAdditionalLinks – databaseName: ProQuest Central
  dbid: BENPR
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3PS8MwFA66XbyIouJ0Sg56krKlTdMUD-JkMkTH0A12C0maDmG0c6v_v3lZuimi1zaXvLyfX16-h9CV6YaKcJ0FkTIA3UgS8DilgcrDJLZVUKYlQAMvQzaY0KdpPPWA28q3VdY-0TnqrNSAkXdCa5WUhCwJ7xYfAUyNgttVP0JjFzWtC-a2-Gr2-sPR6xZlgafMbqooNDwDkey07n5nYaeSei6kcGH-R1xy9P2_vLMLOY8HaN_nivh-fbiHaMcUR-h2DLwYxQzbzA2PlnDP4lQHlzl-s1ID0tw5Xj-5x-8FJgQ_e0xydYwmj_3xwyDwExACHbGkCiQnkqrIlixKmgSKGaptzFcszQwHCFOmXGmrDIlkubKZlKJdk8WRylJrvJGMTlCjKAtzijDR0qhEURXnhmYJT6lRKdx5hkBZ1-220E29f6E9PThMqZgLVyawUHyXVgtdb1Yv1rQYf6zrgSg3a4DM2n0olzPhbUOwlHNiqJLWMVOrGzbpk4YlNKa5jE3EW6hdH4TwFrYSW304-__3OdqDEfHQgkJoGzWq5ae5sIlEpS69tnwBM3fGug
  priority: 102
  providerName: ProQuest
Title Testing the Predictions of Surprisal Theory in 11 Languages
URI https://www.proquest.com/docview/2911412672
https://doaj.org/article/69881e4ba7624ca3951ae67454fa5e38
Volume 11
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3dS8MwEA86X3wRRcXpHHnQJylb0jRJ8cnJ5hAdQzfYW0jSVITRyVb_f3NpNyYivvharqS9y-U-cvc7hK5clxoibRbFxkHqRpNIJimLTE5F4qOgzGpIDTyP-HDKHmfJbGvUF9SEVfDAFeM6PJWSOGa011rmX_QegXZcsITlOnFxaPP1Nm8rmArZFWhh5nRd6c5pp9R2rrQKJv2bDQpQ_T9O4mBeBofooPYL8V31PUdoxxXH6HYCGBjFG_ZeGh4v4U4lbBO8yPGr5xAA5M5x1V6P3wtMCH6q84-rEzQd9Cf3w6iedhDZmIsy0pJoZmIfnhjtBAQuzHr7bniaOQnpSp1KY73ghea58V6TYV2XJbHJUq-osY5PUaNYFO4MYWK1M8Iwk-SOZUKmzJkU7jcpwNN1u010s_5_ZWsocJhIMVchJOBUbXOria431B8VBMYvdD1g5YYGgKvDAy9OVYtT_SXOJmqtBaFqbVop6k9kRigX9Pw_1rhA-zA0HopSCGuhRrn8dJfetShNG-3KwUMb7fX6o_FLO-ypL6sSzdc
linkProvider Directory of Open Access Journals
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwzV1Nb9QwEB3140AvFASIhRZ8oCeUdu04jiPUAxSqLd1WFWzF3oztOKuKVbbaTYXav9K_0h_HTDZpoajcKnFNrCiy38y8GY-fAd6ErnBc-zyKXaDSjeWRTjIZuUKkCWZBubdUGjg4VL1j-XmYDBfgsj0LQ22VrU-sHXU-8VQj3xJolZILlYqmg3I_nP_E_Gy2vfcRF3NDiN1Pg51e1FwhEPlYpVVkNbfSxcj5nQ0pZQPSY9B0KsuDphqgzbTzOJupVYVDKuJkN-RJ7PIM0R_bGL-7CMuYVSRo9cs7X_pH325KOHROur6ylLqpSaV22LbWK7FVWT821tQc4o-gV98N8Jfrr-PZ7ipctTMxb2P5sXlWuU1_cUsk8j-dqkfwsOHR7P0c-I9hIZRP4N2ANEPKEUNWy46mtAdVmxWbFOwrIooEhcdsLkfATkrGOes39drZUzi-l_99BkvlpAzPgXFvg0uddEkRZJ7qTAaX0X6wIDm_brcDb9vlM76RTqcbPMamTqGUML8vdgc2rkefziVD7hj3gZBwPYaEvusHk-nINH7DqExrHqSzGLQk2g0SYhtUKhNZ2CTEugNrLQZM431m5gYAL_79-jU86A0O-qa_d7j_ElYEEjhq1eFyDZaq6VlYR8JVuVcN8Bl8v28A_QLjBDgo
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Testing+the+Predictions+of+Surprisal+Theory+in+11+Languages&rft.jtitle=Transactions+of+the+Association+for+Computational+Linguistics&rft.au=Ethan+G.+Wilcox&rft.au=Tiago+Pimentel&rft.au=Clara+Meister&rft.au=Ryan+Cotterell&rft.date=2023-12-14&rft.pub=The+MIT+Press&rft.eissn=2307-387X&rft.volume=11&rft_id=info:doi/10.1162%2Ftacl_a_00612&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_69881e4ba7624ca3951ae67454fa5e38
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2307-387X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2307-387X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2307-387X&client=summon