Testing the Predictions of Surprisal Theory in 11 Languages

Surprisal theory posits that less-predictable words should take more time to process, with word predictability quantified as surprisal, i.e., negative log probability in context. While evidence supporting the predictions of surprisal theory has been replicated widely, much of it has focused on a ver...

Full description

Saved in:

Bibliographic Details
Published in	Transactions of the Association for Computational Linguistics Vol. 11; pp. 1451 - 1470
Main Authors	Wilcox, Ethan G., Pimentel, Tiago, Meister, Clara, Cotterell, Ryan, Levy, Roger P.
Format	Journal Article
Language	English
Published	Cambridge MIT Press Journals, The 14.12.2023 The MIT Press
Subjects	English language Information theory Language modeling Language processing Multilingualism Natural language processing Predictions Reading Theoretical linguistics
Online Access	Get full text
ISSN	2307-387X 2307-387X
DOI	10.1162/tacl_a_00612

Cover

Loading…

Abstract	Surprisal theory posits that less-predictable words should take more time to process, with word predictability quantified as surprisal, i.e., negative log probability in context. While evidence supporting the predictions of surprisal theory has been replicated widely, much of it has focused on a very narrow slice of data: native English speakers reading English texts. Indeed, no comprehensive multilingual analysis exists. We address this gap in the current literature by investigating the relationship between surprisal and reading times in eleven different languages, distributed across five language families. Deriving estimates from language models trained on monolingual and multilingual corpora, we test three predictions associated with surprisal theory: (i) whether surprisal is predictive of reading times, (ii) whether expected surprisal, i.e., contextual entropy, is predictive of reading times, and (iii) whether the linking function between surprisal and reading times is linear. We find that all three predictions are borne out crosslinguistically. By focusing on a more diverse set of languages, we argue that these results offer the most robust link to date between information theory and incremental language processing across languages.
AbstractList	Surprisal theory posits that less-predictable words should take more time to process, with word predictability quantified as surprisal, i.e., negative log probability in context. While evidence supporting the predictions of surprisal theory has been replicated widely, much of it has focused on a very narrow slice of data: native English speakers reading English texts. Indeed, no comprehensive multilingual analysis exists. We address this gap in the current literature by investigating the relationship between surprisal and reading times in eleven different languages, distributed across five language families. Deriving estimates from language models trained on monolingual and multilingual corpora, we test three predictions associated with surprisal theory: (i) whether surprisal is predictive of reading times, (ii) whether expected surprisal, i.e., contextual entropy, is predictive of reading times, and (iii) whether the linking function between surprisal and reading times is linear. We find that all three predictions are borne out crosslinguistically. By focusing on a more diverse set of languages, we argue that these results offer the most robust link to date between information theory and incremental language processing across languages.
Author	Meister, Clara Pimentel, Tiago Levy, Roger P. Wilcox, Ethan G. Cotterell, Ryan
Author_xml	– sequence: 1 givenname: Ethan G. surname: Wilcox fullname: Wilcox, Ethan G. – sequence: 2 givenname: Tiago surname: Pimentel fullname: Pimentel, Tiago – sequence: 3 givenname: Clara surname: Meister fullname: Meister, Clara – sequence: 4 givenname: Ryan surname: Cotterell fullname: Cotterell, Ryan – sequence: 5 givenname: Roger P. surname: Levy fullname: Levy, Roger P.
BookMark	eNptkE9LAzEQxYMoWGtvfoAFr1YzSTbJ4knEf1BQsIK3MMlm25R1U5PtwW_vahWKeJpheO_xm3dE9rvYeUJOgJ4DSHbRo2sNGkolsD0yYpyqKdfqdX9nPySTnFeUUtCgqWQjcjn3uQ_douiXvnhKvg6uD7HLRWyK501ap5CxLeZLH9NHEboCoJhht9jgwudjctBgm_3kZ47Jy-3N_Pp-Onu8e7i-mk0dl6qfogYUloPQFr0CUZXCUSWsrGqvqaWAlbbOY61QNpYxaQX1dcltXVFOOfIxedjm1hFXZkB6w_RhIgbzfYhpYTD1wbXeyEpr8MKikkw45FUJ6KUSpWiw9FwPWafbrHWK75vhd7OKm9QN-IZVAAKYVGxQsa3KpZhz8o1xocevYvqEoTVAzVfnZrfzwXT2x_SL-q_8Ewv5hGY
CitedBy_id	crossref_primary_10_1016_j_jml_2023_104497 crossref_primary_10_1016_j_cognition_2024_105765 crossref_primary_10_1080_0907676X_2024_2418016 crossref_primary_10_1162_opmi_a_00119 crossref_primary_10_3758_s13428_023_02261_8 crossref_primary_10_1038_s41597_025_04771_w crossref_primary_10_1162_opmi_a_00150 crossref_primary_10_3758_s13423_024_02588_z crossref_primary_10_1515_lingvan_2023_0187 crossref_primary_10_3758_s13428_024_02523_z crossref_primary_10_1073_pnas_2307876121 crossref_primary_10_1162_tacl_a_00714 crossref_primary_10_1146_annurev_linguistics_011724_121517 crossref_primary_10_1016_j_jml_2024_104603 crossref_primary_10_1111_cogs_13501 crossref_primary_10_1111_cogs_13478 crossref_primary_10_1016_j_jml_2024_104534 crossref_primary_10_3758_s13428_024_02561_7
Cites_doi	10.3115/1073336.1073357 10.3115/1699510.1699553 10.18653/v1/2021.acl-long.405 10.3758/s13414-011-0219-2 10.1016/j.cognition.2007.05.006 10.18653/v1/2021.emnlp-main.74 10.3758/s13428-021-01772-6 10.1111/tops.12025 10.18653/v1/2020.acl-main.747 10.18653/v1/N19-4009 10.1098/rsos.211837 10.18653/v1/2022.emnlp-main.712 10.1163/9780585492230 10.1016/j.jml.2020.104174 10.18653/v1/W18-0102 10.1111/cogs.12274 10.1162/tacl_a_00603 10.18653/v1/2021.acl-long.90 10.1016/B978-008044980-7/50017-3 10.31234/osf.io/qjnpv 10.1037/0096-3445.111.2.228 10.1126/sciadv.aaw2594 10.2307/1912791 10.4324/9780203123430 10.3758/s13428-017-0908-4 10.18653/v1/2021.emnlp-main.73 10.3758/BRM.41.1.163 10.31234/osf.io/4hyna 10.1207/s15516709cog0000_64 10.1177/0956797611409589 10.18653/v1/N19-1413 10.1016/j.jml.2019.104082 10.18653/v1/2021.naacl-main.10 10.1016/0010-0285(75)90005-5 10.18653/v1/N18-2085 10.1023/A:1022492123056 10.1016/j.cognition.2013.02.013 10.1037/0033-2909.124.3.372 10.1002/j.1538-7305.1948.tb01338.x 10.18653/v1/P18-1007 10.18653/v1/2021.acl-long.288 10.7551/mitpress/7503.003.0111 10.1353/lan.2011.0057 10.1016/j.cognition.2008.07.008 10.1016/j.jml.2012.11.001 10.18653/v1/P19-1491 10.1162/tacl_a_00548 10.18653/v1/N19-1423
ContentType	Journal Article
Copyright	2023. This work is published under https://creativecommons.org/licenses/by/4.0/legalcode (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Copyright_xml	– notice: 2023. This work is published under https://creativecommons.org/licenses/by/4.0/legalcode (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
DBID	AAYXX CITATION 7T9 8FE 8FG ABUWG AFKRA ALSLI ARAPS AZQEC BENPR BGLVJ CCPQU CPGLG CRLPW DWQXO GNUQQ HCIFZ JQ2 K7- P5Z P62 PHGZM PHGZT PIMPY PKEHL PQEST PQGLB PQQKQ PQUKI PRQQA DOA
DOI	10.1162/tacl_a_00612
DatabaseName	CrossRef Linguistics and Language Behavior Abstracts (LLBA) ProQuest SciTech Collection ProQuest Technology Collection ProQuest Central (Alumni) ProQuest Central UK/Ireland Social Science Premium Collection Advanced Technologies & Aerospace Collection ProQuest Central Essentials ProQuest Central Technology Collection ProQuest One Community College Linguistics Collection Linguistics Database ProQuest Central Korea ProQuest Central Student SciTech Premium Collection ProQuest Computer Science Collection Computer Science Database ProQuest advanced technologies & aerospace journals ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Premium ProQuest One Academic Publicly Available Content Database ProQuest One Academic Middle East (New) ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic ProQuest One Academic UKI Edition ProQuest One Social Sciences DOAJ Directory of Open Access Journals
DatabaseTitle	CrossRef Publicly Available Content Database Computer Science Database ProQuest Central Student Technology Collection ProQuest One Academic Middle East (New) ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Essentials ProQuest Computer Science Collection ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College ProQuest Central ProQuest One Applied & Life Sciences Linguistics Collection ProQuest Central Korea ProQuest Central (New) Advanced Technologies & Aerospace Collection Social Science Premium Collection ProQuest One Social Sciences ProQuest One Academic Eastern Edition Linguistics and Language Behavior Abstracts (LLBA) ProQuest Technology Collection ProQuest SciTech Collection Advanced Technologies & Aerospace Database ProQuest One Academic UKI Edition Linguistics Database ProQuest One Academic ProQuest One Academic (New)
DatabaseTitleList	CrossRef Publicly Available Content Database
Database_xml	– sequence: 1 dbid: DOA name: Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 2 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database
DeliveryMethod	fulltext_linktorsrc
EISSN	2307-387X
EndPage	1470
ExternalDocumentID	oai_doaj_org_article_69881e4ba7624ca3951ae67454fa5e38 10_1162_tacl_a_00612
GroupedDBID	AAFWJ AAYXX ABUWG AFKRA AFPKN ALMA_UNASSIGNED_HOLDINGS ALSLI ARAPS BENPR BGLVJ CCPQU CITATION CPGLG CRLPW DWQXO EBS GROUPED_DOAJ HCIFZ JMNJE K7- M~E OJV OK1 PHGZM PHGZT PIMPY RMI 7T9 8FE 8FG AZQEC GNUQQ JQ2 P62 PKEHL PQEST PQGLB PQQKQ PQUKI PRQQA PUEGO
ID	FETCH-LOGICAL-c367t-a81a4b3148bae714954c074b69de80b01a98bcead7a6fb226b40ed53bd90303a3
IEDL.DBID	DOA
ISSN	2307-387X
IngestDate	Wed Aug 27 01:20:25 EDT 2025 Sat Jul 26 00:01:17 EDT 2025 Tue Jul 01 03:28:36 EDT 2025 Thu Apr 24 23:02:27 EDT 2025
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c367t-a81a4b3148bae714954c074b69de80b01a98bcead7a6fb226b40ed53bd90303a3
Notes	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
OpenAccessLink	https://doaj.org/article/69881e4ba7624ca3951ae67454fa5e38
PQID	2911412672
PQPubID	6535866
PageCount	20
ParticipantIDs	doaj_primary_oai_doaj_org_article_69881e4ba7624ca3951ae67454fa5e38 proquest_journals_2911412672 crossref_citationtrail_10_1162_tacl_a_00612 crossref_primary_10_1162_tacl_a_00612
ProviderPackageCode	CITATION AAYXX
PublicationCentury	2000
PublicationDate	2023-12-14
PublicationDateYYYYMMDD	2023-12-14
PublicationDate_xml	– month: 12 year: 2023 text: 2023-12-14 day: 14
PublicationDecade	2020
PublicationPlace	Cambridge
PublicationPlace_xml	– name: Cambridge
PublicationTitle	Transactions of the Association for Computational Linguistics
PublicationYear	2023
Publisher	MIT Press Journals, The The MIT Press
Publisher_xml	– name: MIT Press Journals, The – name: The MIT Press
References	Jegerski (2023121518583196000_bib32) 2013 Frank (2023121518583196000_bib17) 2008 de Varda (2023121518583196000_bib11) 2022 Conneau (2023121518583196000_bib8) 2020 Linzen (2023121518583196000_bib41) 2016; 40 Luke (2023121518583196000_bib42) 2018; 50 Boyce (2023121518583196000_bib3) 2020; 111 Hale (2023121518583196000_bib25) 2003; 32 Wilcox (2023121518583196000_bib66) 2020 Rayner (2023121518583196000_bib51) 1975; 7 Cotterell (2023121518583196000_bib9) 2018 Shannon (2023121518583196000_bib59) 1948; 27 Boyce (2023121518583196000_bib4) 2020 Just (2023121518583196000_bib33) 1982; 111 Schotter (2023121518583196000_bib55) 2012; 74 Siegelman (2023121518583196000_bib61) 2022; 54 Frank (2023121518583196000_bib18) 2010 Shain (2023121518583196000_bib58) 2022 Shliazhko (2023121518583196000_bib60) 2022 Haspelmath (2023121518583196000_bib28) 2005 Demberg (2023121518583196000_bib12) 2008; 109 Hillert (2023121518583196000_bib29) 1998 Mielke (2023121518583196000_bib44) 2019 Cevoli (2023121518583196000_bib6) 2022; 9 Coupé (2023121518583196000_bib10) 2019; 5 Hart (2023121518583196000_bib27) 1995 Byung-Doh (2023121518583196000_bib45) 2023; 11 Meister (2023121518583196000_bib43) 2021 Brothers (2023121518583196000_bib5) 2021; 116 Granger (2023121518583196000_bib22) 1969; 37 Kingma (2023121518583196000_bib35) 2015 Kuribayashi (2023121518583196000_bib37) 2022 Pimentel (2023121518583196000_bib49) 2023 Fossum (2023121518583196000_bib16) 2012 Roark (2023121518583196000_bib53) 2009 Shain (2023121518583196000_bib57) 2021 Kennedy (2023121518583196000_bib34) 2003 Ott (2023121518583196000_bib46) 2019 Speer (2023121518583196000_bib63) 2022 Hollenstein (2023121518583196000_bib30) 2021 Raffel (2023121518583196000_bib50) 2020; 21 Clifton (2023121518583196000_bib7) 2007 Guo (2023121518583196000_bib23) 2020 van Schijndel (2023121518583196000_bib64) 2017 Hale (2023121518583196000_bib26) 2006; 30 Rayner (2023121518583196000_bib52) 1998; 124 Levy (2023121518583196000_bib39) 2008; 106 Levy (2023121518583196000_bib40) 2006; 19 Rönnqvist (2023121518583196000_bib54) 2019 Agerri (2023121518583196000_bib1) 2020 Forster (2023121518583196000_bib15) 2009; 41 Kudo (2023121518583196000_bib36) 2018 Pimentel (2023121518583196000_bib48) 2021 Zhang (2023121518583196000_bib67) 2021 Doddapaneni (2023121518583196000_bib14) 2021 Frank (2023121518583196000_bib19) 2013; 5 Goodkind (2023121518583196000_bib21) 2018 Hale (2023121518583196000_bib24) 2001 Kuribayashi (2023121518583196000_bib38) 2021 Smith (2023121518583196000_bib62) 2013; 128 Devlin (2023121518583196000_bib13) 2019 Hoover (2023121518583196000_bib31) 2022 Frank (2023121518583196000_bib20) 2011; 22 Virtanen (2023121518583196000_bib65) 2019 Shain (2023121518583196000_bib56) 2019 Pellegrino (2023121518583196000_bib47) 2011; 87 Barr (2023121518583196000_bib2) 2013; 68
References_xml	– volume-title: Second Meeting of the North American Chapter of the Association for Computational Linguistics year: 2001 ident: 2023121518583196000_bib24 article-title: A probabilistic Earley parser as a psycholinguistic model doi: 10.3115/1073336.1073357 – year: 2022 ident: 2023121518583196000_bib60 article-title: mGPT: Few-shot learners go multilingual publication-title: arXiv preprint arXiv:2204.07580 – volume-title: International Conference on Learning Representations year: 2015 ident: 2023121518583196000_bib35 article-title: Adam: A method for stochastic optimization – start-page: 324 volume-title: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing year: 2009 ident: 2023121518583196000_bib53 article-title: Deriving lexical and syntactic expectation-based measures for psycholinguistic modeling via incremental top-down parsing doi: 10.3115/1699510.1699553 – start-page: 5203 volume-title: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) year: 2021 ident: 2023121518583196000_bib38 article-title: Lower perplexity is not always human-like doi: 10.18653/v1/2021.acl-long.405 – volume: 74 start-page: 5 issue: 1 year: 2012 ident: 2023121518583196000_bib55 article-title: Parafoveal processing in reading publication-title: Attention, Perception, & Psychophysics doi: 10.3758/s13414-011-0219-2 – volume: 106 start-page: 1126 issue: 3 year: 2008 ident: 2023121518583196000_bib39 article-title: Expectation-based syntactic comprehension publication-title: Cognition doi: 10.1016/j.cognition.2007.05.006 – start-page: 963 volume-title: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing year: 2021 ident: 2023121518583196000_bib43 article-title: Revisiting the Uniform Information Density hypothesis doi: 10.18653/v1/2021.emnlp-main.74 – start-page: 29 volume-title: Proceedings of the First NLPL Workshop on Deep Learning for Natural Language Processing year: 2019 ident: 2023121518583196000_bib54 article-title: Is multilingual BERT fluent in language generation? – volume: 54 start-page: 2843 issue: 6 year: 2022 ident: 2023121518583196000_bib61 article-title: Expanding horizons of cross-linguistic research on reading: The multilingual eye-movement corpus (MECO) publication-title: Behavior Research Methods doi: 10.3758/s13428-021-01772-6 – volume: 5 start-page: 475 issue: 3 year: 2013 ident: 2023121518583196000_bib19 article-title: Uncertainty reduction as a measure of cognitive load in sentence comprehension publication-title: Topics in Cognitive Science doi: 10.1111/tops.12025 – start-page: 8440 volume-title: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics year: 2020 ident: 2023121518583196000_bib8 article-title: Unsupervised cross-lingual representation learning at scale doi: 10.18653/v1/2020.acl-main.747 – year: 2021 ident: 2023121518583196000_bib14 article-title: A primer on pretrained multilingual language models publication-title: arXiv preprint arXiv:2107.00676 – start-page: 48 volume-title: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (Demonstrations) year: 2019 ident: 2023121518583196000_bib46 article-title: fairseq: A fast, extensible toolkit for sequence modeling doi: 10.18653/v1/N19-4009 – start-page: 2440 volume-title: Proceedings of the Twelfth Language Resources and Evaluation Conference year: 2020 ident: 2023121518583196000_bib23 article-title: Wiki-40B: Multilingual language model dataset – volume: 9 start-page: 211837 issue: 6 year: 2022 ident: 2023121518583196000_bib6 article-title: Prediction as a basis for skilled reading: Insights from modern language models publication-title: Royal Society Open Science doi: 10.1098/rsos.211837 – start-page: 10421 volume-title: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing year: 2022 ident: 2023121518583196000_bib37 article-title: Context limitations make neural language models more human-like doi: 10.18653/v1/2022.emnlp-main.712 – volume-title: Sentence Processing: A Crosslinguistic Perspective year: 1998 ident: 2023121518583196000_bib29 doi: 10.1163/9780585492230 – volume: 116 start-page: 104174 year: 2021 ident: 2023121518583196000_bib5 article-title: Word predictability effects are linear, not logarithmic: Implications for probabilistic models of sentence comprehension publication-title: Journal of Memory and Language doi: 10.1016/j.jml.2020.104174 – volume-title: Proceedings of the Annual Meeting of the Cognitive Science Society year: 2008 ident: 2023121518583196000_bib17 article-title: Speaking rationally: Uniform information density as an optimal strategy for language production – start-page: 4781 volume-title: Proceedings of the Twelfth Language Resources and Evaluation Conference year: 2020 ident: 2023121518583196000_bib1 article-title: Give your text representation models some love: The case for Basque – start-page: 10 volume-title: Proceedings of the 8th Workshop on Cognitive Modeling and Computational Linguistics (CMCL 2018) year: 2018 ident: 2023121518583196000_bib21 article-title: Predictive power of word surprisal for reading times is a linear function of language model quality doi: 10.18653/v1/W18-0102 – volume: 40 start-page: 1382 issue: 6 year: 2016 ident: 2023121518583196000_bib41 article-title: Uncertainty and expectation in sentence processing: Evidence from subcategorization distributions publication-title: Cognitive Science doi: 10.1111/cogs.12274 – year: 2023 ident: 2023121518583196000_bib49 article-title: On the effect of anticipation on reading times publication-title: Transactions of the Association for Computational Linguistics doi: 10.1162/tacl_a_00603 – volume-title: The World Atlas of Language Structures year: 2005 ident: 2023121518583196000_bib28 – start-page: 1112 volume-title: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) year: 2021 ident: 2023121518583196000_bib67 article-title: When do you need billions of words of pretraining data? doi: 10.18653/v1/2021.acl-long.90 – start-page: 341 year: 2007 ident: 2023121518583196000_bib7 article-title: Eye movements in reading words and sentences publication-title: Eye Movements doi: 10.1016/B978-008044980-7/50017-3 – year: 2022 ident: 2023121518583196000_bib31 article-title: The plausibility of sampling as an algorithmic theory of sentence processing publication-title: PsyArXiv preprint doi: 10.31234/osf.io/qjnpv – year: 2019 ident: 2023121518583196000_bib65 article-title: Multilingual is not enough: BERT for Finnish publication-title: arXiv preprint arXiv:1912.07076 – volume: 111 start-page: 228 issue: 2 year: 1982 ident: 2023121518583196000_bib33 article-title: Paradigms and processes in reading comprehension publication-title: Journal of Experimental Psychology: General doi: 10.1037/0096-3445.111.2.228 – volume: 5 start-page: 1 issue: 9 year: 2019 ident: 2023121518583196000_bib10 article-title: Different languages, similar encoding efficiency: Comparable information rates across the human communicative niche publication-title: Science Advances doi: 10.1126/sciadv.aaw2594 – volume-title: Meaningful Differences in the Everyday Experience of Young American Children year: 1995 ident: 2023121518583196000_bib27 – volume: 37 start-page: 424 issue: 3 year: 1969 ident: 2023121518583196000_bib22 article-title: Investigating causal relations by econometric models and cross-spectral methods publication-title: Econometrica doi: 10.2307/1912791 – start-page: 36 volume-title: Research Methods in Second Language Psycholinguistics year: 2013 ident: 2023121518583196000_bib32 article-title: Self-paced reading doi: 10.4324/9780203123430 – volume: 50 start-page: 826 year: 2018 ident: 2023121518583196000_bib42 article-title: The Provo corpus: A large eye-tracking corpus with predictability norms publication-title: Behavior Research Methods doi: 10.3758/s13428-017-0908-4 – start-page: 949 volume-title: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing year: 2021 ident: 2023121518583196000_bib48 article-title: A surprisal–duration trade-off across and within the world’s languages doi: 10.18653/v1/2021.emnlp-main.73 – year: 2022 ident: 2023121518583196000_bib63 – volume: 41 start-page: 163 issue: 1 year: 2009 ident: 2023121518583196000_bib15 article-title: The maze task: Measuring forced incremental sentence processing time publication-title: Behavior Research Methods doi: 10.3758/BRM.41.1.163 – year: 2022 ident: 2023121518583196000_bib58 article-title: Large- scale evidence for logarithmic effects of word predictability on reading time publication-title: PsyArXiv preprint doi: 10.31234/osf.io/4hyna – volume: 30 issue: 4 year: 2006 ident: 2023121518583196000_bib26 article-title: Uncertainty about the rest of the sentence. publication-title: Cognitive Science doi: 10.1207/s15516709cog0000_64 – volume: 22 start-page: 829 issue: 6 year: 2011 ident: 2023121518583196000_bib20 article-title: Insensitivity of the human sentence-processing system to hierarchical structure publication-title: Psychological Science doi: 10.1177/0956797611409589 – start-page: 4086 volume-title: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) year: 2019 ident: 2023121518583196000_bib56 article-title: A large-scale study of the effects of word frequency and predictability in naturalistic reading doi: 10.18653/v1/N19-1413 – volume: 111 start-page: 104082 year: 2020 ident: 2023121518583196000_bib3 article-title: Maze made easy: Better and easier measurement of incremental processing difficulty publication-title: Journal of Memory and Language doi: 10.1016/j.jml.2019.104082 – start-page: 106 volume-title: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies year: 2021 ident: 2023121518583196000_bib30 article-title: Multilingual language models predict human reading behavior doi: 10.18653/v1/2021.naacl-main.10 – volume: 7 start-page: 65 issue: 1 year: 1975 ident: 2023121518583196000_bib51 article-title: The perceptual span and peripheral cues in reading publication-title: Cognitive Psychology doi: 10.1016/0010-0285(75)90005-5 – start-page: 536 volume-title: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers) year: 2018 ident: 2023121518583196000_bib9 article-title: Are all languages equally hard to language-model? doi: 10.18653/v1/N18-2085 – volume: 32 start-page: 101 year: 2003 ident: 2023121518583196000_bib25 article-title: The information conveyed by words in sentences publication-title: Journal of Psycholinguistic Research doi: 10.1023/A:1022492123056 – volume: 128 start-page: 302 issue: 3 year: 2013 ident: 2023121518583196000_bib62 article-title: The effect of word predictability on reading time is logarithmic publication-title: Cognition doi: 10.1016/j.cognition.2013.02.013 – start-page: 1260 volume-title: Proceedings of the Cognitive Science Society year: 2017 ident: 2023121518583196000_bib64 article-title: Approximations of predictive entropy correlate with reading times – volume: 124 start-page: 372 issue: 3 year: 1998 ident: 2023121518583196000_bib52 article-title: Eye movements in reading and information processing: 20 years of research publication-title: Psychological Bulletin doi: 10.1037/0033-2909.124.3.372 – start-page: 1707 volume-title: Proceedings of the 2020 Meeting of the Cognitive Science Society year: 2020 ident: 2023121518583196000_bib66 article-title: On the predictive power of neural language models for human real-time comprehension behavior – volume: 21 start-page: 1 issue: 1 year: 2020 ident: 2023121518583196000_bib50 article-title: Exploring the limits of transfer learning with a unified text-to-text transformer publication-title: Journal of Machine Learning Research – volume: 27 start-page: 379 issue: 3 year: 1948 ident: 2023121518583196000_bib59 article-title: A mathematical theory of communication publication-title: The Bell System Technical Journal doi: 10.1002/j.1538-7305.1948.tb01338.x – start-page: 66 volume-title: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) year: 2018 ident: 2023121518583196000_bib36 article-title: Subword regularization: Improving neural network translation models with multiple subword candidates doi: 10.18653/v1/P18-1007 – start-page: 3718 volume-title: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) year: 2021 ident: 2023121518583196000_bib57 article-title: CDRNN: Discovering complex dynamics in human language processing doi: 10.18653/v1/2021.acl-long.288 – start-page: 81 volume-title: Proceedings of the 2010 Workshop on Cognitive Modeling and Computational Linguistics year: 2010 ident: 2023121518583196000_bib18 article-title: Uncertainty reduction as a measure of cognitive processing effort – volume: 19 year: 2006 ident: 2023121518583196000_bib40 article-title: Speakers optimize information density through syntactic reduction publication-title: Advances in Neural Information Processing Systems doi: 10.7551/mitpress/7503.003.0111 – volume: 87 start-page: 539 issue: 3 year: 2011 ident: 2023121518583196000_bib47 article-title: A cross-language perspective on speech information rate publication-title: Language doi: 10.1353/lan.2011.0057 – start-page: 138 volume-title: Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022 year: 2022 ident: 2023121518583196000_bib11 article-title: The effects of surprisal across languages: Results from native and non-native reading – volume: 109 start-page: 193 issue: 2 year: 2008 ident: 2023121518583196000_bib12 article-title: Data from eye-tracking corpora as evidence for theories of syntactic processing complexity publication-title: Cognition doi: 10.1016/j.cognition.2008.07.008 – volume-title: Proceedings of the 12th European Conference on Eye Movements year: 2003 ident: 2023121518583196000_bib34 article-title: The Dundee corpus – volume: 68 start-page: 255 issue: 3 year: 2013 ident: 2023121518583196000_bib2 article-title: Random effects structure for confirmatory hypothesis testing: Keep it maximal publication-title: Journal of Memory and Language doi: 10.1016/j.jml.2012.11.001 – start-page: 61 volume-title: Proceedings of the 3rd Workshop on Cognitive Modeling and Computational Linguistics (CMCL 2012) year: 2012 ident: 2023121518583196000_bib16 article-title: Sequential vs. hierarchical syntactic models of human incremental sentence processing – start-page: 4975 volume-title: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics year: 2019 ident: 2023121518583196000_bib44 article-title: What kind of language is hard to language-model? doi: 10.18653/v1/P19-1491 – volume: 11 start-page: 336 year: 2023 ident: 2023121518583196000_bib45 article-title: Why does surprisal from larger transformer-based language models provide a poorer fit to human reading times? publication-title: Transactions of the Association for Computational Linguistics doi: 10.1162/tacl_a_00548 – start-page: 4171 volume-title: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) year: 2019 ident: 2023121518583196000_bib13 article-title: BERT: Pre-training of deep bidirectional transformers for language understanding doi: 10.18653/v1/N19-1423 – volume-title: Talk at 26th Architectures and Mechanisms for Language Processing conference (AMLaP 26) year: 2020 ident: 2023121518583196000_bib4 article-title: A-maze of natural stories: Texts are comprehensible using the maze task
SSID	ssj0001818062
Score	2.4076583
Snippet	Surprisal theory posits that less-predictable words should take more time to process, with word predictability quantified as surprisal, i.e., negative log...
SourceID	doaj proquest crossref
SourceType	Open Website Aggregation Database Enrichment Source Index Database
StartPage	1451
SubjectTerms	English language Information theory Language modeling Language processing Multilingualism Natural language processing Predictions Reading Theoretical linguistics
SummonAdditionalLinks	– databaseName: ProQuest Central dbid: BENPR link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3PS8MwFA66XbyIouJ0Sg56krKlTdMUD-JkMkTH0A12C0maDmG0c6v_v3lZuimi1zaXvLyfX16-h9CV6YaKcJ0FkTIA3UgS8DilgcrDJLZVUKYlQAMvQzaY0KdpPPWA28q3VdY-0TnqrNSAkXdCa5WUhCwJ7xYfAUyNgttVP0JjFzWtC-a2-Gr2-sPR6xZlgafMbqooNDwDkey07n5nYaeSei6kcGH-R1xy9P2_vLMLOY8HaN_nivh-fbiHaMcUR-h2DLwYxQzbzA2PlnDP4lQHlzl-s1ID0tw5Xj-5x-8FJgQ_e0xydYwmj_3xwyDwExACHbGkCiQnkqrIlixKmgSKGaptzFcszQwHCFOmXGmrDIlkubKZlKJdk8WRylJrvJGMTlCjKAtzijDR0qhEURXnhmYJT6lRKdx5hkBZ1-220E29f6E9PThMqZgLVyawUHyXVgtdb1Yv1rQYf6zrgSg3a4DM2n0olzPhbUOwlHNiqJLWMVOrGzbpk4YlNKa5jE3EW6hdH4TwFrYSW304-__3OdqDEfHQgkJoGzWq5ae5sIlEpS69tnwBM3fGug priority: 102 providerName: ProQuest
Title	Testing the Predictions of Surprisal Theory in 11 Languages
URI	https://www.proquest.com/docview/2911412672 https://doaj.org/article/69881e4ba7624ca3951ae67454fa5e38
Volume	11
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3dS8MwEA86X3wRRcXpHHnQJylb0jRJ8cnJ5hAdQzfYW0jSVITRyVb_f3NpNyYivvharqS9y-U-cvc7hK5clxoibRbFxkHqRpNIJimLTE5F4qOgzGpIDTyP-HDKHmfJbGvUF9SEVfDAFeM6PJWSOGa011rmX_QegXZcsITlOnFxaPP1Nm8rmArZFWhh5nRd6c5pp9R2rrQKJv2bDQpQ_T9O4mBeBofooPYL8V31PUdoxxXH6HYCGBjFG_ZeGh4v4U4lbBO8yPGr5xAA5M5x1V6P3wtMCH6q84-rEzQd9Cf3w6iedhDZmIsy0pJoZmIfnhjtBAQuzHr7bniaOQnpSp1KY73ghea58V6TYV2XJbHJUq-osY5PUaNYFO4MYWK1M8Iwk-SOZUKmzJkU7jcpwNN1u010s_5_ZWsocJhIMVchJOBUbXOria431B8VBMYvdD1g5YYGgKvDAy9OVYtT_SXOJmqtBaFqbVop6k9kRigX9Pw_1rhA-zA0HopSCGuhRrn8dJfetShNG-3KwUMb7fX6o_FLO-ypL6sSzdc
linkProvider	Directory of Open Access Journals
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwzV1Nb9QwEB3140AvFASIhRZ8oCeUdu04jiPUAxSqLd1WFWzF3oztOKuKVbbaTYXav9K_0h_HTDZpoajcKnFNrCiy38y8GY-fAd6ErnBc-zyKXaDSjeWRTjIZuUKkCWZBubdUGjg4VL1j-XmYDBfgsj0LQ22VrU-sHXU-8VQj3xJolZILlYqmg3I_nP_E_Gy2vfcRF3NDiN1Pg51e1FwhEPlYpVVkNbfSxcj5nQ0pZQPSY9B0KsuDphqgzbTzOJupVYVDKuJkN-RJ7PIM0R_bGL-7CMuYVSRo9cs7X_pH325KOHROur6ylLqpSaV22LbWK7FVWT821tQc4o-gV98N8Jfrr-PZ7ipctTMxb2P5sXlWuU1_cUsk8j-dqkfwsOHR7P0c-I9hIZRP4N2ANEPKEUNWy46mtAdVmxWbFOwrIooEhcdsLkfATkrGOes39drZUzi-l_99BkvlpAzPgXFvg0uddEkRZJ7qTAaX0X6wIDm_brcDb9vlM76RTqcbPMamTqGUML8vdgc2rkefziVD7hj3gZBwPYaEvusHk-nINH7DqExrHqSzGLQk2g0SYhtUKhNZ2CTEugNrLQZM431m5gYAL_79-jU86A0O-qa_d7j_ElYEEjhq1eFyDZaq6VlYR8JVuVcN8Bl8v28A_QLjBDgo
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Testing+the+Predictions+of+Surprisal+Theory+in+11+Languages&rft.jtitle=Transactions+of+the+Association+for+Computational+Linguistics&rft.au=Ethan+G.+Wilcox&rft.au=Tiago+Pimentel&rft.au=Clara+Meister&rft.au=Ryan+Cotterell&rft.date=2023-12-14&rft.pub=The+MIT+Press&rft.eissn=2307-387X&rft.volume=11&rft_id=info:doi/10.1162%2Ftacl_a_00612&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_69881e4ba7624ca3951ae67454fa5e38
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2307-387X&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2307-387X&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2307-387X&client=summon