Assessing the Capacity of Transformer to Abstract Syntactic Representations: A Contrastive Analysis Based on Long-distance Agreement

Many studies have shown that transformers are able to predict subject-verb agreement, demonstrating their ability to uncover an abstract representation of the sentence in an unsupervised way. Recently, Li et al. ( ) found that transformers were also able to predict the object-past participle agreeme...

Full description

Saved in:

Bibliographic Details
Published in	Transactions of the Association for Computational Linguistics Vol. 11; pp. 18 - 33
Main Authors	Li, Bingzhi, Wisniewski, Guillaume, Crabbé, Benoît
Format	Journal Article
Language	English
Published	One Broadway, 12th Floor, Cambridge, Massachusetts 02142, USA MIT Press 12.01.2023 MIT Press Journals, The The MIT Press
Subjects	Anaphora Artificial Intelligence Computational linguistics Computer Science Contrastive analysis Document and Text Processing French language Grammatical agreement Grammatical subject Linguistics Modelling Neural networks Representations Syntactic analysis Syntactic movement Transformers Verbs
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Many studies have shown that transformers are able to predict subject-verb agreement, demonstrating their ability to uncover an abstract representation of the sentence in an unsupervised way. Recently, Li et al. ( ) found that transformers were also able to predict the object-past participle agreement in French, the modeling of which in formal grammar is fundamentally different from that of subject-verb agreement and relies on a movement and an anaphora resolution. To better understand transformers’ internal working, we propose to contrast how they handle these two kinds of agreement. Using probing and counterfactual analysis methods, our experiments on French agreements show that the agreement task suffers from several confounders that partially question the conclusions drawn so far and transformers handle subject-verb and object-past participle agreements in a way that is consistent with their modeling in theoretical linguistics.
Bibliography:	2023 ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	2307-387X 2307-387X
DOI:	10.1162/tacl_a_00531