Infusing Finetuning with Semantic Dependencies

Bibliographic Details
Published in: Transactions of the Association for Computational Linguistics, Vol. 9, pp. 226-242
Main Authors: Wu, Zhaofeng; Peng, Hao; Smith, Noah A.
Format: Journal Article
Language: English
Published: One Rogers Street, Cambridge, MA 02142-1209, USA: MIT Press, 01.01.2021
Summary: For natural language processing systems, two kinds of evidence support the use of text representations from neural language models “pretrained” on large unannotated corpora: performance on application-inspired benchmarks (Peters et al., 2018, inter alia), and the emergence of syntactic abstractions in those representations (Tenney et al., 2019, inter alia). On the other hand, the lack of grounded supervision calls into question how well these representations can ever capture meaning (Bender and Koller, 2020). We apply novel probes to recent language models, specifically focusing on predicate-argument structure as operationalized by semantic dependencies (Ivanova et al., 2012), and find that, unlike syntax, semantics is not brought to the surface by today’s pretrained models. We then use convolutional graph encoders to explicitly incorporate semantic parses into task-specific finetuning, yielding benefits to natural language understanding (NLU) tasks in the GLUE benchmark. This approach demonstrates the potential for general-purpose (rather than task-specific) linguistic supervision, above and beyond conventional pretraining and finetuning. Several diagnostics help to localize the benefits of our approach.
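
The summary sketches the general shape of the model: token representations from a pretrained encoder are refined by graph-convolution layers whose edges come from a semantic dependency parse, and the refined states feed a task classifier during finetuning. The Python sketch below is a minimal illustration of that idea, not the authors' implementation; the class name SemanticGCNLayer, the per-direction weight matrices, the adjacency construction, and the single-layer mean-pooled classifier head are all assumptions made here for exposition.

import torch
import torch.nn as nn

class SemanticGCNLayer(nn.Module):
    # Hypothetical single graph-convolution layer over a semantic
    # dependency graph. Incoming arcs, outgoing arcs, and a self-loop
    # each get their own linear transform; messages are degree-normalized
    # and summed, then passed through a ReLU.
    def __init__(self, dim: int):
        super().__init__()
        self.w_self = nn.Linear(dim, dim)
        self.w_in = nn.Linear(dim, dim)
        self.w_out = nn.Linear(dim, dim)

    def forward(self, h: torch.Tensor, adj: torch.Tensor) -> torch.Tensor:
        # h:   (batch, seq, dim) token states from a pretrained encoder
        # adj: (batch, seq, seq); adj[b, i, j] = 1 if the parse has an
        #      arc from token j to token i
        deg_in = adj.sum(dim=-1, keepdim=True).clamp(min=1.0)
        deg_out = adj.transpose(1, 2).sum(dim=-1, keepdim=True).clamp(min=1.0)
        msg_in = adj @ self.w_in(h) / deg_in
        msg_out = adj.transpose(1, 2) @ self.w_out(h) / deg_out
        return torch.relu(self.w_self(h) + msg_in + msg_out)

# Toy usage: refine stand-in "encoder states" with one layer, then
# mean-pool for a sentence-level, GLUE-style classification head.
batch, seq, dim, num_labels = 2, 6, 16, 3
h = torch.randn(batch, seq, dim)   # stand-in for pretrained encoder output
adj = torch.zeros(batch, seq, seq)
adj[:, 1, 0] = 1.0                 # e.g., one semantic arc: token 0 -> token 1
gcn = SemanticGCNLayer(dim)
classifier = nn.Linear(dim, num_labels)
logits = classifier(gcn(h, adj).mean(dim=1))
print(logits.shape)                # torch.Size([2, 3])
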
ISSN: 2307-387X
DOI: 10.1162/tacl_a_00363