Using BERT for Word Sense Disambiguation

Word Sense Disambiguation (WSD), which aims to identify the correct sense of a given polyseme, is a long-standing problem in NLP. In this paper, we propose to use BERT to extract better polyseme representations for WSD and explore several ways of combining BERT and the classifier. We also utilize se...

Full description

Saved in:
Bibliographic Details
Main Authors Du, Jiaju, Qi, Fanchao, Sun, Maosong
Format Journal Article
LanguageEnglish
Published 18.09.2019
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Word Sense Disambiguation (WSD), which aims to identify the correct sense of a given polyseme, is a long-standing problem in NLP. In this paper, we propose to use BERT to extract better polyseme representations for WSD and explore several ways of combining BERT and the classifier. We also utilize sense definitions to train a unified classifier for all words, which enables the model to disambiguate unseen polysemes. Experiments show that our model achieves the state-of-the-art results on the standard English All-word WSD evaluation.
DOI:10.48550/arxiv.1909.08358