Knowledge-enhanced prototypical network with class cluster loss for few-shot relation classification

Few-shot Relation Classification identifies the relation between target entity pairs in unstructured natural language texts by training on a small number of labeled samples. Recent prototype network-based studies have focused on enhancing the prototype representation capability of models by incorpor...

Full description

Saved in:

Bibliographic Details
Published in	PloS one Vol. 18; no. 6; p. e0286915
Main Authors	Liu, Tao, Ke, Zunwang, Li, Yanbing, Silamu, Wushour
Format	Journal Article
Language	English
Published	United States Public Library of Science 08.06.2023 Public Library of Science (PLoS)
Subjects	Biology and Life Sciences Classification Clusters Computational linguistics Computer and Information Sciences Data mining Data visualization Datasets Design Engineering and Technology Evaluation Graph neural networks Labeling Language processing Metric space Modelling Natural language Natural language interfaces Natural language processing Neural networks Optimization algorithms Outliers (statistics) Physical Sciences Prototypes Representations Semantics Similarity Social Sciences Training China Bulgaria
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Few-shot Relation Classification identifies the relation between target entity pairs in unstructured natural language texts by training on a small number of labeled samples. Recent prototype network-based studies have focused on enhancing the prototype representation capability of models by incorporating external knowledge. However, the majority of these works constrain the representation of class prototypes implicitly through complex network structures, such as multi-attention mechanisms, graph neural networks, and contrastive learning, which constrict the model’s ability to generalize. In addition, most models with triplet loss disregard intra-class compactness during model training, thereby limiting the model’s ability to handle outlier samples with low semantic similarity. Therefore, this paper proposes a non-weighted prototype enhancement module that uses the feature-level similarity between prototypes and relation information as a gate to filter and complete features. Meanwhile, we design a class cluster loss that samples difficult positive and negative samples and explicitly constrains both intra-class compactness and inter-class separability to learn a metric space with high discriminability. Extensive experiments were done on the publicly available dataset FewRel 1.0 and 2.0, and the results show the effectiveness of the proposed model.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 Competing Interests: The authors have declared that no competing interests exist.
ISSN:	1932-6203 1932-6203
DOI:	10.1371/journal.pone.0286915