Learning Refined Features for Open-World Text Classification

Open-world classification requires a classifier not only to classify samples of the observed classes but also to detect samples which are not suitable to be classified as the known classes. State-of-the-art methods train a network to extract features for separating known classes firstly. Then some s...

Full description

Saved in:
Bibliographic Details
Published inWeb and Big Data pp. 367 - 381
Main Authors Li, Zeting, Cai, Yi, Tan, Xingwei, Han, Guoqiang, Ren, Haopeng, Wu, Xin, Li, Wen
Format Book Chapter
LanguageEnglish
Published Cham Springer International Publishing
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Open-world classification requires a classifier not only to classify samples of the observed classes but also to detect samples which are not suitable to be classified as the known classes. State-of-the-art methods train a network to extract features for separating known classes firstly. Then some strategies, such as outlier detector, are used to reject samples from unknown classes based on the feature space. However, this network as a feature extractor cannot model comprehensive features of known classes in an open world scenario due to limited training data. This causes a problem that the strategies are unable to separate unknown classes from known classes accurately in this feature space. Motivated by the theory of psychology and cognitive science, we utilize class descriptions summarized by human to refine discriminant features and propose a regularization with class descriptions. The regularization is incorporated into DOC (one of state-of-the-art models) to improve the performance of open-world classification. The experiments on two text classification datasets demonstrate the effectiveness of the proposed method.
ISBN:9783030858957
3030858952
ISSN:0302-9743
1611-3349
DOI:10.1007/978-3-030-85896-4_29