A blended feature selection method in text classification

In text classification system, accuracy is a major indicator of performance, and feature selection method has a significant impact on it. In this paper, we propose a blended feature selection method, which combines four traditional feature selection methods (document frequency, information gain, mut...

Full description

Saved in:
Bibliographic Details
Published inIET Conference Proceedings pp. 577 - 580
Main Authors Shen, Kewei, Chen, Xian, Ma, Jing, Le, Ke, Lu, Yueming, Zhang, Kuo
Format Conference Proceeding
LanguageEnglish
Published Stevenage, UK IET 2013
The Institution of Engineering & Technology
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In text classification system, accuracy is a major indicator of performance, and feature selection method has a significant impact on it. In this paper, we propose a blended feature selection method, which combines four traditional feature selection methods (document frequency, information gain, mutual information and chi-square) into a better feature selection method. BFSM is tested on a Chinese corpus of 4000 documents and improves the accuracy compared with those traditional methods.
ISBN:9781849198011
1849198012
DOI:10.1049/cp.2013.2077