An Improved TFIDF Algorithm in Text Classification
Term frequency/inverse document frequency (TF-IDF) is widely used in text classification at present, which is borrowed from Information Retrieval. Based on this conventional classical TF-IDF formula, we present a new TF-IDF weight schemes named CTF-IDF. The experiment shows that the improved method...
Saved in:
Published in | Applied Mechanics and Materials Vol. 651-653; pp. 2258 - 2261 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
Zurich
Trans Tech Publications Ltd
01.09.2014
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Term frequency/inverse document frequency (TF-IDF) is widely used in text classification at present, which is borrowed from Information Retrieval. Based on this conventional classical TF-IDF formula, we present a new TF-IDF weight schemes named CTF-IDF. The experiment shows that the improved method is feasible and effective. Furthermore, from the subsequent evaluations using 10-fold cross-validation, we can see the CTF-IDF greatly improves the accuracy of text classification. |
---|---|
Bibliography: | Selected, peer reviewed papers from the 2014 3rd International Conference on Advanced Engineering Materials and Architecture Science (ICAEMAS 2014), July 26-27, 2014, Huhhot, Inner Mongolia, China ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
ISBN: | 9783038352679 3038352675 |
ISSN: | 1660-9336 1662-7482 1662-7482 |
DOI: | 10.4028/www.scientific.net/AMM.651-653.2258 |