An Improved TFIDF Algorithm in Text Classification

Term frequency/inverse document frequency (TF-IDF) is widely used in text classification at present, which is borrowed from Information Retrieval. Based on this conventional classical TF-IDF formula, we present a new TF-IDF weight schemes named CTF-IDF. The experiment shows that the improved method...

Full description

Saved in:
Bibliographic Details
Published inApplied Mechanics and Materials Vol. 651-653; pp. 2258 - 2261
Main Authors Wu, Shao Bo, Xu, Dong Dong
Format Journal Article
LanguageEnglish
Published Zurich Trans Tech Publications Ltd 01.09.2014
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Term frequency/inverse document frequency (TF-IDF) is widely used in text classification at present, which is borrowed from Information Retrieval. Based on this conventional classical TF-IDF formula, we present a new TF-IDF weight schemes named CTF-IDF. The experiment shows that the improved method is feasible and effective. Furthermore, from the subsequent evaluations using 10-fold cross-validation, we can see the CTF-IDF greatly improves the accuracy of text classification.
Bibliography:Selected, peer reviewed papers from the 2014 3rd International Conference on Advanced Engineering Materials and Architecture Science (ICAEMAS 2014), July 26-27, 2014, Huhhot, Inner Mongolia, China
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISBN:9783038352679
3038352675
ISSN:1660-9336
1662-7482
1662-7482
DOI:10.4028/www.scientific.net/AMM.651-653.2258