The design and implementation of a subject-oriented Web information classification system

Bibliographic Details
Published in: Proceedings of the Ninth International Conference on Computer Supported Cooperative Work in Design: May 24-26, 2005, Coventry, UK, Vol. 2, pp. 836-840
Main Authors: Yishan Huang, Qianping Wang, Jing Yang, Quan Ding
Format: Conference Proceeding
Language: English
Published: IEEE, 2005
ISBN: 1846000025, 9781846000027
DOI: 10.1109/CSCWD.2005.194294

Summary: With the explosive growth of the World Wide Web, it is becoming increasingly difficult for users to collect and analyze Web pages that are relevant to a particular subject. This paper presents a subject-oriented Web information classification system (WICS), which efficiently collects Web pages, classifies them into several subjects, and provides the search results to users. Based on an analysis of ordinary search engines, Web text mining is introduced and applied to WICS. The prototype employs text preprocessing, indexing, inverted files, and a vector space distance algorithm (vector space model, VSM). Initial experiments show that classifying Web information with the prototype makes it more convenient for users to query information, and that relevancy and precision are improved.
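
The pipeline named in the summary (text preprocessing, an inverted file, and vector space matching) might look roughly like the following minimal sketch. It is not the WICS implementation: the abstract does not specify the term weighting or the distance measure, so smoothed TF-IDF weights and cosine similarity are assumed here, and the stopword list, the `SUBJECTS` profiles, and all function names are hypothetical.

```python
import math
import re
from collections import Counter, defaultdict

# Illustrative stopword list (assumption; not taken from the paper).
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for"}

def preprocess(text):
    """Lowercase, keep alphabetic tokens, drop stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]

def build_inverted_index(docs):
    """Inverted file: term -> {doc_id: term frequency}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term, tf in Counter(preprocess(text)).items():
            index[term][doc_id] = tf
    return index

def tfidf_vector(tokens, index, n_docs):
    """Sparse TF-IDF vector; terms missing from the index are ignored."""
    vec = {}
    for term, tf in Counter(tokens).items():
        if term in index:
            idf = math.log((1 + n_docs) / (1 + len(index[term]))) + 1.0
            vec[term] = tf * idf
    return vec

def cosine(u, v):
    """Cosine similarity between two sparse vectors (0.0 if either is empty)."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

def classify(page_text, subjects, index, n_docs):
    """Return the best-matching subject and the score for every subject."""
    page_vec = tfidf_vector(preprocess(page_text), index, n_docs)
    scores = {
        name: cosine(page_vec, tfidf_vector(preprocess(profile), index, n_docs))
        for name, profile in subjects.items()
    }
    return max(scores, key=scores.get), scores

# Hypothetical subject profiles standing in for labeled training pages.
SUBJECTS = {
    "sports": "football match team score league goal",
    "finance": "stock market bank interest investment fund",
}
index = build_inverted_index(SUBJECTS)
label, scores = classify(
    "The team scored a late goal in the league match", SUBJECTS, index, len(SUBJECTS)
)
print(label, scores)
```

In this toy setup each subject is represented by a single profile text; a fuller system would presumably build each subject vector from many collected pages and apply stemming, so that "scored" and "score" map to the same term.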