Detecting Link Spam Using Temporal Information

How to effectively protect against spam on search ranking results is an important issue for contemporary web search engines. This paper addresses the problem of combating one major type of web spam: 'link spam.' Most of the previous work on anti link spam managed to make use of one snapsho...

Full description

Saved in:
Bibliographic Details
Published inSixth International Conference on Data Mining (ICDM'06) pp. 1049 - 1053
Main Authors Guoyang Shen, Bin Gao, Tie-Yan Liu, Guang Feng, Shiji Song, Hang Li
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.12.2006
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:How to effectively protect against spam on search ranking results is an important issue for contemporary web search engines. This paper addresses the problem of combating one major type of web spam: 'link spam.' Most of the previous work on anti link spam managed to make use of one snapshot of web data to detect spam, and thus it did not take advantage of the fact that link spam tends to result in drastic changes of links in a short time period. To overcome the shortcoming, this paper proposes using temporal information on links in detection of link spam, as well as other information. Specifically, it defines temporal features such as in-link growth rate (IGR) and in-link death rate (IDR) in a spam classification model (i.e., SVM). Experimental results on web domain graph data show that link spam can be successfully detected with the proposed method.
ISBN:9780769527017
0769527019
ISSN:1550-4786
2374-8486
DOI:10.1109/ICDM.2006.51