Web Page Recognition Algorithm Based on Link Analysis in Theme Search Engine

Bibliographic Details
Published in 2012 Second International Conference on Cloud and Green Computing, pp. 405-409
Main Authors Zude Chen, Jianxun Liu, Haijun Zhai, Lei Jiang, Buqing Cao
Format Conference Proceeding
Language English
Published IEEE 01.11.2012
Summary: Web page recognition is a problem in the design of web crawlers for theme search engines. This paper designs a web page recognition algorithm based on link analysis to solve it. The main idea of the algorithm is to build a recognition model for relevant web pages by combining link analysis with a theme URL knowledge base, drawing on ideas from statistics and social network analysis. In the experiment, the precision of the algorithm is over 93 percent and the recall is up to 85.4 percent. The authors report that this result is significant and better than that of other web page recognition algorithms, and that the experimental results show the feasibility and effectiveness of the algorithm.
ISBN: 1467330272, 9781467330275
DOI: 10.1109/CGC.2012.42
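
The summary names two ingredients, link analysis and a theme URL knowledge base, and reports precision and recall, but it does not give the model itself. The following is only an illustrative sketch under stated assumptions, not the authors' algorithm: it assumes the knowledge base is a plain set of host names and scores a page by the share of its outlinks that point into that set. The names `link_relevance`, `is_theme_page`, `precision_recall`, and the 0.5 threshold are hypothetical.

```python
from urllib.parse import urlparse


def host_of(url):
    """Return the lower-cased host part of a URL ('' if it has none)."""
    return urlparse(url).netloc.lower()


def link_relevance(outlinks, theme_hosts):
    """Share of a page's outlinks whose host is in the theme knowledge base.

    Assumption: the knowledge base is a set of host names. The paper's
    actual model is not described in the summary.
    """
    hosts = [h for h in (host_of(u) for u in outlinks) if h]
    if not hosts:
        return 0.0
    return sum(h in theme_hosts for h in hosts) / len(hosts)


def is_theme_page(outlinks, theme_hosts, threshold=0.5):
    """Classify a page as on-theme when its link score reaches the threshold.

    The threshold value is a hypothetical choice for illustration.
    """
    return link_relevance(outlinks, theme_hosts) >= threshold


def precision_recall(predicted, actual):
    """Standard precision and recall over sets of page identifiers."""
    true_positives = len(predicted & actual)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(actual) if actual else 0.0
    return precision, recall
```

Precision here is the fraction of pages labeled on-theme that really are on-theme, and recall is the fraction of truly on-theme pages that were found, which is how the 93 percent and 85.4 percent figures in the summary would normally be read.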