Web Page Recognition Algorithm Based on Link Analysis in Theme Search Engine
Published in | 2012 Second International Conference on Cloud and Green Computing, pp. 405-409 |
---|---|
Format | Conference Proceeding |
Language | English |
Published | IEEE, 01.11.2012 |
Summary: | Web page recognition is a problem in the design of the web crawler of a theme search engine. This paper presents a web page recognition algorithm based on link analysis to address it. The main idea is to build a recognition model for relevant web pages by combining link analysis with a theme URL knowledge base, drawing on ideas from statistics and social network analysis. In the experiment, the algorithm achieves a precision above 93 percent and a recall of up to 85.4 percent, which the authors report as a significant improvement over other web page recognition algorithms. The experimental results show the feasibility and effectiveness of the algorithm. |
---|---|
ISBN: | 1467330272, 9781467330275 |
DOI: | 10.1109/CGC.2012.42 |
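
The summary reports results as precision and recall. As a reading aid, the following minimal sketch shows how these two measures are conventionally computed for a page recognizer. The function name `precision_recall` and the toy URLs are hypothetical and are not taken from the paper; the record does not describe the paper's own evaluation code.

```python
def precision_recall(predicted, actual):
    """Return (precision, recall) for two collections of page URLs.

    predicted: pages the recognizer labeled as on-theme
    actual:    pages that are truly on-theme (ground truth)
    """
    predicted, actual = set(predicted), set(actual)
    true_positives = len(predicted & actual)
    # Precision: share of recognized pages that are truly on-theme.
    precision = true_positives / len(predicted) if predicted else 0.0
    # Recall: share of truly on-theme pages that were recognized.
    recall = true_positives / len(actual) if actual else 0.0
    return precision, recall


# Hypothetical toy data for illustration only (not the paper's data).
predicted_relevant = {"a.example/1", "a.example/2", "b.example/3", "c.example/4"}
truly_relevant = {"a.example/1", "a.example/2", "b.example/3", "d.example/5", "e.example/6"}

p, r = precision_recall(predicted_relevant, truly_relevant)
print(f"precision={p:.2f}, recall={r:.2f}")
```

With this toy data, three of the four recognized pages are truly relevant and three of the five relevant pages are found, so a recognizer can score higher on precision than on recall, as in the reported results.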