METHOD AND DEVICE FOR GATHERING INFORMATION AND RECORDING MEDIUM HAVING RECORDED INFORMATION GATHERING PROGRAM
Main Authors | , , , , |
---|---|
Format | Patent |
Language | English |
Published | 30.11.1999 |
Edition | 6 |
Subjects | |
Summary: | PROBLEM TO BE SOLVED: To make it possible to predict which resources need to be gathered, without actually gathering them, by classifying resources distributed over a network by content and selectively and automatically gathering only the necessary ones. SOLUTION: A World Wide Web (WWW) document stored in a data saving part 4 is supplied to a content analysis part 11 through a hyperlink extraction part 5 and a hyperlink evaluation part 6. The content analysis part 11 performs morphological analysis of the WWW document to extract nouns, proper nouns, and undefined words, generates a word list for each document, and supplies the lists to a content prediction and learning part 12. Characteristic patterns of specific fields are also extracted from the WWW documents, and a list of them is generated for each document and supplied to the content prediction and learning part 12. In addition, HTML tags are extracted from each WWW document, and a list of the tags used for hyperlinks is generated and supplied to the content prediction and learning part 12. Thus, resources distributed over the network are classified by content, and only the necessary resources are gathered. |
---|---|
Bibliography: | Application Number: JP19980135195 |
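
The feature extraction described in the summary can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `LinkAndTextExtractor`, `word_list`, `analyze`, and `score_link` are hypothetical, a plain regular-expression tokenizer stands in for the morphological analysis, and the keyword-overlap score is only an assumed placeholder for the content prediction and learning part, whose actual method the summary does not specify.

```python
import re
from html.parser import HTMLParser


class LinkAndTextExtractor(HTMLParser):
    """Collects hyperlinks (href and anchor text) and visible text from one page."""

    def __init__(self):
        super().__init__()
        self.links = []        # list of (href, anchor_text) pairs
        self.text_parts = []   # visible text fragments of the whole page
        self._href = None      # href of the <a> element currently open, if any
        self._anchor = []      # text fragments inside the open <a> element

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self._href = href
                self._anchor = []

    def handle_data(self, data):
        self.text_parts.append(data)
        if self._href is not None:
            self._anchor.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._anchor).strip()))
            self._href = None


def word_list(text):
    # Assumption: a stand-in for morphological analysis that simply
    # returns lowercase alphabetic tokens.
    return re.findall(r"[a-z]+", text.lower())


def analyze(html):
    # Build the per-document word list and the list of hyperlink tags.
    parser = LinkAndTextExtractor()
    parser.feed(html)
    parser.close()
    return {"words": word_list(" ".join(parser.text_parts)),
            "links": parser.links}


def score_link(anchor_text, topic_words):
    # Hypothetical relevance score: the fraction of anchor-text words
    # that appear in a given topic vocabulary.
    words = word_list(anchor_text)
    if not words:
        return 0.0
    return sum(w in topic_words for w in words) / len(words)
```

In this sketch, a crawler could call `analyze` on each stored page and then use `score_link` on the anchor text of each extracted hyperlink to decide whether the linked resource is worth gathering, without fetching it first.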