Adaptive gathering of structured and unstructured data system and method

Content is obtained from a webpage accessed via a URI, which URI is obtained from a URI queue. The content is parsed for price and product information according to a parse map, with the resulting parse result being stored. The priority of URIs in the URI queue is adjusted based on analysis of the pa...

Full description

Saved in:
Bibliographic Details
Main Authors MUPPALLA RAJESH, PARTHASARATHY SANJAY, KALIKIVAYI SATYANARAYANA RAO
Format Patent
LanguageEnglish
Published 03.06.2015
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Content is obtained from a webpage accessed via a URI, which URI is obtained from a URI queue. The content is parsed for price and product information according to a parse map, with the resulting parse result being stored. The priority of URIs in the URI queue is adjusted based on analysis of the parse result for changes in price and product attributes and according to other criteria. The parse map may be one associated with the URI or a general purpose parse maps. The parse result may be validated by human- and machine-based systems, including by graphically labeling price and product information in the content for human confirmation or correction.
Bibliography:Application Number: CN201380049781