System and method for adaptive gathering of structured and unstructured data

Bibliographic Details
Main Authors: SANJAY PARTHASARATHY, SATYANARAYANA RAO KALIKIVAYI, RAJESH MUPPALLA
Format: Patent
Language: English, Hebrew
Published: 30.08.2018
Summary: Content is obtained from a webpage accessed via a URI, which is itself obtained from a URI queue. The content is parsed for price and product information according to a parse map, and the resulting parse result is stored. The priority of URIs in the URI queue is adjusted based on analysis of the parse result for changes in price and product attributes, and according to other criteria. The parse map may be one associated with the URI or a general-purpose parse map. The parse result may be validated by human- and machine-based systems, including by graphically labeling price and product information in the content for human confirmation or correction.
Bibliography: Application Number: IL20150236890
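
Illustration: the loop described in the Summary (take a URI from a priority queue, parse the page with a parse map, store the result, re-prioritize the URI when price or product attributes change) can be sketched roughly as follows. This is a minimal sketch and not the patented implementation. Every name here (`UriQueue`, `GENERAL_PARSE_MAP`, `SITE_PARSE_MAPS`, `crawl_once`), the regex-based form of the parse map, and the "changed pages move up one step, unchanged pages move down one step" rule are assumptions made for illustration only. The caller supplies a `fetch` function, so no network access is needed.

```python
import heapq
import re

# Hypothetical general-purpose parse map: field name -> regex with one capture group.
GENERAL_PARSE_MAP = {
    "product": r"<h1>(.*?)</h1>",
    "price": r"\$(\d+(?:\.\d{2})?)",
}

# Hypothetical per-URI parse maps; a URI without an entry uses the general map.
SITE_PARSE_MAPS = {}


def parse(content, parse_map):
    """Apply each pattern in the parse map; fields that do not match become None."""
    result = {}
    for field, pattern in parse_map.items():
        match = re.search(pattern, content)
        result[field] = match.group(1) if match else None
    return result


class UriQueue:
    """Min-heap of (priority, insertion order, uri); a lower number is crawled sooner."""

    def __init__(self):
        self._heap = []
        self._counter = 0  # tie-breaker so equal priorities pop in insertion order

    def push(self, uri, priority):
        heapq.heappush(self._heap, (priority, self._counter, uri))
        self._counter += 1

    def pop(self):
        priority, _, uri = heapq.heappop(self._heap)
        return uri, priority

    def __len__(self):
        return len(self._heap)


def crawl_once(queue, fetch, store):
    """Process one URI: fetch, parse, store, then requeue with an adjusted priority."""
    uri, priority = queue.pop()
    content = fetch(uri)
    parse_map = SITE_PARSE_MAPS.get(uri, GENERAL_PARSE_MAP)
    result = parse(content, parse_map)

    previous = store.get(uri)
    store[uri] = result

    # Assumed rule: a page whose parse result changed is revisited sooner,
    # an unchanged (or first-seen) page is revisited later.
    changed = previous is not None and previous != result
    new_priority = max(0, priority - 1) if changed else priority + 1
    queue.push(uri, new_priority)
    return uri, result, new_priority
```

In this sketch a price change between two visits moves the URI one step toward the front of the queue. The "other criteria" and the human or machine validation step mentioned in the Summary are not modeled.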