Finding data in connected corpuses using examples

In one embodiment, datasets are stored in a catalog. The datasets are enriched by establishing relationships among the domains in different datasets. A user searches for relevant datasets by providing examples of the domains of interest. The system identifies datasets corresponding to the user-provi...

Full description

Saved in:
Bibliographic Details
Main Authors Platt, John C, Hays, Christopher Alan, Novik, Lev, Chaudhuri, Surajit, Mukerjee, Kunal, Meijer, Henricus Johannes Maria, Hudis, Efim
Format Patent
LanguageEnglish
Published 27.11.2018
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In one embodiment, datasets are stored in a catalog. The datasets are enriched by establishing relationships among the domains in different datasets. A user searches for relevant datasets by providing examples of the domains of interest. The system identifies datasets corresponding to the user-provided examples. The system them identifies connected subsets of the datasets that are directly linked or indirectly linked through other domains. The user provides known relationship examples to filter the connected subsets and to identify the connected subsets that are most relevant to the user's query. The selected connected subsets may be further analyzed by business intelligence/analytics to create pivot tables or to process the data.
Bibliography:Application Number: US201514659303