SCALABLE ANALYSIS PLATFORM FOR SEMI-STRUCTURED DATA

A data transformation system includes a schema inference module and an export module. The schema inference module is configured to dynamically create a cumulative schema for objects retrieved from a first data source. Each of the retrieved objects includes (i) data and (ii) metadata describing the d...

Full description

Saved in:
Bibliographic Details
Main Authors Sowell Benjamin A, Binkert Nathan A, Harizopoulos Stavros, Kaplan Bryan D, Meyer Kevin R, Tsirogiannis Dimitris, Shah Mehul A
Format Patent
LanguageEnglish
Published 20.07.2017
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A data transformation system includes a schema inference module and an export module. The schema inference module is configured to dynamically create a cumulative schema for objects retrieved from a first data source. Each of the retrieved objects includes (i) data and (ii) metadata describing the data. Dynamically creating the cumulative schema includes, for each object of the retrieved objects, (i) inferring a schema from the object and (ii) selectively updating the cumulative schema to describe the object according to the inferred schema. The export module is configured to output the data of the retrieved objects to a data destination system according to the cumulative schema.
Bibliography:Application Number: US201715478177