Systems and methods for scheduling data flow execution based on an arbitrary graph describing the desired data flow

The data transformation system (DTS) in one embodiment of the present invention comprises a capability to receive data from a data source, a data destination and a capability to store transformed data therein, and a data transformation pipeline (DTP) that constructs complex end-to-end data transform...

Full description

Saved in:
Bibliographic Details
Main Authors BLASZCZAK MICHAEL A, HOWEY JAMES K
Format Patent
LanguageEnglish
Published 23.09.2004
Edition7
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The data transformation system (DTS) in one embodiment of the present invention comprises a capability to receive data from a data source, a data destination and a capability to store transformed data therein, and a data transformation pipeline (DTP) that constructs complex end-to-end data transformation functionality (data flow executions or DFEs) by pipelining data flowing from one or more sources to one or more destinations through various interconnected nodes (that, when instantiated, become components in the pipeline) for transforming the data as it flows by (where the term transforming is used herein to broadly describe the universe of interactions that can be conducted to, with, by, or on data). Each component in the pipeline possesses specific predefined data transformation functionality, and the logical connections between components define the data flow pathway in an operational sense. The data transformation pipeline (DTP) enables a user to develop complex end-to-end data transformation functionality (the DFEs) by graphically describing and representing, via a graphical user interface (GUI), a desired data flow from one or more sources to one or more destinations through various interconnected nodes (a graph). Each node in the graph selected by the user and incorporated in the graph represents specific predefined data transformation functionality (each a component), and connections between the nodes (the components) define the data flow pathway.
Bibliography:Application Number: US20030391726