METHOD FOR CLASSIFYING AN UNMANAGED DATASET
A computer implemented method for classifying at least one source dataset of a computer system. The method may include providing a plurality of associated reference tables organized and associated in accordance with a reference storage model in the computer system. The method may also include calcul...
Saved in:
Main Authors | , , , |
---|---|
Format | Patent |
Language | English |
Published |
10.03.2022
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | A computer implemented method for classifying at least one source dataset of a computer system. The method may include providing a plurality of associated reference tables organized and associated in accordance with a reference storage model in the computer system. The method may also include calculating, by a data classifier application of the computer system, a first similarity score between the source dataset and a first reference table of the reference tables based on common attributes in the source dataset and a join of the first reference table with at least one further reference table of the reference tables having a relationship with the first reference table. The method may further include classifying, by the data classifier application, the source dataset by determining using at least the calculated first similarity score whether the source dataset is organized as the first reference table in accordance to the reference storage model. |
---|---|
Bibliography: | Application Number: US202117455068 |