METHOD FOR CLASSIFYING AN UNMANAGED DATASET

Bibliographic Details
Main Authors: Seifert, Jens; Oberhofer, Martin; Reddy, Adapala S; Saillet, Yannick
Format: Patent
Language: English
Published: 10.03.2022
Summary: A computer-implemented method for classifying at least one source dataset of a computer system. The method may include providing a plurality of associated reference tables organized and associated in accordance with a reference storage model in the computer system. The method may also include calculating, by a data classifier application of the computer system, a first similarity score between the source dataset and a first reference table of the reference tables, based on common attributes in the source dataset and a join of the first reference table with at least one further reference table of the reference tables having a relationship with the first reference table. The method may further include classifying, by the data classifier application, the source dataset by determining, using at least the calculated first similarity score, whether the source dataset is organized as the first reference table in accordance with the reference storage model.
Bibliography: Application Number: US202117455068
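
Illustrative sketch: the summary describes scoring a source dataset against a reference table joined with its related tables, then classifying on that score. The Python below is one possible reading, not the patented implementation. The summary does not state a scoring formula, so a Jaccard ratio over attribute names is assumed here. The table names, attributes, relationships, and the 0.5 threshold are all hypothetical, and the join is approximated as the union of the attribute sets of the first table and its directly related tables.

```python
from typing import Dict, List, Optional, Set

# Hypothetical reference storage model: table name -> attribute names.
REFERENCE_TABLES: Dict[str, Set[str]] = {
    "customer": {"customer_id", "name", "address_id"},
    "address": {"address_id", "street", "city", "zip"},
    "order": {"order_id", "customer_id", "amount"},
}

# Hypothetical relationships (for example foreign keys) between tables.
RELATIONSHIPS: Dict[str, List[str]] = {
    "customer": ["address"],
    "address": [],
    "order": ["customer"],
}


def joined_attributes(table: str) -> Set[str]:
    """Attributes of `table` joined with its directly related tables.

    Assumption: the join is modeled as a union of attribute-name sets.
    """
    attrs = set(REFERENCE_TABLES[table])
    for related in RELATIONSHIPS.get(table, []):
        attrs |= REFERENCE_TABLES[related]
    return attrs


def similarity_score(source_attrs: Set[str], table: str) -> float:
    """Jaccard ratio of common attributes (an assumed metric)."""
    joined = joined_attributes(table)
    union = source_attrs | joined
    if not union:
        return 0.0
    return len(source_attrs & joined) / len(union)


def classify(source_attrs: Set[str], threshold: float = 0.5) -> Optional[str]:
    """Return the best-matching reference table, or None below threshold."""
    scores = {t: similarity_score(source_attrs, t) for t in REFERENCE_TABLES}
    best = max(scores, key=lambda t: scores[t])
    return best if scores[best] >= threshold else None


if __name__ == "__main__":
    # A denormalized source dataset mixing customer and address columns.
    source = {"customer_id", "name", "street", "city", "zip"}
    print(classify(source))
```

Scoring against the joined attributes rather than the first table alone lets a denormalized source dataset, whose columns span several related reference tables, still match the table it most closely corresponds to.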