Tracking techniques for generated data

Bibliographic Details
Main Authors Wakabayashi, Takehiro; Nagai, Shingo
Format Patent
Language English
Published 23.01.2024

Summary: An apparatus, a method, and a computer program product are provided that track data used for, and generated by, machine learning so that the data can be deleted accurately and precisely. The method includes receiving a dataset for use in training a machine learning model and registering a file from the dataset in a reference table, wherein the file is designated for monitoring. The file designation can indicate that the file is confidential and must be deleted upon completion of the training of the machine learning model and of the project. The method also includes monitoring the file for events that access it, detecting a read access event occurring on the file, and determining that a derivative file was created as a result of the read access event. The method further includes registering the derivative file in the reference table and indicating, in the reference table, an association between the derivative file and the file.
Bibliography: Application Number: US202217654262
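
The steps in the summary (register a monitored file, react to a read access event, register the resulting derivative file with a link back to its source) can be illustrated with a minimal Python sketch. This is not the patented implementation: the names `Entry`, `ReferenceTable`, `on_read`, and `deletion_set` are hypothetical, read events are passed in by hand rather than detected by a real file monitor, and the choices to let a derivative inherit the confidential designation and to collect derivatives transitively are assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, Optional, Set


@dataclass
class Entry:
    """One row of the (hypothetical) reference table."""
    path: str
    confidential: bool = False
    parent: Optional[str] = None  # path of the file this one was derived from


@dataclass
class ReferenceTable:
    entries: Dict[str, Entry] = field(default_factory=dict)

    def register(self, path: str, confidential: bool = False) -> None:
        """Register a dataset file and designate it for monitoring."""
        self.entries[path] = Entry(path, confidential)

    def on_read(self, source: str, created: Iterable[str]) -> None:
        """Handle a read access event on `source` that produced `created` files."""
        if source not in self.entries:
            return  # the file is not monitored, so nothing is tracked
        for path in created:
            if path in self.entries:
                continue  # already tracked; keep its original association
            # Assumption: a derivative inherits its source's designation.
            self.entries[path] = Entry(
                path, self.entries[source].confidential, parent=source
            )

    def deletion_set(self, root: str) -> Set[str]:
        """Return `root` plus every file derived from it, directly or indirectly."""
        result = {root}
        changed = True
        while changed:
            changed = False
            for entry in self.entries.values():
                if entry.parent in result and entry.path not in result:
                    result.add(entry.path)
                    changed = True
        return result


table = ReferenceTable()
table.register("data/train.csv", confidential=True)
table.on_read("data/train.csv", ["features.npy"])  # derivative of the dataset
table.on_read("features.npy", ["model.pt"])        # derivative of a derivative
print(sorted(table.deletion_set("data/train.csv")))
# ['data/train.csv', 'features.npy', 'model.pt']
```

Storing only a parent link per entry keeps registration cheap; the full set of files to delete is reconstructed by walking those links when deletion is requested.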