Identifier Contribution Allocation in Synthetic Data Generation in Computer-Based Reasoning Systems

Bibliographic Details
Main Authors Beel, Jacob; Srinivasamurthy, Ravisutha Sakrepatna; Hazard, Christopher James; Resnick, Michael; Shah, Yash
Format Patent
Language English
Published 29.05.2025
Summary: Techniques for synthetic data generation in computer-based reasoning systems are discussed and include receiving a request for generation of synthetic data based on a set of training data cases. One or more focal training data cases are determined. For undetermined features (either all of them or those that are not subject to conditions), a value for the feature is determined based on the focal cases. In some embodiments, the generated synthetic data may be checked for similarity against the training data, and if similarity conditions are met, it may be modified (e.g., resampled), removed, and/or replaced.
Bibliography: Application Number: US202418967075
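
The flow described in the summary can be illustrated with a minimal Python sketch. It is not the patented method: the record does not say how focal cases are chosen, how feature values are derived from them, or how similarity is measured. The sketch assumes numeric features, nearest-neighbor selection of focal cases, uniform sampling within the range spanned by the focal cases, and a Euclidean distance threshold for the similarity check. All function and parameter names (`pick_focal_cases`, `generate_case`, `k`, `min_distance`, `max_tries`) are hypothetical.

```python
import math
import random


def distance(a, b, features):
    """Euclidean distance between two cases over the given features."""
    return math.sqrt(sum((a[f] - b[f]) ** 2 for f in features))


def pick_focal_cases(training, conditions, k, rng):
    """Assumed rule: take the k cases nearest to the conditions, or k random cases."""
    k = min(k, len(training))
    if not conditions:
        return rng.sample(training, k)
    feats = list(conditions)
    return sorted(training, key=lambda c: distance(c, conditions, feats))[:k]


def generate_case(training, conditions=None, k=3, min_distance=0.05,
                  max_tries=20, rng=None):
    """Return one synthetic case, or None if every attempt was too similar."""
    rng = rng or random.Random()
    conditions = dict(conditions or {})
    features = list(training[0])
    for _ in range(max_tries):
        focal = pick_focal_cases(training, conditions, k, rng)
        case = dict(conditions)
        for f in features:
            if f in case:
                continue  # conditioned features keep the requested value
            values = [c[f] for c in focal]
            # Assumed rule: draw uniformly within the focal cases' range.
            case[f] = rng.uniform(min(values), max(values))
        # Similarity check: accept only if no training case is too close.
        if all(distance(case, t, features) >= min_distance for t in training):
            return case
        # Otherwise resample on the next loop iteration.
    return None  # corresponds to removing the case


if __name__ == "__main__":
    training = [
        {"age": 0.1, "income": 0.2},
        {"age": 0.2, "income": 0.3},
        {"age": 0.4, "income": 0.5},
    ]
    print(generate_case(training, conditions={"age": 0.3}, k=2))
```

Returning `None` after `max_tries` stands in for the "removed" outcome in the summary; a caller could instead request a replacement case. Features are assumed to be pre-scaled to comparable ranges so that a single distance threshold is meaningful.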