Identifier Contribution Allocation in Synthetic Data Generation in Computer-Based Reasoning Systems

Bibliographic Details
Main Authors	Beel, Jacob; Srinivasamurthy, Ravisutha Sakrepatna; Hazard, Christopher James; Resnick, Michael; Shah, Yash
Format	Patent
Language	English
Published	29.05.2025
Subjects	CALCULATING; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS; COMPUTING; COUNTING; PHYSICS
Summary:	Techniques for synthetic data generation in computer-based reasoning systems are discussed. They include receiving a request to generate synthetic data based on a set of training data cases and determining one or more focal training data cases. For each undetermined feature (either all features or those that are not subject to conditions), a value is determined based on the focal cases. In some embodiments, the generated synthetic data may be checked for similarity against the training data, and if similarity conditions are met, the synthetic data may be modified (e.g., resampled), removed, and/or replaced.
Bibliography:	Application Number: US202418967075
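
The flow described in the summary can be illustrated with a minimal Python sketch. It is not the patented method. The record does not say how focal cases are chosen, how feature values are derived from them, or what the similarity condition is, so the nearest-neighbour selection, the value copying, the Euclidean distance, and every name and parameter below (generate_synthetic, k, min_distance, max_attempts) are assumptions made for illustration only.

```python
import math
import random


def distance(a, b, features):
    """Euclidean distance between two cases over the given numeric features."""
    return math.sqrt(sum((a[f] - b[f]) ** 2 for f in features))


def generate_case(training, features, conditions, k, rng):
    """Build one synthetic case; `conditions` maps feature name -> fixed value."""
    case = dict(conditions)
    k = min(k, len(training))
    for feature in features:
        if feature in case:
            continue  # already determined by a condition
        known = [f for f in features if f in case]
        if known:
            # Assumption: focal cases are the k training cases nearest to the
            # partially built case on the features determined so far.
            focal = sorted(training, key=lambda t: distance(t, case, known))[:k]
        else:
            # Assumption: with nothing determined yet, pick focal cases at random.
            focal = rng.sample(training, k)
        # Assumption: the value is copied from a randomly chosen focal case.
        case[feature] = rng.choice(focal)[feature]
    return case


def too_similar(case, training, features, min_distance):
    """Hypothetical similarity condition: any training case closer than min_distance."""
    return any(distance(case, t, features) < min_distance for t in training)


def generate_synthetic(training, features, n, conditions=None, k=3,
                       min_distance=1e-9, max_attempts=20, seed=0):
    """Request up to n synthetic cases, resampling or dropping near-copies."""
    rng = random.Random(seed)
    conditions = conditions or {}
    synthetic = []
    for _ in range(n):
        for _ in range(max_attempts):
            case = generate_case(training, features, conditions, k, rng)
            if not too_similar(case, training, features, min_distance):
                synthetic.append(case)
                break
        # If every attempt was too similar, the case is dropped (removed).
    return synthetic


training = [
    {"x": 1.0, "y": 2.0},
    {"x": 2.0, "y": 3.0},
    {"x": 3.0, "y": 5.0},
    {"x": 4.0, "y": 4.0},
]
conditioned = generate_synthetic(training, ["x", "y"], n=5, conditions={"x": 2.5}, k=2)
unconditioned = generate_synthetic(training, ["x", "y"], n=10)
```

In this sketch a conditioned request fixes x and fills in only y, while an unconditioned request fills in every feature. Because the result list may be shorter than n when resampling keeps producing near-copies, a caller that needs an exact count would have to request more cases or relax the hypothetical min_distance threshold.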