Identifier Contribution Allocation in Synthetic Data Generation in Computer-Based Reasoning Systems
Techniques for synthetic data generation in computer-based reasoning systems are discussed and include receiving a request for generation of synthetic data based on a set of training data cases. One or more focal training data cases are determined. For undetermined features (either all of them or th...
Saved in:
Main Authors | , , , , |
---|---|
Format | Patent |
Language | English |
Published |
29.05.2025
|
Subjects | |
Online Access | Get full text |
Cover
Summary: | Techniques for synthetic data generation in computer-based reasoning systems are discussed and include receiving a request for generation of synthetic data based on a set of training data cases. One or more focal training data cases are determined. For undetermined features (either all of them or those that are not subject to conditions), a value for the feature is determined based on the focal cases. In some embodiments, the generated synthetic data may be checked for similarity against the training data, and if similarity conditions are met, it may be modified (e.g., resampled), removed, and/or replaced. |
---|---|
Bibliography: | Application Number: US202418967075 |