Who Needs Real Data Anyway? Exploring the Use of Synthetic Data in Economic Evaluations of Health Interventions
Published in | Value in health |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published | United States, Elsevier Inc, 26.06.2025 |
Subjects | |
Summary: | Data needed for economic evaluations in healthcare are often subject to privacy regulations and confidentiality, limiting accessibility. This poses challenges for conducting, reviewing, and validating health economic evaluations. The use of “synthetic data” may solve this problem.
An economic evaluation compared “shamectomy” with “usual care” for the prevention of a fictitious disease called shame. A data set (Dorg) was created, consisting of 1000 patients in the base case. Next, synthetic data (Dsyn) were created from Dorg. Dorg and Dsyn were used, separately, to inform a model-based economic evaluation, and the similarity of the results was assessed for various scenarios: different sizes of Dorg, order of synthetization, method of synthetization, number of synthesized data sets, and missing data.
With standard settings, the incremental cost-effectiveness ratio (ICER) for shamectomy was €25 848 per quality-adjusted life-year in Dorg and on average €25 857 across 500 Dsyns (95% CI €16 776-€60 021). In the base case, 15% of the generated Dsyns produced an ICER that would lead to a positive reimbursement decision, whereas Dorg led to a negative decision. With smaller Dorg data sets (n = 50 and n = 500), ICER ranges increased to −€151 542 (95% CI) and −€669 717 (95% CI), respectively.
Outcomes and conclusions of economic analyses based on synthetic data may deviate from those obtained by using the original data. For data sets of fewer than 1000 patients, which are common, deviations may be substantial and lead to suboptimal policy decisions. We propose a stepwise approach to using synthetic data for model-based health economic evaluations, using a large number of synthetic data sets (ie, >100) with the same size as the original data.
• This simulation study seeks to illustrate the benefits and limitations of using synthetic data in economic evaluations of health interventions.
• Outcomes and conclusions of economic analyses based on synthetic data may deviate from those obtained by using the original data.
• Data synthesizers should generate a large number of synthetic data sets with the same size as the original data. |
---|---|
ISSN: | 1098-3015 1524-4733 |
DOI: | 10.1016/j.jval.2025.06.007 |
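
The comparison described in the summary (one original data set, many synthetic data sets of the same size, and a check of how often the reimbursement decision changes) can be sketched roughly as follows. This is a minimal illustration and not the study's method: the patient-level values, the 500-per-arm split, the €20 000 threshold, and the naive synthesizer that draws costs and QALYs independently from fitted normal distributions are all assumptions made for the example, and every function name is hypothetical.

```python
import random
import statistics

ARMS = ("shamectomy", "usual_care")


def incremental(data):
    """Return (difference in mean cost, difference in mean QALYs), shamectomy minus usual care."""
    means = {}
    for arm in ARMS:
        rows = [r for r in data if r["arm"] == arm]
        means[arm] = (
            statistics.mean([r["cost"] for r in rows]),
            statistics.mean([r["qaly"] for r in rows]),
        )
    d_cost = means["shamectomy"][0] - means["usual_care"][0]
    d_qaly = means["shamectomy"][1] - means["usual_care"][1]
    return d_cost, d_qaly


def icer(data):
    """Incremental cost-effectiveness ratio: extra cost per extra QALY."""
    d_cost, d_qaly = incremental(data)
    return d_cost / d_qaly


def is_cost_effective(data, threshold):
    """Decide via incremental net monetary benefit.

    This agrees with "ICER < threshold" whenever the QALY gain is positive
    and avoids the sign ambiguity of the ratio when it is not.
    """
    d_cost, d_qaly = incremental(data)
    return threshold * d_qaly - d_cost > 0


def make_original(n_per_arm, rng):
    """Hypothetical stand-in for Dorg; all means and SDs are invented."""
    params = {"shamectomy": (12_000, 8.4), "usual_care": (7_000, 8.2)}
    data = []
    for arm in ARMS:
        mean_cost, mean_qaly = params[arm]
        for _ in range(n_per_arm):
            data.append({
                "arm": arm,
                "cost": rng.gauss(mean_cost, 3_000),
                "qaly": rng.gauss(mean_qaly, 1.0),
            })
    return data


def synthesize(data, rng):
    """Naive synthesizer (an assumption): per arm, draw cost and QALY
    independently from normals fitted to the original data, same size."""
    synthetic = []
    for arm in ARMS:
        rows = [r for r in data if r["arm"] == arm]
        costs = [r["cost"] for r in rows]
        qalys = [r["qaly"] for r in rows]
        mc, sc = statistics.mean(costs), statistics.stdev(costs)
        mq, sq = statistics.mean(qalys), statistics.stdev(qalys)
        for _ in rows:
            synthetic.append({
                "arm": arm,
                "cost": rng.gauss(mc, sc),
                "qaly": rng.gauss(mq, sq),
            })
    return synthetic


THRESHOLD = 20_000  # hypothetical willingness to pay per QALY, in euros
rng = random.Random(2025)
original = make_original(500, rng)  # 1000 simulated patients in total
decision_original = is_cost_effective(original, THRESHOLD)

# Many synthetic data sets, each the same size as the original.
synthetic_sets = [synthesize(original, rng) for _ in range(200)]
synthetic_icers = [icer(s) for s in synthetic_sets]
flips = sum(is_cost_effective(s, THRESHOLD) != decision_original
            for s in synthetic_sets)
flip_share = flips / len(synthetic_sets)

print(f"ICER in original data: {icer(original):,.0f}")
print(f"Median ICER across synthetic sets: {statistics.median(synthetic_icers):,.0f}")
print(f"Share of synthetic sets with a different decision: {flip_share:.1%}")
```

Reporting the share of synthetic data sets whose decision differs from the original, rather than a single synthetic ICER, mirrors the summary's recommendation to generate a large number of synthetic data sets; a real analysis would replace the naive synthesizer with a dedicated synthesis method and a full decision model.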