Who Needs Real Data Anyway? Exploring the Use of Synthetic Data in Economic Evaluations of Health Interventions

Bibliographic Details
Published in Value in health
Main Authors van der Linden, N., Pouwels, X.G.L.V., Jahn, B., Siebert, U., Koffijberg, H.
Format Journal Article
Language English
Published United States Elsevier Inc 26.06.2025
Summary: Data needed for economic evaluations in healthcare are often subject to privacy regulations and confidentiality, limiting accessibility. This poses challenges for conducting, reviewing, and validating health economic evaluations. The use of “synthetic data” may solve this problem. An economic evaluation compared “shamectomy” with “usual care” for the prevention of a fictitious disease called shame. A data set (Dorg) was created, consisting of 1000 patients in the base case. Next, synthetic data (Dsyn) were created from Dorg. Dorg and Dsyn were used, separately, to inform a model-based economic evaluation, and the similarity of the results was assessed for various scenarios: different sizes of Dorg, order of synthetization, method of synthetization, number of synthesized data sets, and missing data. With standard settings, the incremental cost-effectiveness ratio (ICER) for shamectomy was €25 848/quality-adjusted life-year in Dorg and on average €25 857 in 500 Dsyns (95% CI €16 776-€60 021). In the base case, 15% of the generated Dsyns resulted in an ICER leading to a positive reimbursement decision, as opposed to a negative decision when using Dorg. With smaller Dorg data sets (n = 50 and n = 500), ICER ranges increased to −€151 542 (95% CI) and −€669 717 (95% CI), respectively. Outcomes and conclusions of economic analyses based on synthetic data may deviate from those obtained by using the original data. For data sets of fewer than 1000 patients, which are common, deviations may be substantial and lead to suboptimal policy decisions. We propose a stepwise approach to using synthetic data for model-based health economic evaluations, using a large number of synthetic data sets (ie, >100) with the same size as the original data.
• This simulation study seeks to illustrate the benefits and limitations of using synthetic data in economic evaluations of health interventions.
• Outcomes and conclusions of economic analyses based on synthetic data may deviate from those obtained by using the original data.
• Data synthesizers should generate a large number of synthetic data sets with the same size as the original data.
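As a rough illustration of the comparison described in the summary, the Python sketch below computes an ICER from patient-level data and counts how often synthetic data sets fall on the other side of a decision threshold than the original data. It is not the authors' method. The data-generating values, the per-arm normal resampling used as a stand-in synthesizer, the threshold of 30 000, and all function names are assumptions made for illustration, and the ICER is taken directly from arm means rather than from a decision model.

```python
import random
import statistics


def increments(rows):
    """Return (incremental cost, incremental QALYs) of arm 1 versus arm 0."""
    def mean_of(key, arm):
        return statistics.fmean([r[key] for r in rows if r["arm"] == arm])
    return (mean_of("cost", 1) - mean_of("cost", 0),
            mean_of("qaly", 1) - mean_of("qaly", 0))


def icer(rows):
    """ICER = incremental cost / incremental QALYs."""
    d_cost, d_qaly = increments(rows)
    return d_cost / d_qaly


def make_original(n, rng):
    """Create a hypothetical original data set (values are NOT from the paper)."""
    rows = []
    for i in range(n):
        arm = i % 2  # alternate patients between the two arms
        rows.append({
            "arm": arm,
            "cost": rng.gauss(10_000 + 5_000 * arm, 2_000),
            "qaly": rng.gauss(5.0 + 0.2 * arm, 0.5),
        })
    return rows


def synthesize(rows, rng):
    """Stand-in synthesizer (an assumption): per arm, fit independent normal
    distributions to cost and QALYs, then draw as many patients as the arm had."""
    synthetic = []
    for arm in (0, 1):
        subset = [r for r in rows if r["arm"] == arm]
        params = {}
        for key in ("cost", "qaly"):
            values = [r[key] for r in subset]
            params[key] = (statistics.fmean(values), statistics.stdev(values))
        for _ in subset:
            synthetic.append({
                "arm": arm,
                "cost": rng.gauss(*params["cost"]),
                "qaly": rng.gauss(*params["qaly"]),
            })
    return synthetic


def flipped_share(original, n_syn, threshold, rng):
    """Share of synthetic data sets whose ICER lies on the other side of the
    threshold than the original ICER. Assumes incremental QALYs stay positive."""
    base = icer(original) < threshold
    flips = sum((icer(synthesize(original, rng)) < threshold) != base
                for _ in range(n_syn))
    return flips / n_syn


rng = random.Random(0)
d_org = make_original(1000, rng)
print("ICER in original data:", round(icer(d_org)))
print("Share of flipped decisions:", flipped_share(d_org, 200, 30_000, rng))
```

Rerunning the sketch with a smaller `n` in `make_original` gives a feel for why the spread of synthetic ICERs widens as the original data set shrinks, although the magnitudes here depend entirely on the assumed values.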
ISSN: 1098-3015, 1524-4733
DOI: 10.1016/j.jval.2025.06.007