Investigating real-world consequences of biases in commonly used clinical calculators

Bibliographic Details
Published in	The American journal of managed care Vol. 29; no. 1; pp. e1 - e7
Main Authors	Yoo, Richard M; Dash, Dev; Lu, Jonathan H; Genkins, Julian Z; Rabbani, Naveed; Fries, Jason A; Shah, Nigam H
Format	Journal Article
Language	English
Published	United States Intellisphere, LLC 01.01.2023 MultiMedia Healthcare Inc
Subjects	Anticoagulants; Anticoagulants - therapeutic use; Atrial Fibrillation - complications; Atrial Fibrillation - drug therapy; Bias; Cardiac arrhythmia; Care and treatment; Clinical practice guidelines; Data analysis; Diabetes; End Stage Liver Disease; Female; Health disparities; Heart failure; Hospitals; Humans; Hypertension; Investigations; Liver; Liver diseases; Liver transplants; Male; Mortality; Patients; Practice guidelines (Medicine); Pulmonary embolism; Pulmonary embolisms; Risk Assessment; Risk Factors; Severity of Illness Index; Stroke
Summary:	This study evaluates whether a single summary metric of calculator performance sufficiently conveys equity across demographic subgroups, and how calculator predictive performance affects downstream health outcomes. We evaluated 3 commonly used clinical calculators, the Model for End-Stage Liver Disease (MELD), CHA2DS2-VASc, and the simplified Pulmonary Embolism Severity Index (sPESI), on a cohort extracted from the Stanford Medicine Research Data Repository, following the cohort selection process described in the respective calculator derivation papers. We quantified the predictive performance of the 3 calculators across sex and race. Then, using the clinical guidelines that direct care based on these calculators' output, we quantified potential disparities in subsequent health outcomes. Across the examined subgroups, the MELD calculator performed worse for the female and White populations, CHA2DS2-VASc for the male population, and sPESI for the Black population. The extent to which these performance differences translated into differential health outcomes depended on the distribution of the calculators' scores around the thresholds that trigger a care action under the corresponding guidelines. In particular, under the old guideline for CHA2DS2-VASc, among those who would not have been offered anticoagulant therapy, the Hispanic subgroup exhibited the highest rate of stroke. Clinical calculators, even when they do not include variables such as sex and race as inputs, can have very different care consequences across those subgroups. These differences in health care outcomes across subgroups can be explained by examining the distribution of scores and their calibration around the thresholds encoded in the accompanying care guidelines.
ISSN:	1088-0224 1936-2692
DOI:	10.37765/ajmc.2023.89306
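
The kind of subgroup analysis described in the summary can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the toy records, the group labels "A" and "B", and the threshold of 2 are all hypothetical, and AUROC is used here only as one plausible measure of predictive performance, since this record does not state which metrics the study used. The sketch reports, per subgroup, a discrimination metric and the event rate among patients whose score falls below a guideline threshold (those who would not be offered the care action).

```python
from collections import defaultdict


def auroc(scores, outcomes):
    """Chance that a random case with the event outscores a random case
    without it (ties count as 0.5). Returns None if a class is missing."""
    pos = [s for s, y in zip(scores, outcomes) if y == 1]
    neg = [s for s, y in zip(scores, outcomes) if y == 0]
    if not pos or not neg:
        return None
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def subgroup_report(records, threshold):
    """records: iterable of (group, score, outcome) with outcome in {0, 1}.
    Returns per-group AUROC and the event rate among scores below threshold."""
    by_group = defaultdict(list)
    for group, score, outcome in records:
        by_group[group].append((score, outcome))

    report = {}
    for group, rows in by_group.items():
        scores = [s for s, _ in rows]
        outcomes = [y for _, y in rows]
        # Patients below the threshold would not trigger the care action.
        below = [y for s, y in rows if s < threshold]
        report[group] = {
            "n": len(rows),
            "auroc": auroc(scores, outcomes),
            "n_below_threshold": len(below),
            "event_rate_below_threshold": sum(below) / len(below) if below else None,
        }
    return report


# Hypothetical toy data: (subgroup, calculator score, observed event).
records = [
    ("A", 0, 0), ("A", 1, 0), ("A", 1, 1), ("A", 3, 1),
    ("B", 0, 0), ("B", 1, 1), ("B", 1, 1), ("B", 4, 0),
]
report = subgroup_report(records, threshold=2)

for group, stats in sorted(report.items()):
    print(group, stats)
```

In this toy data the two groups have the same number of patients below the threshold but different event rates among them, which is the pattern the summary points to: the consequences of a calculator depend on how scores and outcomes are distributed around the guideline cutoff, not only on an overall performance metric.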