Investigating real-world consequences of biases in commonly used clinical calculators

To evaluate whether one summary metric of calculator performance sufficiently conveys equity across different demographic subgroups, as well as to evaluate how calculator predictive performance affects downstream health outcomes. We evaluate 3 commonly used clinical calculators-Model for End-Stage L...

Full description

Saved in:
Bibliographic Details
Published inThe American journal of managed care Vol. 29; no. 1; pp. e1 - e7
Main Authors Yoo, Richard M, Dash, Dev, Lu, Jonathan H, Genkins, Julian Z, Rabbani, Naveed, Fries, Jason A, Shah, Nigam H
Format Journal Article
LanguageEnglish
Published United States Intellisphere, LLC 01.01.2023
MultiMedia Healthcare Inc
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:To evaluate whether one summary metric of calculator performance sufficiently conveys equity across different demographic subgroups, as well as to evaluate how calculator predictive performance affects downstream health outcomes. We evaluate 3 commonly used clinical calculators-Model for End-Stage Liver Disease (MELD), CHA2DS2-VASc, and simplified Pulmonary Embolism Severity Index (sPESI)-on the cohort extracted from the Stanford Medicine Research Data Repository, following the cohort selection process as described in respective calculator derivation papers. We quantified the predictive performance of the 3 clinical calculators across sex and race. Then, using the clinical guidelines that guide care based on these calculators' output, we quantified potential disparities in subsequent health outcomes. Across the examined subgroups, the MELD calculator exhibited worse performance for female and White populations, CHA2DS2-VASc calculator for the male population, and sPESI for the Black population. The extent to which such performance differences translated into differential health outcomes depended on the distribution of the calculators' scores around the thresholds used to trigger a care action via the corresponding guidelines. In particular, under the old guideline for CHA2DS2-VASc, among those who would not have been offered anticoagulant therapy, the Hispanic subgroup exhibited the highest rate of stroke. Clinical calculators, even when they do not include variables such as sex and race as inputs, can have very different care consequences across those subgroups. These differences in health care outcomes across subgroups can be explained by examining the distribution of scores and their calibration around the thresholds encoded in the accompanying care guidelines.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1088-0224
1936-2692
DOI:10.37765/ajmc.2023.89306