Abstract 12739: Evaluation of Quantitative Decision-Making for Rhythm-Management of Atrial Fibrillation Using Tabular Q-Learning
IntroductionRhythm-management is among the most complex decisions for patients diagnosed with atrial fibrillation (AF). While clinical trials have identified subsets of patients who might have a benefit from a given rhythm-management strategy, for individual patients it is not always clear which str...
Saved in:
Published in | Circulation (New York, N.Y.) Vol. 146; no. Suppl_1; p. A12739 |
---|---|
Main Authors | , , , , , , , , , , , , , , |
Format | Journal Article |
Language | English |
Published |
Lippincott Williams & Wilkins
08.11.2022
|
Online Access | Get full text |
Cover
Loading…
Summary: | IntroductionRhythm-management is among the most complex decisions for patients diagnosed with atrial fibrillation (AF). While clinical trials have identified subsets of patients who might have a benefit from a given rhythm-management strategy, for individual patients it is not always clear which strategy is expected to have the greatest mortality benefit or durability. HypothesisCombined application of unsupervised machine learning and reinforcement learning can assist with optimizing individualized treatment strategies for patients with AF. MethodsIn this investigation of over 50K subjects with a new AF diagnosis between the dates of October 11, 2010 and October 26, 2020, we examined the application of a type of artificial intelligence called tabular Q-learning to identify the optimal initial rhythm-management strategy, based on a composite outcome of mortality, change in treatment, and sustainability of the given treatment, termed the reward function. ResultsWe first applied an unsupervised learning algorithm using a variational autoencoder with K-means clustering to cluster AF patients into 8 distinct phenotypes, which matched what has been observed in clinical practice. We then fit a Q-learning algorithm to predict the best outcome for each cluster, in which we found that while a rate-control strategy was most frequently selected by treating providers, the outcome was superior for rhythm-control strategies across all cluster groups, with those subjects in whom provider-selected treatment matched what the Q table recommended having fewer total deaths [4 (8.5%) vs 473 (22.4%), Odds ratio = 0.32, P = 0.02] and a greater reward [-0.0115±0.26 vs -0.392±2.06, P = 4.8x10-6]. We then demonstrated application of dynamic learning by updating the Q table prospectively using batch gradient descent, in which the optimal strategy in some clusters changed from cardioversion to ablation, although the reward was greatest when the largest amount of data was used for model fitting. ConclusionsWe found that tabular Q-learning provides a dynamic and interpretable approach to apply artificial intelligence to clinical decision making for AF. Further work is needed to examine application of Q-learning prospectively in clinical patients. |
---|---|
ISSN: | 0009-7322 1524-4539 |
DOI: | 10.1161/circ.146.suppl_1.12739 |