Evaluation of Quantitative Decision‐Making for Rhythm Management of Atrial Fibrillation Using Tabular Q‐Learning

Bibliographic Details
Published in	Journal of the American Heart Association Vol. 12; no. 9; p. e028483
Main Authors	Barrett, Christopher D.; Suzuki, Yuto; Hussein, Sundos; Garg, Lohit; Tumolo, Alexis; Sandhu, Amneet; West, John J.; Zipse, Matthew; Aleong, Ryan; Varosy, Paul; Tzou, Wendy S.; Banaei‐Kashani, Farnoush; Rosenberg, Michael A.
Format	Journal Article
Language	English
Published	England: John Wiley and Sons Inc (Wiley), 02.05.2023
Subjects	Anti-Arrhythmia Agents - therapeutic use; Artificial Intelligence; Atrial Fibrillation; Atrial Fibrillation - drug therapy; Atrial Fibrillation - therapy; Electric Countershock; Humans; Original Research; Q‐learning; rate control; reinforcement learning; Retrospective Studies; rhythm control; unsupervised learning
Summary:	Background: Rhythm management is a complex decision for patients with atrial fibrillation (AF). Although clinical trials have identified subsets of patients who might benefit from a given rhythm-management strategy, for individual patients it is not always clear which strategy is expected to have the greatest mortality benefit or durability. Methods and Results: In this investigation, 52 547 patients with a new atrial fibrillation diagnosis between 2010 and 2020 were retrospectively identified. We applied a type of artificial intelligence called tabular Q-learning to identify the optimal initial rhythm-management strategy, based on a composite outcome of mortality, change in treatment, and sustainability of the given treatment, termed the reward function. We first applied an unsupervised learning algorithm using a variational autoencoder with K-means clustering to cluster atrial fibrillation patients into 8 distinct phenotypes. We then fit a Q-learning algorithm to predict the best outcome for each cluster. Although a rate-control strategy was most frequently selected by treating providers, the outcome was superior for rhythm-control strategies across all clusters. Subjects in whom the provider-selected treatment matched the Q-table recommendation had fewer total deaths (4 [8.5%] versus 473 [22.4%], odds ratio=0.32, P=0.02) and a greater reward (P=4.8×10 ). We then demonstrated application of dynamic learning by updating the Q-table prospectively using batch gradient descent, in which the optimal strategy in some clusters changed from cardioversion to ablation. Conclusions: Tabular Q-learning provides a dynamic and interpretable approach to applying artificial intelligence to clinical decision-making for atrial fibrillation. Further work is needed to examine the application of Q-learning prospectively in clinical patients.
Bibliography:	This article was sent to Wei‐Qi Wei, MD, PhD, Guest Editor, for review by expert referees, editorial decision, and final disposition. For Sources of Funding and Disclosures, see page 13. Supplemental Material is available at https://www.ahajournals.org/doi/suppl/10.1161/JAHA.122.028483
ISSN:	2047-9980
DOI:	10.1161/JAHA.122.028483
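
Illustrative note: the summary describes a Q-table indexed by patient cluster and treatment strategy that is updated in batches as new outcomes arrive. The Python sketch below shows one way such a table could be maintained. It is not the authors' implementation: the action names, the learning rate `alpha`, the helper names `update_q_table` and `recommend`, the synthetic rewards, and the single-step setting (no next-state term, so each update simply moves Q toward the mean observed reward) are all assumptions made for illustration.

```python
import numpy as np

N_CLUSTERS = 8  # number of phenotypes reported in the summary
# Hypothetical action set; the record does not list the exact strategies modeled.
ACTIONS = ["rate_control", "antiarrhythmic_drug", "cardioversion", "ablation"]


def update_q_table(q, clusters, actions, rewards, alpha=0.1):
    """Return a copy of q after one batch update.

    Assumes a single-step decision (initial strategy only), so the target
    for each (cluster, action) cell is the mean reward observed in the batch.
    """
    q = q.copy()
    for s in range(q.shape[0]):
        for a in range(q.shape[1]):
            mask = (clusters == s) & (actions == a)
            if mask.any():
                # Move the cell a fraction alpha toward the batch mean reward.
                q[s, a] += alpha * (rewards[mask].mean() - q[s, a])
    return q


def recommend(q):
    """Index of the highest-valued action for each cluster."""
    return q.argmax(axis=1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    q = np.zeros((N_CLUSTERS, len(ACTIONS)))
    # Synthetic batches only; these are not data from the study.
    for _ in range(50):
        clusters = rng.integers(0, N_CLUSTERS, size=200)
        actions = rng.integers(0, len(ACTIONS), size=200)
        rewards = rng.normal(loc=0.1 * actions, scale=1.0)
        q = update_q_table(q, clusters, actions, rewards)
    for s, a in enumerate(recommend(q)):
        print(f"cluster {s}: {ACTIONS[a]}")
```

Because each batch only nudges the affected cells, the recommended strategy for a cluster can change as later batches arrive, which is the kind of shift the summary reports (from cardioversion to ablation in some clusters).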