The Biologic Basis of Clinical Heterogeneity in Juvenile Idiopathic Arthritis

Objective Childhood arthritis encompasses a heterogeneous family of diseases. Significant variation in clinical presentation remains despite consensus‐driven diagnostic classifications. Developments in data analysis provide powerful tools for interrogating large heterogeneous data sets. We report a...

Full description

Saved in:
Bibliographic Details
Published inArthritis & rheumatology (Hoboken, N.J.) Vol. 66; no. 12; pp. 3463 - 3475
Main Authors Eng, Simon W. M., Duong, Trang T., Rosenberg, Alan M., Morris, Quaid, Yeung, Rae S. M., Dancey, Paul, Huber, Adam, Lang, Bianca, Ramsey, Suzanne, Stringer, Elizabeth, Chetaille, Anne‐Laure, Guillet, Chantal, Boire, Gilles, Campillo, Sarah, Chédeville, Gaëlle, Duffy, Ciáran, Duffy, Karen, Gibbon, Michele, Scuccimarri, Rosie, Jurencak, Roman, Roth, Johannes, Benseler, Susanne, Cameron, Bonnie, Feldman, Brian, Laxer, Ronald, Schneider, Rayfel, Silverman, Earl, Spiegel, Lynn, Tse, Shirley, Oen, Kiem, Baxter‐Jones, Adam, Maenz, Lynn, Matheson, Loren, Rezaei, Elham, Ellsworth, Janet, LeBlanc, Claire, Cabral, David, Guzman, Jaime, Houghton, Kristin, Petty, Ross, Tucker, Lori, Turvey, Stuart
Format Journal Article
LanguageEnglish
Published United States Wiley Subscription Services, Inc 01.12.2014
BlackWell Publishing Ltd
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Objective Childhood arthritis encompasses a heterogeneous family of diseases. Significant variation in clinical presentation remains despite consensus‐driven diagnostic classifications. Developments in data analysis provide powerful tools for interrogating large heterogeneous data sets. We report a novel approach to integrating biologic and clinical data toward a new classification for childhood arthritis, using computational biology for data‐driven pattern recognition. Methods Probabilistic principal components analysis was used to transform a large set of data into 4 interpretable indicators or composite variables on which patients were grouped by cluster analysis. Sensitivity analysis was conducted to determine key variables in determining indicators and cluster assignment. Results were validated against an independent validation cohort. Results Meaningful biologic and clinical characteristics, including levels of proinflammatory cytokines and measures of disease activity, defined axes/indicators that identified homogeneous patient subgroups by cluster analysis. The new patient classifications resolved major differences between patient subpopulations better than International League of Associations for Rheumatology subtypes. Fourteen variables were identified by sensitivity analysis to crucially determine indicators and clusters. This new schema was conserved in an independent validation cohort. Conclusion Data‐driven unsupervised machine learning is a powerful approach for interrogating clinical and biologic data toward disease classification, providing insight into the biology underlying clinical heterogeneity in childhood arthritis. Our analytical framework enabled the recovery of unique patterns from small cohorts and addresses a major challenge, patient numbers, in studying rare diseases.
Bibliography:Dr. Yeung holds the Hak‐Ming and Deborah Chiu Chair in Paediatric Translational Research at The Hospital for Sick Children, University of Toronto, Toronto, Ontario, Canada.
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:2326-5191
2326-5205
DOI:10.1002/art.38875