A Cancer Biologist's Primer on Machine Learning Applications in High‐Dimensional Cytometry
The application of machine learning and artificial intelligence to high‐dimensional cytometry data sets has increasingly become a staple of bioinformatic data analysis over the past decade. This is especially true in the field of cancer biology, where protocols for collecting multiparameter single‐c...
Saved in:
Published in | Cytometry. Part A Vol. 97; no. 8; pp. 782 - 799 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
Hoboken, USA
John Wiley & Sons, Inc
01.08.2020
Wiley Subscription Services, Inc |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | The application of machine learning and artificial intelligence to high‐dimensional cytometry data sets has increasingly become a staple of bioinformatic data analysis over the past decade. This is especially true in the field of cancer biology, where protocols for collecting multiparameter single‐cell data in a high‐throughput fashion are rapidly developed. As the use of machine learning methodology in cytometry becomes increasingly common, there is a need for cancer biologists to understand the basic theory and applications of a variety of algorithmic tools for analyzing and interpreting cytometry data. We introduce the reader to several keystone machine learning‐based analytic approaches with an emphasis on defining key terms and introducing a conceptual framework for making translational or clinically relevant discoveries. The target audience consists of cancer cell biologists and physician‐scientists interested in applying these tools to their own data, but who may have limited training in bioinformatics. © 2020 International Society for Advancement of Cytometry
In this review, we introduce the reader to the machine learning algorithms most commonly applied to single‐cell data in cancer biology. Without assuming any in‐depth prior knowledge of bioinformatics, we discuss multiple state‐of‐the‐art algorithms for dimensionality reduction, clustering, and predictive modeling as they apply to the study of cancer. After reading this review, readers should be able to understand the main strengths and weaknesses of each of the algorithms that we discuss as well as how such analyses can be applied to their own clinical or experimental data sets. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 ObjectType-Review-3 content type line 23 |
ISSN: | 1552-4922 1552-4930 1552-4930 |
DOI: | 10.1002/cyto.a.24158 |