A Cancer Biologist's Primer on Machine Learning Applications in High‐Dimensional Cytometry

The application of machine learning and artificial intelligence to high‐dimensional cytometry data sets has increasingly become a staple of bioinformatic data analysis over the past decade. This is especially true in the field of cancer biology, where protocols for collecting multiparameter single‐c...

Full description

Saved in:

Bibliographic Details
Published in	Cytometry. Part A Vol. 97; no. 8; pp. 782 - 799
Main Authors	Keyes, Timothy J., Domizi, Pablo, Lo, Yu‐Chen, Nolan, Garry P., Davis, Kara L.
Format	Journal Article
Language	English
Published	Hoboken, USA John Wiley & Sons, Inc 01.08.2020 Wiley Subscription Services, Inc
Subjects	Artificial intelligence Bioinformatics Biologists Cancer computational cytometry Cytometry Data analysis Data science Learning algorithms Machine learning mass cytometry cancer computational cytometry mass cytometry data science machine learning
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The application of machine learning and artificial intelligence to high‐dimensional cytometry data sets has increasingly become a staple of bioinformatic data analysis over the past decade. This is especially true in the field of cancer biology, where protocols for collecting multiparameter single‐cell data in a high‐throughput fashion are rapidly developed. As the use of machine learning methodology in cytometry becomes increasingly common, there is a need for cancer biologists to understand the basic theory and applications of a variety of algorithmic tools for analyzing and interpreting cytometry data. We introduce the reader to several keystone machine learning‐based analytic approaches with an emphasis on defining key terms and introducing a conceptual framework for making translational or clinically relevant discoveries. The target audience consists of cancer cell biologists and physician‐scientists interested in applying these tools to their own data, but who may have limited training in bioinformatics. © 2020 International Society for Advancement of Cytometry In this review, we introduce the reader to the machine learning algorithms most commonly applied to single‐cell data in cancer biology. Without assuming any in‐depth prior knowledge of bioinformatics, we discuss multiple state‐of‐the‐art algorithms for dimensionality reduction, clustering, and predictive modeling as they apply to the study of cancer. After reading this review, readers should be able to understand the main strengths and weaknesses of each of the algorithms that we discuss as well as how such analyses can be applied to their own clinical or experimental data sets.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 ObjectType-Review-3 content type line 23
ISSN:	1552-4922 1552-4930 1552-4930
DOI:	10.1002/cyto.a.24158