Determinants of Base Editing Outcomes from Target Library Analysis and Machine Learning

Although base editors are widely used to install targeted point mutations, the factors that determine base editing outcomes are not well understood. We characterized sequence-activity relationships of 11 cytosine and adenine base editors (CBEs and ABEs) on 38,538 genomically integrated targets in ma...

Full description

Saved in:
Bibliographic Details
Published inCell Vol. 182; no. 2; pp. 463 - 480.e30
Main Authors Arbab, Mandana, Shen, Max W., Mok, Beverly, Wilson, Christopher, Matuszek, Żaneta, Cassa, Christopher A., Liu, David R.
Format Journal Article
LanguageEnglish
Published United States Elsevier Inc 23.07.2020
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Although base editors are widely used to install targeted point mutations, the factors that determine base editing outcomes are not well understood. We characterized sequence-activity relationships of 11 cytosine and adenine base editors (CBEs and ABEs) on 38,538 genomically integrated targets in mammalian cells and used the resulting outcomes to train BE-Hive, a machine learning model that accurately predicts base editing genotypic outcomes (R ≈ 0.9) and efficiency (R ≈ 0.7). We corrected 3,388 disease-associated SNVs with ≥90% precision, including 675 alleles with bystander nucleotides that BE-Hive correctly predicted would not be edited. We discovered determinants of previously unpredictable C-to-G, or C-to-A editing and used these discoveries to correct coding sequences of 174 pathogenic transversion SNVs with ≥90% precision. Finally, we used insights from BE-Hive to engineer novel CBE variants that modulate editing outcomes. These discoveries illuminate base editing, enable editing at previously intractable targets, and provide new base editors with improved editing capabilities. [Display omitted] •Base editing outcome precision and efficiency are frequently unintuitive•Machine learning model (BE-Hive) accurately predicts base editing efficiency and editing patterns•Base editor engineering can increase and reduce aberrant transversion editing•We precisely correct 3,388 pathogenic SNVs, many previously considered intractable A comprehensive look at CRISPR base editing efficiencies and outcomes across target sequences, cell lines, and base editing effectors yields machine learning models and a web-based tool for users to predict the editing efficiency, bystander edits, and the best base editor to use for a sequence of interest.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Conceptualization, M.A., M.W.S. and D.R.L.; Methodology, M.A. and M.W.S.; Software, M.W.S.; Validation, M.A., B.M. and Z.M.; Formal analysis, M.W.S.; Investigation, M.A.; Resources, C.W. and c. A.C; Data Curation, M.W.S.; Writing - Original Draft, M.A., M.W.S. and D.R.L.; Writing - Review and Editing, M.A., M.W.S. and D.R.L.; Visualization, M.A., M.W.S., and C.W.; Supervision, D.R.L.; Project Administration, M.A., M.W.S. and D.R.L.; Funding Acquisition, D.R.L.
Lead contact: David R. Liu (drliu@fas.harvard.edu)
Author Contributions
These authors contributed equally
ISSN:0092-8674
1097-4172
1097-4172
DOI:10.1016/j.cell.2020.05.037