Predicting the mutational drivers of future SARS-CoV-2 variants of concern

SARS-CoV-2 evolution threatens vaccine- and natural infection-derived immunity as well as the efficacy of therapeutic antibodies. To improve public health preparedness, we sought to predict which existing amino acid mutations in SARS-CoV-2 might contribute to future variants of concern. We tested th...

Full description

Saved in:
Bibliographic Details
Published inScience translational medicine Vol. 14; no. 633; p. eabk3445
Main Authors Maher, M Cyrus, Bartha, Istvan, Weaver, Steven, di Iulio, Julia, Ferri, Elena, Soriaga, Leah, Lempp, Florian A, Hie, Brian L, Bryson, Bryan, Berger, Bonnie, Robertson, David L, Snell, Gyorgy, Corti, Davide, Virgin, Herbert W, Kosakovsky Pond, Sergei L, Telenti, Amalio
Format Journal Article
LanguageEnglish
Published United States 23.02.2022
Subjects
Online AccessGet more information

Cover

Loading…
Abstract SARS-CoV-2 evolution threatens vaccine- and natural infection-derived immunity as well as the efficacy of therapeutic antibodies. To improve public health preparedness, we sought to predict which existing amino acid mutations in SARS-CoV-2 might contribute to future variants of concern. We tested the predictive value of features comprising epidemiology, evolution, immunology, and neural network-based protein sequence modeling, and identified primary biological drivers of SARS-CoV-2 intra-pandemic evolution. We found evidence that ACE2-mediated transmissibility and resistance to population-level host immunity has waxed and waned as a primary driver of SARS-CoV-2 evolution over time. We retroactively identified with high accuracy (area under the receiver operator characteristic curve, AUROC=0.92-0.97) mutations that will spread, at up to four months in advance, across different phases of the pandemic. The behavior of the model was consistent with a plausible causal structure wherein epidemiological covariates combine the effects of diverse and shifting drivers of viral fitness. We applied our model to forecast mutations that will spread in the future and characterize how these mutations affect the binding of therapeutic antibodies. These findings demonstrate that it is possible to forecast the driver mutations that could appear in emerging SARS-CoV-2 variants of concern. We validate this result against Omicron, showing elevated predictive scores for its component mutations prior to emergence, and rapid score increase across daily forecasts during emergence. This modeling approach may be applied to any rapidly evolving pathogens with sufficiently dense genomic surveillance data, such as influenza, and unknown future pandemic viruses.
AbstractList SARS-CoV-2 evolution threatens vaccine- and natural infection-derived immunity as well as the efficacy of therapeutic antibodies. To improve public health preparedness, we sought to predict which existing amino acid mutations in SARS-CoV-2 might contribute to future variants of concern. We tested the predictive value of features comprising epidemiology, evolution, immunology, and neural network-based protein sequence modeling, and identified primary biological drivers of SARS-CoV-2 intra-pandemic evolution. We found evidence that ACE2-mediated transmissibility and resistance to population-level host immunity has waxed and waned as a primary driver of SARS-CoV-2 evolution over time. We retroactively identified with high accuracy (area under the receiver operator characteristic curve, AUROC=0.92-0.97) mutations that will spread, at up to four months in advance, across different phases of the pandemic. The behavior of the model was consistent with a plausible causal structure wherein epidemiological covariates combine the effects of diverse and shifting drivers of viral fitness. We applied our model to forecast mutations that will spread in the future and characterize how these mutations affect the binding of therapeutic antibodies. These findings demonstrate that it is possible to forecast the driver mutations that could appear in emerging SARS-CoV-2 variants of concern. We validate this result against Omicron, showing elevated predictive scores for its component mutations prior to emergence, and rapid score increase across daily forecasts during emergence. This modeling approach may be applied to any rapidly evolving pathogens with sufficiently dense genomic surveillance data, such as influenza, and unknown future pandemic viruses.
Author Snell, Gyorgy
Soriaga, Leah
Robertson, David L
Hie, Brian L
Telenti, Amalio
Bartha, Istvan
Ferri, Elena
Virgin, Herbert W
Weaver, Steven
Bryson, Bryan
Maher, M Cyrus
Corti, Davide
di Iulio, Julia
Lempp, Florian A
Kosakovsky Pond, Sergei L
Berger, Bonnie
Author_xml – sequence: 1
  givenname: M Cyrus
  orcidid: 0000-0002-6666-3574
  surname: Maher
  fullname: Maher, M Cyrus
  organization: Vir Biotechnology, San Francisco, CA 94158, USA
– sequence: 2
  givenname: Istvan
  orcidid: 0000-0003-3360-1999
  surname: Bartha
  fullname: Bartha, Istvan
  organization: Vir Biotechnology, San Francisco, CA 94158, USA
– sequence: 3
  givenname: Steven
  orcidid: 0000-0002-6931-7191
  surname: Weaver
  fullname: Weaver, Steven
  organization: Department of Biology, Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA 19122, USA
– sequence: 4
  givenname: Julia
  orcidid: 0000-0001-9343-127X
  surname: di Iulio
  fullname: di Iulio, Julia
  organization: Vir Biotechnology, San Francisco, CA 94158, USA
– sequence: 5
  givenname: Elena
  orcidid: 0000-0002-8672-4733
  surname: Ferri
  fullname: Ferri, Elena
  organization: Vir Biotechnology, San Francisco, CA 94158, USA
– sequence: 6
  givenname: Leah
  orcidid: 0000-0001-5852-741X
  surname: Soriaga
  fullname: Soriaga, Leah
  organization: Vir Biotechnology, San Francisco, CA 94158, USA
– sequence: 7
  givenname: Florian A
  orcidid: 0000-0001-6103-8078
  surname: Lempp
  fullname: Lempp, Florian A
  organization: Vir Biotechnology, San Francisco, CA 94158, USA
– sequence: 8
  givenname: Brian L
  orcidid: 0000-0003-3224-8142
  surname: Hie
  fullname: Hie, Brian L
  organization: Ragon Institute of MGH, MIT and Harvard, Cambridge, MA 02139, USA
– sequence: 9
  givenname: Bryan
  orcidid: 0000-0003-1716-6712
  surname: Bryson
  fullname: Bryson, Bryan
  organization: Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
– sequence: 10
  givenname: Bonnie
  orcidid: 0000-0002-2724-7228
  surname: Berger
  fullname: Berger, Bonnie
  organization: Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
– sequence: 11
  givenname: David L
  orcidid: 0000-0001-6338-0221
  surname: Robertson
  fullname: Robertson, David L
  organization: MRC-University of Glasgow Centre for Virus Research, University of Glasgow, Glasgow GS1 1QH, UK
– sequence: 12
  givenname: Gyorgy
  orcidid: 0000-0003-1475-659X
  surname: Snell
  fullname: Snell, Gyorgy
  organization: Vir Biotechnology, San Francisco, CA 94158, USA
– sequence: 13
  givenname: Davide
  orcidid: 0000-0002-5797-1364
  surname: Corti
  fullname: Corti, Davide
  organization: Vir Biotechnology, San Francisco, CA 94158, USA
– sequence: 14
  givenname: Herbert W
  orcidid: 0000-0001-8580-7628
  surname: Virgin
  fullname: Virgin, Herbert W
  organization: Department of Internal Medicine, UT Southwestern Medical Center, Dallas, TX 75390, USA
– sequence: 15
  givenname: Sergei L
  orcidid: 0000-0003-4817-4029
  surname: Kosakovsky Pond
  fullname: Kosakovsky Pond, Sergei L
  organization: Department of Biology, Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA 19122, USA
– sequence: 16
  givenname: Amalio
  orcidid: 0000-0001-6290-7677
  surname: Telenti
  fullname: Telenti, Amalio
  organization: Vir Biotechnology, San Francisco, CA 94158, USA
BackLink https://www.ncbi.nlm.nih.gov/pubmed/35014856$$D View this record in MEDLINE/PubMed
BookMark eNo1j9lKAzEUQIModtE_EMkPTM02dzKPZXCloFj1tWTVaCdTkkzBv1dcns7DgQNnhg7jEB1CZ5QsKGVwkU0oScW87Z1dKP3BhagP0JS2Aipggk3QLOd3QkDyGo7RhNeEClnDFN09JGeDKSG-4vLmcD8WVcIQ1RbbFPYuZTx47McyJofXy8d11Q0vFcN7lYKK5ceaIRqX4gk68mqb3ekf5-j56vKpu6lW99e33XJVGcGhVJKBptRKJ9uWOuC2AW-I5oS3WgoriWeNbmtwIIgxQjUNtcIzaKgTnCnD5uj8t7sb9ffvZpdCr9Ln5n-KfQE0KVFI
CitedBy_id crossref_primary_10_1016_j_xcrp_2023_101346
crossref_primary_10_3390_bios12110978
crossref_primary_10_1080_08839514_2025_2476237
crossref_primary_10_1016_j_aca_2023_341531
crossref_primary_10_1080_13669877_2022_2077411
crossref_primary_10_1038_s42003_024_07262_7
crossref_primary_10_1371_journal_pone_0275623
crossref_primary_10_1002_advs_202405058
crossref_primary_10_1016_j_plrev_2024_10_011
crossref_primary_10_1146_annurev_med_042921_020956
crossref_primary_10_3390_ijms232314683
crossref_primary_10_1038_s41467_024_47304_6
crossref_primary_10_1177_10943420231201154
crossref_primary_10_1007_s10489_024_06223_1
crossref_primary_10_1038_s41467_023_39199_6
crossref_primary_10_1093_ve_veae067
crossref_primary_10_3389_fpubh_2023_1015969
crossref_primary_10_1093_bib_bbae191
crossref_primary_10_1038_s41576_023_00623_8
crossref_primary_10_3390_math13010167
crossref_primary_10_1038_s41467_024_55593_0
crossref_primary_10_1038_s41598_023_42015_2
crossref_primary_10_1016_j_scitotenv_2023_167971
crossref_primary_10_2196_36860
crossref_primary_10_1016_j_csbj_2025_03_010
crossref_primary_10_1371_journal_pcbi_1012215
crossref_primary_10_3390_v14102301
crossref_primary_10_1080_07391102_2022_2095305
crossref_primary_10_1038_s41467_024_55461_x
crossref_primary_10_3389_fsysb_2024_1284668
crossref_primary_10_3389_fimmu_2022_1024924
crossref_primary_10_1093_bib_bbac190
crossref_primary_10_1371_journal_ppat_1010951
crossref_primary_10_3390_jcm11164723
crossref_primary_10_1016_j_isci_2023_106940
crossref_primary_10_1136_bmjhci_2022_100643
crossref_primary_10_3748_wjg_v29_i14_2064
crossref_primary_10_1016_j_cell_2023_02_001
crossref_primary_10_1155_2022_2109641
crossref_primary_10_1128_spectrum_03291_23
crossref_primary_10_1016_j_bsheal_2023_01_002
crossref_primary_10_1038_s41586_024_08564_w
crossref_primary_10_3390_v14112532
crossref_primary_10_1038_s41392_024_02066_x
crossref_primary_10_3389_fimmu_2023_1142394
crossref_primary_10_1016_j_bios_2024_117015
crossref_primary_10_1111_eva_13513
crossref_primary_10_1177_0306624X241288967
crossref_primary_10_1038_s41587_023_01763_2
crossref_primary_10_1038_s44318_024_00193_3
crossref_primary_10_1371_journal_pcbi_1011790
crossref_primary_10_1016_j_ebiom_2023_104939
crossref_primary_10_7554_eLife_92063
crossref_primary_10_1016_j_jmgm_2022_108379
crossref_primary_10_1016_j_patter_2023_100800
crossref_primary_10_1002_med4_74
crossref_primary_10_1128_spectrum_02269_23
crossref_primary_10_3390_v14030519
crossref_primary_10_1038_s41591_023_02483_5
crossref_primary_10_1038_s41591_022_01836_w
crossref_primary_10_1002_prot_26663
crossref_primary_10_3390_biology12091267
crossref_primary_10_1093_immadv_ltad005
crossref_primary_10_1007_s00249_022_01619_8
crossref_primary_10_1093_ve_vead055
crossref_primary_10_3389_fviro_2022_942555
crossref_primary_10_1002_jmv_29123
crossref_primary_10_37349_ei_2024_00126
crossref_primary_10_1016_j_cels_2022_01_003
crossref_primary_10_1016_j_bbadis_2023_166836
crossref_primary_10_3389_froh_2022_1001790
crossref_primary_10_1016_j_compbiomed_2022_106264
crossref_primary_10_7554_eLife_92063_3
crossref_primary_10_1038_s41467_023_37697_1
crossref_primary_10_1001_jamanetworkopen_2023_0191
crossref_primary_10_1038_s41467_024_44803_4
crossref_primary_10_1038_s41467_022_34439_7
crossref_primary_10_1038_s41598_023_48186_2
crossref_primary_10_1016_j_heliyon_2023_e13646
crossref_primary_10_1093_bib_bbae218
crossref_primary_10_3390_vetsci10010006
crossref_primary_10_1038_s41579_023_00878_2
crossref_primary_10_1002_rmv_2351
crossref_primary_10_1016_j_gene_2024_148426
crossref_primary_10_1017_sus_2024_5
crossref_primary_10_1038_s41598_023_30800_y
crossref_primary_10_3390_e25101463
crossref_primary_10_1364_BOE_495685
crossref_primary_10_1371_journal_ppat_1010994
crossref_primary_10_3390_math12172681
crossref_primary_10_52601_bpr_2024_240902
crossref_primary_10_1038_s41537_025_00585_2
crossref_primary_10_3389_fbinf_2022_1020189
crossref_primary_10_3390_v14122681
crossref_primary_10_1101_cshperspect_a041390
crossref_primary_10_1007_s11010_022_04588_w
crossref_primary_10_59717_j_xinn_med_2024_100091
ContentType Journal Article
DBID CGR
CUY
CVF
ECM
EIF
NPM
DOI 10.1126/scitranslmed.abk3445
DatabaseName Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
DatabaseTitle MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
DatabaseTitleList MEDLINE
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod no_fulltext_linktorsrc
Discipline Medicine
EISSN 1946-6242
ExternalDocumentID 35014856
Genre Research Support, Non-U.S. Gov't
Journal Article
Research Support, N.I.H., Extramural
GrantInformation_xml – fundername: NIAID NIH HHS
  grantid: R01 AI134384
– fundername: Medical Research Council
  grantid: MC_UU_12014/12
– fundername: Medical Research Council
  grantid: MC_PC_19026
GroupedDBID ---
0R~
4.4
53G
7~K
ABJNI
ACGFS
AENEX
AFQFN
AJGZS
ALMA_UNASSIGNED_HOLDINGS
BKF
C45
CGR
CUY
CVF
DU5
EBS
ECM
EIF
EMOBN
F5P
HZ~
NPM
O9-
OFXIZ
OVD
OVIDX
P2P
RHI
TEORI
ID FETCH-LOGICAL-c436t-826b11d8e8991e63d76fc0b3039b84d80f27b956e640cc4a771d4f2671e432ac2
IngestDate Sat May 31 02:12:18 EDT 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 633
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c436t-826b11d8e8991e63d76fc0b3039b84d80f27b956e640cc4a771d4f2671e432ac2
ORCID 0000-0002-8672-4733
0000-0001-6103-8078
0000-0003-1475-659X
0000-0002-2724-7228
0000-0003-3224-8142
0000-0003-4817-4029
0000-0002-5797-1364
0000-0001-6290-7677
0000-0001-9343-127X
0000-0001-5852-741X
0000-0003-1716-6712
0000-0001-8580-7628
0000-0003-3360-1999
0000-0002-6931-7191
0000-0002-6666-3574
0000-0001-6338-0221
OpenAccessLink https://pubmed.ncbi.nlm.nih.gov/PMC8939770
PMID 35014856
ParticipantIDs pubmed_primary_35014856
PublicationCentury 2000
PublicationDate 2022-02-23
PublicationDateYYYYMMDD 2022-02-23
PublicationDate_xml – month: 02
  year: 2022
  text: 2022-02-23
  day: 23
PublicationDecade 2020
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle Science translational medicine
PublicationTitleAlternate Sci Transl Med
PublicationYear 2022
SSID ssj0068356
Score 2.64096
Snippet SARS-CoV-2 evolution threatens vaccine- and natural infection-derived immunity as well as the efficacy of therapeutic antibodies. To improve public health...
SourceID pubmed
SourceType Index Database
StartPage eabk3445
SubjectTerms COVID-19 - virology
Humans
Mutation
Pandemics
SARS-CoV-2 - genetics
Title Predicting the mutational drivers of future SARS-CoV-2 variants of concern
URI https://www.ncbi.nlm.nih.gov/pubmed/35014856
Volume 14
hasFullText
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1ba9swFBbpBqUvY9fujh72FlRsSZacxxA20kLGWNKub0U3Q2mblMwptG_75zuypNlLsrLtxRjJCNvf5-NzjvQdIfQh01I4RwtiB9QX1c400YxqUgAgRlrKXJNwm3wW42N-dFqc9no_uuqSWh-Yu626kv9BFdoAV6-S_Qdkfw0KDXAO-MIREIbjX2H8ZemnWeqkeLpa1Sm1Z5fNegvvCoaqIf3p8OuUjBYnhPZvID5Oy1-Mly0u510fNX3utf-NXaYR1yfhJyqiPemPbperNtsO9xkmkQ6_1zct9745FReLho3UUrs97x-uLs8XSaytunkICGG9rjvYJhds54AL4uUmvxlX3iGRYKxjK53SF4yHYpKbhjxtPRmeFZ7xYMvlAMf1VQNuMz9ahhrl9_eulddOXTtoBwINv3OqT_eEX7kA91REvWVUWm3czh7aTUOsRSaNhzJ7jB7F0AIPA0-eoJ6bP0W7k4jbM3TU0gUDXXBLFxzpghcVDnTBLV1woovvjXR5jo4_fZyNxiRupUEMZ6ImEETqPLelg_A6d4JZKSqTafBfBrrktswqKjWEyk7wzBiupMwtr6iQueOMKkNfoAfzxdy9RFhUWeEUt1rkhmsmFK3KTLlMF0oqK-UrtB_ewtl1qJdylt7P6z_2vEF7LaXeoocVfKDuHXh7tX7fIPITXxlXbg
linkProvider National Library of Medicine
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Predicting+the+mutational+drivers+of+future+SARS-CoV-2+variants+of+concern&rft.jtitle=Science+translational+medicine&rft.au=Maher%2C+M+Cyrus&rft.au=Bartha%2C+Istvan&rft.au=Weaver%2C+Steven&rft.au=di+Iulio%2C+Julia&rft.date=2022-02-23&rft.eissn=1946-6242&rft.volume=14&rft.issue=633&rft.spage=eabk3445&rft_id=info:doi/10.1126%2Fscitranslmed.abk3445&rft_id=info%3Apmid%2F35014856&rft_id=info%3Apmid%2F35014856&rft.externalDocID=35014856