Ongoing global and regional adaptive evolution of SARS-CoV-2

Bibliographic Details
Published in Proceedings of the National Academy of Sciences - PNAS Vol. 118; no. 29; pp. 1 - 10
Main Authors Rochman, Nash D., Wolf, Yuri I., Faure, Guilhem, Mutz, Pascal, Zhang, Feng, Koonin, Eugene V.
Format Journal Article
Language English
Published Washington National Academy of Sciences 20.07.2021
Summary: Understanding the trends in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) evolution is paramount to control the COVID-19 pandemic. We analyzed more than 300,000 high-quality genome sequences of SARS-CoV-2 variants available as of January 2021. The results show that the ongoing evolution of SARS-CoV-2 during the pandemic is characterized primarily by purifying selection, but a small set of sites appear to evolve under positive selection. The receptor-binding domain of the spike protein and the region of the nucleocapsid protein associated with nuclear localization signals (NLS) are enriched with positively selected amino acid replacements. These replacements form a strongly connected network of apparent epistatic interactions and are signatures of major partitions in the SARS-CoV-2 phylogeny. Virus diversity within each geographic region has been steadily growing for the entirety of the pandemic, but analysis of the phylogenetic distances between pairs of regions reveals four distinct periods based on global partitioning of the tree and the emergence of key mutations. The initial period of rapid diversification into region-specific phylogenies that ended in February 2020 was followed by a major extinction event and global homogenization concomitant with the spread of D614G in the spike protein, ending in March 2020. The NLS-associated variants across multiple partitions rose to global prominence in March to July, during a period of stasis in terms of interregional diversity. Finally, beginning in July 2020, multiple mutations, some of which have since been demonstrated to enable antibody evasion, began to emerge associated with ongoing regional diversification, which might be indicative of speciation.
Bibliography: Author contributions: E.V.K. designed research; N.D.R. and G.F. performed research; N.D.R., Y.I.W., G.F., P.M., F.Z., and E.V.K. analyzed data; and N.D.R. and E.V.K. wrote the paper.
Contributed by Feng Zhang, May 4, 2021 (sent for review March 3, 2021; reviewed by Sergei Maslov and Joshua B. Plotkin)
Reviewers: S.M., University of Illinois at Urbana Champaign; J.B.P., University of Pennsylvania.
ISSN: 0027-8424, 1091-6490
DOI: 10.1073/pnas.2104241118