High-Throughput Virtual Screening of Small Molecule Inhibitors for SARS-CoV-2 Protein Targets with Deep Fusion Models

Structure-based Deep Fusion models were recently shown to outperform several physics- and machine learning-based protein-ligand binding affinity prediction methods. As part of a multi-institutional COVID-19 pandemic response, over 500 million small molecules were computationally screened against fou...

Full description

Saved in:
Bibliographic Details
Published inarXiv.org
Main Authors Stevenson, Garrett A, Jones, Derek, Kim, Hyojin, Drew Bennett, W F, Bennion, Brian J, Borucki, Monica, Bourguet, Feliza, Epstein, Aidan, Franco, Magdalena, Harmon, Brooke, He, Stewart, Katz, Max P, Kirshner, Daniel, Lao, Victoria, Lau, Edmond Y, Lo, Jacky, McLoughlin, Kevin, Mosesso, Richard, Murugesh, Deepa K, Negrete, Oscar A, Saada, Edwin A, Segelke, Brent, Maxwell, Stefan, Torres, Marisa W, Weilhammer, Dina, Wong, Sergio, Yang, Yue, Zemla, Adam, Zhang, Xiaohua, Zhu, Fangqiang, Lightstone, Felice C, Allen, Jonathan E
Format Paper
LanguageEnglish
Published Ithaca Cornell University Library, arXiv.org 01.06.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Structure-based Deep Fusion models were recently shown to outperform several physics- and machine learning-based protein-ligand binding affinity prediction methods. As part of a multi-institutional COVID-19 pandemic response, over 500 million small molecules were computationally screened against four protein structures from the novel coronavirus (SARS-CoV-2), which causes COVID-19. Three enhancements to Deep Fusion were made in order to evaluate more than 5 billion docked poses on SARS-CoV-2 protein targets. First, the Deep Fusion concept was refined by formulating the architecture as one, coherently backpropagated model (Coherent Fusion) to improve binding-affinity prediction accuracy. Secondly, the model was trained using a distributed, genetic hyper-parameter optimization. Finally, a scalable, high-throughput screening capability was developed to maximize the number of ligands evaluated and expedite the path to experimental evaluation. In this work, we present both the methods developed for machine learning-based high-throughput screening and results from using our computational pipeline to find SARS-CoV-2 inhibitors.
ISSN:2331-8422