Score-Based Generative Models for Designing Binding Peptide Backbones

Score-based generative models (SGMs) have proven to be powerful tools for designing new proteins. Designing proteins that bind a pre-specified target is highly relevant to a range of medical and industrial applications. Despite the flurry of new SGMs in the last year, there has been little systemati...

Full description

Saved in:
Bibliographic Details
Published inarXiv.org
Main Authors Boom, John D, Greenig, Matthew, Sormanni, Pietro, Liò, Pietro
Format Paper
LanguageEnglish
Published Ithaca Cornell University Library, arXiv.org 03.11.2023
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Score-based generative models (SGMs) have proven to be powerful tools for designing new proteins. Designing proteins that bind a pre-specified target is highly relevant to a range of medical and industrial applications. Despite the flurry of new SGMs in the last year, there has been little systematic exploration of the impact of design choices in SGMs for protein design. Here we present LoopGen, a flexible SGM framework for the design of short binding peptide structures. We apply our framework to design antibody binding loop structures conditional on a target epitope and evaluate a variety of modelling choices in SGM-based protein design. We demonstrate that modelling residue orientations in addition to positions improves not only the quality of the output structures but also their diversity. Additionally, we identify variance schedules that result in significant performance improvements and observe patterns that may motivate the development of better schedules for protein design. Finally, we develop three novel tests to evaluate whether the model generates structures that are appropriately conditioned on an epitope, demonstrating that LoopGen's generated structures are dependent on the structure, sequence, and position of the epitope. Our findings will help guide future development and evaluation of generative models for binding proteins.
AbstractList Score-based generative models (SGMs) have proven to be powerful tools for designing new proteins. Designing proteins that bind a pre-specified target is highly relevant to a range of medical and industrial applications. Despite the flurry of new SGMs in the last year, there has been little systematic exploration of the impact of design choices in SGMs for protein design. Here we present LoopGen, a flexible SGM framework for the design of short binding peptide structures. We apply our framework to design antibody binding loop structures conditional on a target epitope and evaluate a variety of modelling choices in SGM-based protein design. We demonstrate that modelling residue orientations in addition to positions improves not only the quality of the output structures but also their diversity. Additionally, we identify variance schedules that result in significant performance improvements and observe patterns that may motivate the development of better schedules for protein design. Finally, we develop three novel tests to evaluate whether the model generates structures that are appropriately conditioned on an epitope, demonstrating that LoopGen's generated structures are dependent on the structure, sequence, and position of the epitope. Our findings will help guide future development and evaluation of generative models for binding proteins.
Author Greenig, Matthew
Liò, Pietro
Boom, John D
Sormanni, Pietro
Author_xml – sequence: 1
  givenname: John
  surname: Boom
  middlename: D
  fullname: Boom, John D
– sequence: 2
  givenname: Matthew
  surname: Greenig
  fullname: Greenig, Matthew
– sequence: 3
  givenname: Pietro
  surname: Sormanni
  fullname: Sormanni, Pietro
– sequence: 4
  givenname: Pietro
  surname: Liò
  fullname: Liò, Pietro
BookMark eNqNjcsKgkAUQIcoyMp_GGgt6Ew-1pbVJghqL5NzlTG5Y3O178-gD2h1FufAWbE5WoQZ84SUUZDthFgyn6gNw1AkqYhj6bHiVlkHQa4IND8BglODeQO_WA0d8do6fgAyDRpseG5Qf3mFfjAaeK6q52Na0IYtatUR-D-u2fZY3PfnoHf2NQINZWtHh5MqRZYmUZZGSSz_qz6Ikzwx
ContentType Paper
Copyright 2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Copyright_xml – notice: 2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
DBID 8FE
8FG
ABJCF
ABUWG
AFKRA
AZQEC
BENPR
BGLVJ
CCPQU
DWQXO
HCIFZ
L6V
M7S
PIMPY
PQEST
PQQKQ
PQUKI
PRINS
PTHSS
DatabaseName ProQuest SciTech Collection
ProQuest Technology Collection
Materials Science & Engineering Database (Proquest)
ProQuest Central (Alumni)
ProQuest Central
ProQuest Central Essentials
AUTh Library subscriptions: ProQuest Central
Technology Collection
ProQuest One Community College
ProQuest Central
SciTech Premium Collection
ProQuest Engineering Collection
ProQuest Engineering Database
ProQuest - Publicly Available Content Database
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
Engineering Collection
DatabaseTitle Publicly Available Content Database
Engineering Database
Technology Collection
ProQuest Central Essentials
ProQuest One Academic Eastern Edition
ProQuest Central (Alumni Edition)
SciTech Premium Collection
ProQuest One Community College
ProQuest Technology Collection
ProQuest SciTech Collection
ProQuest Central China
ProQuest Central
ProQuest Engineering Collection
ProQuest One Academic UKI Edition
ProQuest Central Korea
Materials Science & Engineering Collection
ProQuest One Academic
Engineering Collection
DatabaseTitleList Publicly Available Content Database
Database_xml – sequence: 1
  dbid: 8FG
  name: ProQuest Technology Collection
  url: https://search.proquest.com/technologycollection1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Physics
EISSN 2331-8422
Genre Working Paper/Pre-Print
GroupedDBID 8FE
8FG
ABJCF
ABUWG
AFKRA
ALMA_UNASSIGNED_HOLDINGS
AZQEC
BENPR
BGLVJ
CCPQU
DWQXO
FRJ
HCIFZ
L6V
M7S
M~E
PIMPY
PQEST
PQQKQ
PQUKI
PRINS
PTHSS
ID FETCH-proquest_journals_28761871653
IEDL.DBID BENPR
IngestDate Wed Sep 25 00:15:33 EDT 2024
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-proquest_journals_28761871653
OpenAccessLink https://www.proquest.com/docview/2876187165/abstract/?pq-origsite=%requestingapplication%
PQID 2876187165
PQPubID 2050157
ParticipantIDs proquest_journals_2876187165
PublicationCentury 2000
PublicationDate 20231103
PublicationDateYYYYMMDD 2023-11-03
PublicationDate_xml – month: 11
  year: 2023
  text: 20231103
  day: 03
PublicationDecade 2020
PublicationPlace Ithaca
PublicationPlace_xml – name: Ithaca
PublicationTitle arXiv.org
PublicationYear 2023
Publisher Cornell University Library, arXiv.org
Publisher_xml – name: Cornell University Library, arXiv.org
SSID ssj0002672553
Score 3.49599
SecondaryResourceType preprint
Snippet Score-based generative models (SGMs) have proven to be powerful tools for designing new proteins. Designing proteins that bind a pre-specified target is highly...
SourceID proquest
SourceType Aggregation Database
SubjectTerms Antibodies
Binding
Industrial applications
Modelling
Peptides
Proteins
Schedules
Title Score-Based Generative Models for Designing Binding Peptide Backbones
URI https://www.proquest.com/docview/2876187165/abstract/
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1LS8QwEB52twjefOJjXQJ6DaVNmmxPQrVrEVyKD9jbkqQpiKLrtl797WZiqwdhjyGQkBC--WYy3wzARZWqmNfWUCm0plyqlCphOGV1lFrBUfjuq33ORfHEbxfJYgBFr4XBtMoeEz1QV-8GY-ShY_YiQnafhEpjFMC04eXqg2L_KPxn7ZppDCGII44ftkGWz8v733hLLKRjz-wf5Ho7MtuBoFQru96FgX3bgy2ffmmafcgfsJokzZxJqchPJWiEIYKdyl4b4oglufapFs7QkOzZK1FIiQkplSWZMi8aS-4fwPksf7wqaL_3snspzfLvXOwQRs7lt0dAailU5biQTGLNDWpnTc2YYc4b0Gwq62MYb1rpZPP0KWxj03SvqGNjGLXrT3vmTGurJzCczm4m3d250d1X_g0CVIPP
link.rule.ids 786,790,12792,21416,33408,33779,43635,43840
linkProvider ProQuest
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3fS8MwED60RfTNn-icGtDXIGvSZH0Sqh1VZyk6YW8lSVMQRec6_39zsdMHYc-FlITwfd9d7r4DuKgTFfHGGiqF1pRLlVAlDKesGSRWcGx8926fhcif-d00nnYJt7Yrq1xiogfq-sNgjvzSKXsxQHUfX80-KU6NwtfVboTGOoScuVAlgDDNivLxN8sSCek0M_sHtJ49RtsQlmpm5zuwZt93YcMXXZp2D7In9JCkqSOSmvz4PyP4EJxP9tYSJyfJjS-wcPRC0hfff0JKLEOpLUmVedVotL8P56Nscp3T5b-r7n601d9u2AEELtC3h0AaKVTtFJCMI80NdsyahjHDXAyg2VA2R9BftVJv9ecz2MwnD-NqfFvcH8MWjk33PXWsD8Fi_mVPHLku9Gl3gt_BcoDr
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Score-Based+Generative+Models+for+Designing+Binding+Peptide+Backbones&rft.jtitle=arXiv.org&rft.au=Boom%2C+John+D&rft.au=Greenig%2C+Matthew&rft.au=Sormanni%2C+Pietro&rft.au=Li%C3%B2%2C+Pietro&rft.date=2023-11-03&rft.pub=Cornell+University+Library%2C+arXiv.org&rft.eissn=2331-8422