Protocol to use protein language models predicting and following experimental validation of function-enhancing variants of thymine-N-glycosylase
Protein language models (PLMs) are machine learning tools trained to predict masked amino acids within protein sequences, offering opportunities to enhance protein function without prior knowledge of their specific roles. Here, we present a protocol for optimizing thymine-DNA-glycosylase (TDG) using...
Saved in:
Published in | STAR protocols Vol. 5; no. 3; p. 103188 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
United States
Elsevier Inc
12.07.2024
Elsevier |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Protein language models (PLMs) are machine learning tools trained to predict masked amino acids within protein sequences, offering opportunities to enhance protein function without prior knowledge of their specific roles. Here, we present a protocol for optimizing thymine-DNA-glycosylase (TDG) using PLMs. We describe steps for “zero-shot” enzyme optimization, construction of plasmids, double plasmid transfection, and high-throughput sequencing and data analysis. This protocol holds promise for streamlining the engineering of gene editing tools, delivering improved activity while minimizing the experimental workload.
For complete details on the use and execution of this protocol, please refer to He et al.1
[Display omitted]
•Optimizing thymine-N-glycosylase using protein language models (PLMs)•Use of PLMs to optimize enzymes without extensive task-specific training data•Protocol for “zero-shot” enzyme optimization
Publisher’s note: Undertaking any experimental protocol requires adherence to local institutional guidelines for laboratory safety and ethics.
Protein language models (PLMs) are machine learning tools trained to predict masked amino acids within protein sequences, offering opportunities to enhance protein function without prior knowledge of their specific roles. Here, we present a protocol for optimizing thymine-DNA-glycosylase (TDG) using PLMs. We describe steps for “zero-shot” enzyme optimization, construction of plasmids, double plasmid transfection, and high-throughput sequencing and data analysis. This protocol holds promise for streamlining the engineering of gene editing tools, delivering improved activity while minimizing the experimental workload. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 Technical contact These authors contributed equally Lead contact |
ISSN: | 2666-1667 2666-1667 |
DOI: | 10.1016/j.xpro.2024.103188 |