Protocol to use protein language models predicting and following experimental validation of function-enhancing variants of thymine-N-glycosylase

Protein language models (PLMs) are machine learning tools trained to predict masked amino acids within protein sequences, offering opportunities to enhance protein function without prior knowledge of their specific roles. Here, we present a protocol for optimizing thymine-DNA-glycosylase (TDG) using...

Full description

Saved in:
Bibliographic Details
Published inSTAR protocols Vol. 5; no. 3; p. 103188
Main Authors He, Yan, Zhou, Xibin, Yuan, Fajie, Chang, Xing
Format Journal Article
LanguageEnglish
Published United States Elsevier Inc 12.07.2024
Elsevier
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Protein language models (PLMs) are machine learning tools trained to predict masked amino acids within protein sequences, offering opportunities to enhance protein function without prior knowledge of their specific roles. Here, we present a protocol for optimizing thymine-DNA-glycosylase (TDG) using PLMs. We describe steps for “zero-shot” enzyme optimization, construction of plasmids, double plasmid transfection, and high-throughput sequencing and data analysis. This protocol holds promise for streamlining the engineering of gene editing tools, delivering improved activity while minimizing the experimental workload. For complete details on the use and execution of this protocol, please refer to He et al.1 [Display omitted] •Optimizing thymine-N-glycosylase using protein language models (PLMs)•Use of PLMs to optimize enzymes without extensive task-specific training data•Protocol for “zero-shot” enzyme optimization Publisher’s note: Undertaking any experimental protocol requires adherence to local institutional guidelines for laboratory safety and ethics. Protein language models (PLMs) are machine learning tools trained to predict masked amino acids within protein sequences, offering opportunities to enhance protein function without prior knowledge of their specific roles. Here, we present a protocol for optimizing thymine-DNA-glycosylase (TDG) using PLMs. We describe steps for “zero-shot” enzyme optimization, construction of plasmids, double plasmid transfection, and high-throughput sequencing and data analysis. This protocol holds promise for streamlining the engineering of gene editing tools, delivering improved activity while minimizing the experimental workload.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Technical contact
These authors contributed equally
Lead contact
ISSN:2666-1667
2666-1667
DOI:10.1016/j.xpro.2024.103188