Structome: a tool for the rapid assembly of datasets for structural phylogenetics
Abstract Summary Protein structures carry signal of common ancestry and can therefore aid in reconstructing their evolutionary histories. To expedite the structure-informed inference process, a web server, Structome, has been developed that allows users to rapidly identify protein structures similar...
Saved in:
Published in | Bioinformatics advances Vol. 3; no. 1; p. vbad134 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
Oxford University Press
05.01.2023
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Abstract
Summary
Protein structures carry signal of common ancestry and can therefore aid in reconstructing their evolutionary histories. To expedite the structure-informed inference process, a web server, Structome, has been developed that allows users to rapidly identify protein structures similar to a query protein and to assemble datasets useful for structure-based phylogenetics. Structome was created by clustering ∼94% of the structures in RCSB PDB using 90% sequence identity and representing each cluster by a centroid structure. Structure similarity between centroid proteins was calculated, and annotations from PDB, SCOP, and CATH were integrated. To illustrate utility, an H3 histone was used as a query, and results show that the protein structures returned by Structome span both sequence and structural diversity of the histone fold. Additionally, the pre-computed nexus-formatted distance matrix, provided by Structome, enables analysis of evolutionary relationships between proteins not identifiable using searches based on sequence similarity alone. Our results demonstrate that, beginning with a single structure, Structome can be used to rapidly generate a dataset of structural neighbours and allows deep evolutionary history of proteins to be studied.
Availability and Implementation
Structome is available at: https://structome.bii.a-star.edu.sg. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 2635-0041 2635-0041 |
DOI: | 10.1093/bioadv/vbad134 |