GeoMine: interactive pattern mining of protein-ligand interfaces in the Protein Data Bank

The searching of user-defined 3D queries in molecular interfaces is a computationally challenging problem that is not satisfactorily solved so far. Most of the few existing tools focused on that purpose are desktop based and not openly available. Besides that, they show a lack of query versatility,...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics (Oxford, England) Vol. 37; no. 3; pp. 424 - 425
Main Authors Diedrich, Konrad, Graef, Joel, Schöning-Stierand, Katrin, Rarey, Matthias
Format Journal Article
LanguageEnglish
Published England 20.04.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The searching of user-defined 3D queries in molecular interfaces is a computationally challenging problem that is not satisfactorily solved so far. Most of the few existing tools focused on that purpose are desktop based and not openly available. Besides that, they show a lack of query versatility, search efficiency and user-friendliness. We address this issue with GeoMine, a publicly available web application that provides textual, numerical and geometrical search functionality for protein-ligand binding sites derived from structural data contained in the Protein Data Bank (PDB). The query generation is supported by a 3D representation of a start structure that provides interactively selectable elements like atoms, bonds and interactions. GeoMine gives full control over geometric variability in the query while performing a deterministic, precise search. Reasonably selective queries are processed on the entire set of protein-ligand complexes in the PDB within a few minutes. GeoMine offers an interactive and iterative search process of successive result analyses and query adaptations. From the numerous potential applications, we picked two from the field of side-effect analyze showcasing the usefulness of GeoMine. GeoMine is part of the ProteinsPlus web application suite and freely available at https://proteins.plus. Supplementary data are available at Bioinformatics online.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1367-4803
1367-4811
DOI:10.1093/bioinformatics/btaa693