DrugQuest - a text mining workflow for drug association discovery

Bibliographic Details
Published in	BMC bioinformatics Vol. 17, Suppl 5; p. 182
Main Authors	Papanikolaou, Nikolas; Pavlopoulos, Georgios A; Theodosiou, Theodosios; Vizirianakis, Ioannis S; Iliopoulos, Ioannis
Format	Journal Article
Language	English
Published	England: BioMed Central Ltd, 06.06.2016
Subjects	Algorithms; Cluster Analysis; Data mining; Databases, Factual; Drug Discovery; Humans; Internet; Methods; Pharmaceutical Preparations - chemistry; Pharmaceutical Preparations - metabolism; User-Computer Interface; Greece; Text mining; Document clustering; Data integration; Drug associations; Knowledge discovery; Name entity recognition; Chemicals
Summary:	Text mining and data integration methods are gaining ground in the health sciences due to the exponential growth of biomedical literature and of the information stored in biological databases. While such methods mostly try to extract bioentity associations from PubMed, very few of them are dedicated to mining other types of repositories, such as chemical databases. Herein, we apply a text mining approach to the DrugBank database in order to explore drug associations based on the DrugBank "Description", "Indication", "Pharmacodynamics" and "Mechanism of Action" text fields. We apply Named Entity Recognition (NER) techniques to these fields to identify chemicals, proteins, genes, pathways and diseases, and we use the TextQuest algorithm to find additional biologically significant words. Using a variety of similarity and partitional clustering techniques, we group the DrugBank records based on their common terms and investigate possible reasons why these records are clustered together. Different views, such as chemicals clustered by their textual information and tag clouds consisting of Significant Terms along with the terms that were used for clustering, are delivered to the user through a user-friendly web interface. DrugQuest is a text mining tool for knowledge discovery: it is designed to cluster DrugBank records based on text attributes in order to find new associations between drugs. The service is freely available at http://bioinformatics.med.uoc.gr/drugquest.
ISSN:	1471-2105
DOI:	10.1186/s12859-016-1041-6
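
The summary describes grouping records by the terms they share. The general idea can be illustrated with a minimal Python sketch. This is not the DrugQuest implementation: it replaces NER and the TextQuest algorithm with a plain stopword filter, and it replaces the partitional clustering mentioned in the summary with a simple threshold-based single-linkage grouping over Jaccard similarity. The record identifiers, the sample texts, the `STOPWORDS` list and the `threshold` value are all hypothetical and are not taken from DrugBank.

```python
import re
from itertools import combinations

# Hypothetical stopword list; a real pipeline would use NER and term weighting.
STOPWORDS = {"a", "an", "and", "the", "of", "to", "in", "is", "that", "by", "for", "with"}

# Invented toy records standing in for free-text database fields.
RECORDS = {
    "record_a": "Inhibits enzyme alpha and lowers blood pressure",
    "record_b": "Lowers blood pressure by blocking enzyme alpha",
    "record_c": "Antibiotic that disrupts bacterial cell wall synthesis",
}


def terms(text):
    """Return the set of lowercase words in text, minus stopwords."""
    return {w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS}


def jaccard(a, b):
    """Jaccard similarity of two sets: shared terms divided by all terms."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def cluster(records, threshold=0.3):
    """Group record ids linked by Jaccard similarity >= threshold.

    Uses union-find, so two records end up in the same group if a chain
    of sufficiently similar records connects them (single linkage).
    """
    term_sets = {rid: terms(text) for rid, text in records.items()}
    parent = {rid: rid for rid in records}

    def find(x):
        # Follow parent links to the group representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(records, 2):
        if jaccard(term_sets[a], term_sets[b]) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for rid in records:
        groups.setdefault(find(rid), []).append(rid)
    return sorted(sorted(g) for g in groups.values())


if __name__ == "__main__":
    for group in cluster(RECORDS):
        shared = set.intersection(*(terms(RECORDS[r]) for r in group))
        print(group, "shared terms:", sorted(shared))
```

In this toy data, `record_a` and `record_b` share five of their seven distinct terms and fall into one group, while `record_c` shares none and stays alone. Printing the shared terms per group is a rough analogue of inspecting which terms drove a cluster, which is the kind of view the summary says the web interface offers.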