Biblio-MetReS for user-friendly mining of genes and biological processes in scientific documents

Bibliographic Details
Published in	PeerJ (San Francisco, CA) Vol. 2; p. e276
Main Authors	Usie, Anabel; Karathia, Hiren; Teixidó, Ivan; Alves, Rui; Solsona, Francesc
Format	Journal Article
Language	English
Published	United States PeerJ. Ltd 27.02.2014
Subjects	Analysis; Automation; Bibliography; Bibliometrics; Bioinformatics; Computational Biology; Computer programming; Data mining; Dictionaries; Evolutionary genetics; Genes; Literature analysis; Mining industry; Molecular Biology; Morphology; Natural language; Natural language processing; Network reconstruction; Online databases; Parameter estimation; Systems biology; Web applications

More Information
Summary:	One way to initiate the reconstruction of molecular circuits is by using automated text-mining techniques. Developing more efficient methods for such reconstruction is a topic of active research, and those methods are typically included by bioinformaticians in pipelines used to mine and curate large literature datasets. Nevertheless, experimental biologists have a limited number of available user-friendly tools that use text-mining for network reconstruction and require no programming skills to use. One of these tools is Biblio-MetReS. Originally, this tool permitted an on-the-fly analysis of documents contained in a number of web-based literature databases to identify co-occurrence of proteins/genes. This approach ensured results that were always up-to-date with the latest live version of the databases. However, this 'up-to-dateness' came at the cost of long execution times. Here we report an evolution of the application Biblio-MetReS that permits constructing co-occurrence networks for genes, GO processes, pathways, or any combination of the three types of entities, and graphically representing those entities. We show that the performance of Biblio-MetReS in identifying gene co-occurrence is at least as good as that of other comparable applications (STRING and iHOP). In addition, we show that the identification of GO processes is on par with that reported in the latest BioCreAtIvE challenge. Finally, we report the implementation of a new strategy that combines on-the-fly analysis of new documents with preprocessed information from documents that were encountered in previous analyses. This combination simultaneously decreases program run time and maintains 'up-to-dateness' of the results. Availability: http://metres.udl.cat/index.php/downloads. Contact: metres.cmb@gmail.com.
ISSN:	2167-8359
DOI:	10.7717/peerj.276