Stability Indicators in Network Reconstruction

The number of available algorithms to infer a biological network from a dataset of high-throughput measurements is overwhelming and keeps growing. However, evaluating their performance is unfeasible unless a 'gold standard' is available to measure how close the reconstructed network is to...

Full description

Saved in:

Bibliographic Details
Published in	PloS one Vol. 9; no. 2; p. e89815
Main Authors	Filosi, Michele, Visintainer, Roberto, Riccadonna, Samantha, Jurman, Giuseppe, Furlanello, Cesare
Format	Journal Article
Language	English
Published	United States Public Library of Science 27.02.2014 Public Library of Science (PLoS)
Subjects	Accuracy Algorithms Analysis Biology Carcinoma, Hepatocellular - genetics Cell cycle Communication Computer Science Computer simulation Covariance Datasets DNA microarrays Gene expression Gene Regulatory Networks Genomes Ground truth Hepatitis Hepatocellular carcinoma Humans Indicators Liver cancer Liver Neoplasms - genetics Mathematics Medical prognosis Medical research MicroRNAs MicroRNAs - genetics Minimum inhibitory concentration miRNA Models, Biological Modularity Neural Networks (Computer) Reconstruction Reproducibility Resampling Schizophrenia Stability analysis Subgroups Variability Yeast Yeasts - physiology
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The number of available algorithms to infer a biological network from a dataset of high-throughput measurements is overwhelming and keeps growing. However, evaluating their performance is unfeasible unless a 'gold standard' is available to measure how close the reconstructed network is to the ground truth. One measure of this is the stability of these predictions to data resampling approaches. We introduce NetSI, a family of Network Stability Indicators, to assess quantitatively the stability of a reconstructed network in terms of inference variability due to data subsampling. In order to evaluate network stability, the main NetSI methods use a global/local network metric in combination with a resampling (bootstrap or cross-validation) procedure. In addition, we provide two normalized variability scores over data resampling to measure edge weight stability and node degree stability, and then introduce a stability ranking for edges and nodes. A complete implementation of the NetSI indicators, including the Hamming-Ipsen-Mikhailov (HIM) network distance adopted in this paper is available with the R package nettools. We demonstrate the use of the NetSI family by measuring network stability on four datasets against alternative network reconstruction methods. First, the effect of sample size on stability of inferred networks is studied in a gold standard framework on yeast-like data from the Gene Net Weaver simulator. We also consider the impact of varying modularity on a set of structurally different networks (50 nodes, from 2 to 10 modules), and then of complex feature covariance structure, showing the different behaviours of standard reconstruction methods based on Pearson correlation, Maximum Information Coefficient (MIC) and False Discovery Rate (FDR) strategy. Finally, we demonstrate a strong combined effect of different reconstruction methods and phenotype subgroups on a hepatocellular carcinoma miRNA microarray dataset (240 subjects), and we validate the analysis on a second dataset (166 subjects) with good reproducibility.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 Competing Interests: The authors have declared that no competing interests exist. Conceived and designed the experiments: GJ MF RV SR. Performed the experiments: GJ MF RV SR. Analyzed the data: GJ MF RV SR. Wrote the paper: GJ SR CF.
ISSN:	1932-6203 1932-6203
DOI:	10.1371/journal.pone.0089815