Unsupervised discovery of phenotype-specific multi-omics networks

Abstract Motivation Complex diseases often involve a wide spectrum of phenotypic traits. Better understanding of the biological mechanisms relevant to each trait promotes understanding of the etiology of the disease and the potential for targeted and effective treatment plans. There have been many e...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics Vol. 35; no. 21; pp. 4336 - 4343
Main Authors Shi, W Jenny, Zhuang, Yonghua, Russell, Pamela H, Hobbs, Brian D, Parker, Margaret M, Castaldi, Peter J, Rudra, Pratyaydipta, Vestal, Brian, Hersh, Craig P, Saba, Laura M, Kechris, Katerina
Format Journal Article
LanguageEnglish
Published England Oxford University Press 01.11.2019
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Abstract Motivation Complex diseases often involve a wide spectrum of phenotypic traits. Better understanding of the biological mechanisms relevant to each trait promotes understanding of the etiology of the disease and the potential for targeted and effective treatment plans. There have been many efforts towards omics data integration and network reconstruction, but limited work has examined the incorporation of relevant (quantitative) phenotypic traits. Results We propose a novel technique, sparse multiple canonical correlation network analysis (SmCCNet), for integrating multiple omics data types along with a quantitative phenotype of interest, and for constructing multi-omics networks that are specific to the phenotype. As a case study, we focus on miRNA–mRNA networks. Through simulations, we demonstrate that SmCCNet has better overall prediction performance compared to popular gene expression network construction and integration approaches under realistic settings. Applying SmCCNet to studies on chronic obstructive pulmonary disease (COPD) and breast cancer, we found enrichment of known relevant pathways (e.g. the Cadherin pathway for COPD and the interferon-gamma signaling pathway for breast cancer) as well as less known omics features that may be important to the diseases. Although those applications focus on miRNA–mRNA co-expression networks, SmCCNet is applicable to a variety of omics and other data types. It can also be easily generalized to incorporate multiple quantitative phenotype simultaneously. The versatility of SmCCNet suggests great potential of the approach in many areas. Availability and implementation The SmCCNet algorithm is written in R, and is freely available on the web at https://cran.r-project.org/web/packages/SmCCNet/index.html. Supplementary information Supplementary data are available at Bioinformatics online.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
The authors wish it to be known that, in their opinion, W. Jenny Shi and Yonghua Zhuang authors and should be regarded as Joint First Authors.
ISSN:1367-4803
1460-2059
1367-4811
DOI:10.1093/bioinformatics/btz226