Assessing the Impact of Sample Heterogeneity on Transcriptome Analysis of Human Diseases Using MDP Webtool

Bibliographic Details
Published in Frontiers in Genetics, Vol. 10; p. 971
Main Authors Gonçalves, André N. A., Lever, Melissa, Russo, Pedro S. T., Gomes-Correia, Bruno, Urbanski, Alysson H., Pollara, Gabriele, Noursadeghi, Mahdad, Maracaja-Coutinho, Vinicius, Nakaya, Helder I.
Format Journal Article
Language English
Published Frontiers Media S.A 24.10.2019
ISSN 1664-8021
DOI 10.3389/fgene.2019.00971

Summary: Transcriptome analyses have increased our understanding of the molecular mechanisms underlying human diseases. Most approaches aim to identify significant genes by comparing their expression values between healthy subjects and a group of patients with a certain disease. Given that studies normally contain few samples, the heterogeneity among individuals caused by environmental factors or undetected illnesses can impact gene expression analyses. We present a systematic analysis of sample heterogeneity in a variety of gene expression studies relating to inflammatory and infectious diseases and show that novel immunological insights may arise once heterogeneity is addressed. The perturbation score of samples is quantified using nonperturbed subjects (i.e., healthy subjects) as a reference group. Such a score allows us to detect outlying samples and subgroups of diseased patients and even assess the molecular perturbation of single cells infected with viruses. We also show how removal of outlying samples can improve the "signal" of the disease and impact detection of differentially expressed genes. The method is made available via the mdp Bioconductor R package and as a user-friendly webtool, webMDP, available at http://mdp.sysbio.tools.
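The idea of scoring each sample against a healthy reference group can be illustrated with a minimal Python sketch. It is not the mdp package's implementation, and the details are assumptions made for illustration: each gene is z-scored against the mean and standard deviation of the reference samples, absolute z-scores below a cutoff (`z_threshold`, set to 2 here as an assumption) are treated as unperturbed, and a sample's score is the average of the remaining values across genes. The function name, gene names, and sample IDs are hypothetical.

```python
from statistics import mean, stdev


def perturbation_scores(expression, control_ids, z_threshold=2.0):
    """Score every sample by its deviation from the reference (control) group.

    expression: dict mapping gene -> dict mapping sample_id -> expression value
    control_ids: sample IDs forming the nonperturbed reference group
    z_threshold: assumed cutoff below which a deviation counts as zero
    """
    sample_ids = list(next(iter(expression.values())))
    totals = {s: 0.0 for s in sample_ids}
    n_genes = 0
    for values in expression.values():
        reference = [values[s] for s in control_ids]
        mu, sd = mean(reference), stdev(reference)
        if sd == 0:
            continue  # constant in the reference group, so the z-score is undefined
        n_genes += 1
        for s in sample_ids:
            z = abs((values[s] - mu) / sd)
            if z >= z_threshold:
                totals[s] += z
    # Average the thresholded absolute z-scores over the genes that were used.
    return {s: (totals[s] / n_genes if n_genes else 0.0) for s in sample_ids}


# Toy data: three healthy subjects (h1-h3) and two patients (p1, p2).
expression = {
    "GENE_A": {"h1": 1.0, "h2": 2.0, "h3": 3.0, "p1": 2.0, "p2": 8.0},
    "GENE_B": {"h1": 4.0, "h2": 5.0, "h3": 6.0, "p1": 5.0, "p2": 9.0},
}
scores = perturbation_scores(expression, control_ids=["h1", "h2", "h3"])
```

In this toy input, patient p2 deviates strongly from the healthy group on both genes while p1 does not, so only p2 receives a non-zero score. The same ranking could be used to flag outlying samples or to separate subgroups of patients, as the summary describes.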
Bibliography:
This article was submitted to Systems Biology, a section of the journal Frontiers in Genetics
Edited by: Argyris Papantonis, University Medical Center Göttingen, Germany
Reviewed by: Debashis Sahoo, University of California, San Diego, United States; Lin Zhang, China University of Mining and Technology, China