metGWAS 1.0: An R workflow for network-driven over-representation analysis between independent metabolomic and meta-genome wide association studies

Background: Many diseases may result from disrupted metabolic regulation. Metabolite-GWAS studies assess the association of polymorphic variants with metabolite levels in body fluids. While these studies are successful, they have a high cost and technical expertise burden due to combining the analyt...

Full description

Saved in:
Bibliographic Details
Published inbioRxiv
Main Authors Md Saifur Rahman Khan, Obersterescu, Andreea, Gunderson, Erica, Wheeler, Michael B, Cox, Brian J
Format Paper
LanguageEnglish
Published Cold Spring Harbor Cold Spring Harbor Laboratory Press 11.08.2022
Cold Spring Harbor Laboratory
Edition1.1
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Background: Many diseases may result from disrupted metabolic regulation. Metabolite-GWAS studies assess the association of polymorphic variants with metabolite levels in body fluids. While these studies are successful, they have a high cost and technical expertise burden due to combining the analytical biochemistry of metabolomics with the computational genetics of GWAS. Currently, there are 100s of standalone metabolomics and GWAS studies related to similar diseases or phenotypes. A method that could statically evaluate these independent studies to find novel metabolites-genes association is of high interest. Although such an analysis is limited to genes with known metabolite interactions due to the unpaired nature of the data sets, any discovered associations may represent biomarkers and druggable targets for treatment and prevention. Methods: We developed a bioinformatics tool, metGWAS 1.0, that generates and statistically compares metabolic and genomic gene sets using a hypergeometric test. Metabolic gene sets are generated by mapping disease-associated metabolites to interacting proteins (genes) via online databases. Genomic gene sets are identified from a network representation of the GWAS Catalog comprising 100s of studies. Results: The metGWAS 1.0 tool was evaluated using standalone metabolomics datasets extracted from two metabolomics-GWAS case studies. In case-study 1, a cardiovascular disease association study, we identified nine genes (APOA5, PLA2G5, PLA2G2D, PLA2G2E, PLA2G2F, LRAT, PLA2G2A, PLB1, and PLA2G7) that interact with metabolites in the KEGG glycerophospholipid metabolism pathway and contain polymorphic variants associated with cardiovascular disease (P < 0.005). The gene APOA5 was matched from the original metabolomics-GWAS study. In case study 2, a urine metabolome study of kidney metabolism in healthy subjects, we found marginal significance (P = 0.10 and P = 0.13) for glycine, serine, and threonine metabolism and alanine, aspartate, and glutamate metabolism pathways to GWAS data relating to kidney disease. Conclusion: The metGWAS 1.0 platform provides insight into developing methods that bridge standalone metabolomics and disease and phenotype GWAS data. We show the potential to reproduce findings of paired metabolomics-GWAS data and provide novel associations of gene variation and metabolite expression. Competing Interest Statement The authors have declared no competing interest.
Bibliography:SourceType-Working Papers-1
ObjectType-Working Paper/Pre-Print-1
content type line 50
Competing Interest Statement: The authors have declared no competing interest.
ISSN:2692-8205
2692-8205
DOI:10.1101/2022.08.09.503325