Genome-wide association study of a semicontinuous trait: illustration of the impact of the modeling strategy through the study of Neutrophil Extracellular Traps levels

Abstract Over the last years, there has been a considerable expansion of genome-wide association studies (GWAS) for discovering biological pathways underlying pathological conditions or disease biomarkers. These GWAS are often limited to binary or quantitative traits analyzed through linear or logis...

Full description

Saved in:
Bibliographic Details
Published inNAR genomics and bioinformatics Vol. 5; no. 2; p. lqad062
Main Authors Munsch, Gaëlle, Proust, Carole, Labrouche-Colomer, Sylvie, Aïssi, Dylan, Boland, Anne, Morange, Pierre-Emmanuel, Roche, Anne, de Chaisemartin, Luc, Harroche, Annie, Olaso, Robert, Deleuze, Jean-François, James, Chloé, Emmerich, Joseph, Smadja, David M, Jacqmin-Gadda, Hélène, Trégouët, David-Alexandre
Format Journal Article
LanguageEnglish
Published England Oxford University Press 01.06.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Abstract Over the last years, there has been a considerable expansion of genome-wide association studies (GWAS) for discovering biological pathways underlying pathological conditions or disease biomarkers. These GWAS are often limited to binary or quantitative traits analyzed through linear or logistic models, respectively. In some situations, the distribution of the outcome may require more complex modeling, such as when the outcome exhibits a semicontinuous distribution characterized by an excess of zero values followed by a non-negative and right-skewed distribution. We here investigate three different modeling for semicontinuous data: Tobit, Negative Binomial and Compound Poisson-Gamma. Using both simulated data and a real GWAS on Neutrophil Extracellular Traps (NETs), an emerging biomarker in immuno-thrombosis, we demonstrate that Compound Poisson-Gamma was the most robust model with respect to low allele frequencies and outliers. This model further identified the MIR155HG locus as significantly (P = 1.4 × 10−8) associated with NETs plasma levels in a sample of 657 participants, a locus recently highlighted to be involved in NETs formation in mice. This work highlights the importance of the modeling strategy for GWAS of a semicontinuous outcome and suggests Compound Poisson-Gamma as an elegant but neglected alternative to Negative Binomial for modeling semicontinuous outcome in the context of genomic investigations.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
The authors wish it to be known that, in their opinion, the last two authors should be regarded as Joint Last Authors.
ISSN:2631-9268
2631-9268
DOI:10.1093/nargab/lqad062