Allele balance bias identifies systematic genotyping errors and false disease associations

In recent years, next‐generation sequencing (NGS) has become a cornerstone of clinical genetics and diagnostics. Many clinical applications require high precision, especially if rare events such as somatic mutations in cancer or genetic variants causing rare diseases need to be identified. Although...

Full description

Saved in:
Bibliographic Details
Published inHuman mutation Vol. 40; no. 1; pp. 115 - 126
Main Authors Muyas, Francesc, Bosio, Mattia, Puig, Anna, Susak, Hana, Domènech, Laura, Escaramis, Georgia, Zapata, Luis, Demidov, German, Estivill, Xavier, Rabionet, Raquel, Ossowski, Stephan
Format Journal Article
LanguageEnglish
Published United States Hindawi Limited 01.01.2019
Wiley
John Wiley and Sons Inc
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In recent years, next‐generation sequencing (NGS) has become a cornerstone of clinical genetics and diagnostics. Many clinical applications require high precision, especially if rare events such as somatic mutations in cancer or genetic variants causing rare diseases need to be identified. Although random sequencing errors can be modeled statistically and deep sequencing minimizes their impact, systematic errors remain a problem even at high depth of coverage. Understanding their source is crucial to increase precision of clinical NGS applications. In this work, we studied the relation between recurrent biases in allele balance (AB), systematic errors, and false positive variant calls across a large cohort of human samples analyzed by whole exome sequencing (WES). We have modeled the AB distribution for biallelic genotypes in 987 WES samples in order to identify positions recurrently deviating significantly from the expectation, a phenomenon we termed allele balance bias (ABB). Furthermore, we have developed a genotype callability score based on ABB for all positions of the human exome, which detects false positive variant calls that passed state‐of‐the‐art filters. Finally, we demonstrate the use of ABB for detection of false associations proposed by rare variant association studies. Availability: https://github.com/Francesc-Muyas/ABB. We studied the relation between recurrent biases in allele balance (AB), systematic errors and false positive variant calls across large cohorts of human samples analyzed by whole exome sequencing (WES). We modeled the allele balance distribution for biallelic genotypes in 987 WES samples and identified positions recurrently deviating from the expectation, a phenomenon we termed allele balance bias (ABB). Furthermore, we developed a genotype callability score for the human exome and demonstrate the use of ABB for detection of false associations proposed by rare variant association studies.
Bibliography:Communicated by Pui‐Yan Kwok
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1059-7794
1098-1004
DOI:10.1002/humu.23674