Catching hidden variation: systematic correction of reference minor allele annotation in clinical variant calling

Purpose We comprehensively assessed the influence of reference minor alleles (RMAs), one of the inherent problems of the human reference genome sequence. Methods The variant call format (VCF) files provided by the 1000 Genomes and Exome Aggregation Consortium (ExAC) consortia were used to identify R...

Full description

Saved in:
Bibliographic Details
Published inGenetics in medicine Vol. 20; no. 3; pp. 360 - 364
Main Authors Barbitoff, Yury A, Bezdvornykh, Igor V, Polev, Dmitrii E, Serebryakova, Elena A, Glotov, Andrey S, Glotov, Oleg S, Predeus, Alexander V
Format Journal Article
LanguageEnglish
Published New York Nature Publishing Group US 01.03.2018
Elsevier Limited
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Purpose We comprehensively assessed the influence of reference minor alleles (RMAs), one of the inherent problems of the human reference genome sequence. Methods The variant call format (VCF) files provided by the 1000 Genomes and Exome Aggregation Consortium (ExAC) consortia were used to identify RMA sites. All coding RMA sites were checked for concordance with UniProt and the presence of same codon variants. RMA-corrected predictions of functional effect were obtained with SIFT, PolyPhen-2, and PROVEAN standalone tools and compared with dbNSFP v2.9 for consistency. Results We systematically characterized the problem of RMAs and identified several possible ways in which RMA could interfere with accurate variant discovery and annotation. We have discovered a systematic bias in the automated variant effect prediction at the RMA loci, as well as widespread switching of functional consequences for variants located in the same codon as the RMA. As a convenient way to address the problem of RMAs we have developed a simple bioinformatic tool that identifies variation at RMA sites and provides correct annotations for all such substitutions. The tool is available free of charge at http://rmahunter.bioinf.me . Conclusion Correction of RMA annotation enhances the accuracy of next-generation sequencing–based methods in clinical practice.
ISSN:1098-3600
1530-0366
DOI:10.1038/gim.2017.168