rMVP: A Memory-efficient, Visualization-enhanced, and Parallel-accelerated Tool for Genome-wide Association Study

Along with the development of high-throughput sequencing technologies, both sample size and SNP number are increasing rapidly in genome-wide association studies (GWAS), and the associated computation is more challenging than ever. Here, we present a memory-efficient, visualization-enhanced, and para...

Full description

Saved in:
Bibliographic Details
Published inGenomics, proteomics & bioinformatics Vol. 19; no. 4; pp. 619 - 628
Main Authors Yin, Lilin, Zhang, Haohao, Tang, Zhenshuang, Xu, Jingya, Yin, Dong, Zhang, Zhiwu, Yuan, Xiaohui, Zhu, Mengjin, Zhao, Shuhong, Li, Xinyun, Liu, Xiaolei
Format Journal Article
LanguageEnglish
Published China Elsevier B.V 01.08.2021
Key Laboratory of Agricultural Animal Genetics,Breeding and Reproduction,Ministry of Education&College of Animal Science and Technology,Huazhong Agricultural University,Wuhan 430070,China
Key Laboratory of Swine Genetics and Breeding,Ministry of Agriculture,Huazhong Agricultural University,Wuhan 430070,China%School of Computer Science and Technology,Wuhan University of Technology,Wuhan 430070,China%Department of Crop and Soil Sciences,Washington State University,Pullman,WA 99164,USA
Elsevier
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Along with the development of high-throughput sequencing technologies, both sample size and SNP number are increasing rapidly in genome-wide association studies (GWAS), and the associated computation is more challenging than ever. Here, we present a memory-efficient, visualization-enhanced, and parallel-accelerated R package called “rMVP” to address the need for improved GWAS computation. rMVP can 1) effectively process large GWAS data, 2) rapidly evaluate population structure, 3) efficiently estimate variance components by Efficient Mixed-Model Association eXpedited (EMMAX), Factored Spectrally Transformed Linear Mixed Models (FaST-LMM), and Haseman-Elston (HE) regression algorithms, 4) implement parallel-accelerated association tests of markers using general linear model (GLM), mixed linear model (MLM), and fixed and random model circulating probability unification (FarmCPU) methods, 5) compute fast with a globally efficient design in the GWAS processes, and 6) generate various visualizations of GWAS-related information. Accelerated by block matrix multiplication strategy and multiple threads, the association test methods embedded in rMVP are significantly faster than PLINK, GEMMA, and FarmCPU_pkg. rMVP is freely available at https://github.com/xiaolei-lab/rMVP.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Equal contribution.
ISSN:1672-0229
2210-3244
DOI:10.1016/j.gpb.2020.10.007