SNP genotype calling and quality control for multi-batch-based studies
Background In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by...
Saved in:
Published in | Genes & genomics Vol. 41; no. 8; pp. 927 - 939 |
---|---|
Main Authors | , , , , , |
Format | Journal Article |
Language | English |
Published |
Singapore
Springer Singapore
01.08.2019
Springer Nature B.V 한국유전학회 |
Subjects | |
Online Access | Get full text |
ISSN | 1976-9571 2092-9293 2092-9293 |
DOI | 10.1007/s13258-019-00827-5 |
Cover
Loading…
Abstract | Background
In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required.
Methods
In this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately.
Results
The proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer’s disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects.
Conclusions
We proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification. |
---|---|
AbstractList | BackgroundIn genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required.MethodsIn this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately.ResultsThe proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer’s disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects.ConclusionsWe proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification. In genetic analyses, the term 'batch effect' refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required. In this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately. The proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer's disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects. We proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification. Background In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required. Methods In this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately. Results The proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer’s disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects. Conclusions We proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification. KCI Citation Count: 0 In genetic analyses, the term 'batch effect' refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required.BACKGROUNDIn genetic analyses, the term 'batch effect' refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required.In this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately.METHODSIn this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately.The proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer's disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects.RESULTSThe proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer's disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects.We proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification.CONCLUSIONSWe proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification. BACKGROUND: In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required. METHODS: In this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately. RESULTS: The proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer’s disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects. CONCLUSIONS: We proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification. Background In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required. Methods In this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately. Results The proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer’s disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects. Conclusions We proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification. |
Author | Lee, Jang Jae Seo, Sujin Lee, Kun Ho Choi, Kyu Yeong Won, Sungho Park, Kyungtaek |
Author_xml | – sequence: 1 givenname: Sujin surname: Seo fullname: Seo, Sujin organization: Department of Public Health Science, Graduate School of Public Health, Seoul National University – sequence: 2 givenname: Kyungtaek surname: Park fullname: Park, Kyungtaek organization: Interdisciplinary Program of Bioinformatics, College of National Sciences, Seoul National University – sequence: 3 givenname: Jang Jae surname: Lee fullname: Lee, Jang Jae organization: National Research Center for Dementia, Chosun University – sequence: 4 givenname: Kyu Yeong surname: Choi fullname: Choi, Kyu Yeong organization: National Research Center for Dementia, Chosun University, Premedical Science, School of Medicine, Chousn University – sequence: 5 givenname: Kun Ho surname: Lee fullname: Lee, Kun Ho email: leekho@chosun.ac.kr organization: National Research Center for Dementia, Chosun University, Department of Biomedical Science, College of Natural Sciences, Chosun University – sequence: 6 givenname: Sungho orcidid: 0000-0001-5751-5089 surname: Won fullname: Won, Sungho email: sunghow@gmail.com organization: Department of Public Health Science, Graduate School of Public Health, Seoul National University, Interdisciplinary Program of Bioinformatics, College of National Sciences, Seoul National University, Institute of Health and Environment, Seoul National University |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/31062262$$D View this record in MEDLINE/PubMed https://www.kci.go.kr/kciportal/ci/sereArticleSearch/ciSereArtiView.kci?sereArticleSearchBean.artiId=ART002492841$$DAccess content in National Research Foundation of Korea (NRF) |
BookMark | eNqFkU1PFTEUhhuDkQvyB1yYSdzoYrQ9_ZouCQEhIWoU102n07kW5raXtrO4_95eBmLCArvo2TzPSfu-R-ggxOAQekfwZ4Kx_JIJBd61mKgW4w5ky1-hFWAFrQJFD9CKKClaxSU5RCc53-J6KGGCkTfokBIsAASs0MWvbz-atQux7LausWaafFg3JgzN_WwmX3aNjaGkODVjTM1mnopve1Psn3pnNzS5zIN3-S16PZopu5PHeYx-X5zfnF2219-_Xp2dXreWMShtP1BrMZUDH3vFMOnM2AEGrhzuhXFSjWR0HROSUUyY5D0bKCgHrhe9GYaeHqNPy96QRn1nvY7GP8x11HdJn_68udJcCVnDqezHhd2meD-7XPTGZ-umyQQX56yBYk6klLj7PwoUCKXQkYp-eIbexjmF-ulKCSZAUr6n3j9Sc79xg94mvzFpp5-Cr0C3ADbFnJMbtfXFFL8P2_hJE6z3NeulZl1r1g81a15VeKY-bX9RoouUKxzWLv179gvWXy7Lto4 |
CitedBy_id | crossref_primary_10_1093_cercor_bhac483 crossref_primary_10_1002_gepi_22586 crossref_primary_10_1038_s41598_022_07567_9 crossref_primary_10_1038_s41440_024_01784_7 crossref_primary_10_1093_bib_bbae389 crossref_primary_10_1016_j_anai_2021_11_022 crossref_primary_10_1016_j_jhazmat_2024_136574 crossref_primary_10_3803_EnM_2021_1241 crossref_primary_10_1038_s41398_024_02777_3 crossref_primary_10_4168_aair_2021_13_4_609 crossref_primary_10_4093_dmj_2021_0375 crossref_primary_10_1186_s12916_022_02723_4 crossref_primary_10_1016_j_envint_2024_108709 crossref_primary_10_1007_s10120_022_01285_x crossref_primary_10_1038_s41598_022_24766_6 crossref_primary_10_1186_s13059_024_03288_6 crossref_primary_10_3389_fendo_2024_1375459 |
Cites_doi | 10.1186/1471-2105-9-S9-S17 10.1038/nrg2825 10.1159/000092553 10.1371/journal.pgen.1000477 10.1038/nprot.2010.116 10.1080/01621459.1952.10483441 10.2307/2333003 10.1016/j.ajhg.2009.11.004 10.1212/WNL.34.7.939 10.1093/nar/gkr798 10.1002/9780470685983 10.1016/j.ygeno.2004.05.003 10.1186/1471-2105-12-68 10.1186/1471-2164-9-431 10.1186/1471-2105-11-356 10.1038/tpj.2010.36 10.1111/j.1365-2796.2004.01380.x |
ContentType | Journal Article |
Copyright | The Genetics Society of Korea 2019 Copyright Springer Nature B.V. 2019 |
Copyright_xml | – notice: The Genetics Society of Korea 2019 – notice: Copyright Springer Nature B.V. 2019 |
DBID | AAYXX CITATION CGR CUY CVF ECM EIF NPM 7X8 7S9 L.6 ACYCR |
DOI | 10.1007/s13258-019-00827-5 |
DatabaseName | CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed MEDLINE - Academic AGRICOLA AGRICOLA - Academic Korean Citation Index |
DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic AGRICOLA AGRICOLA - Academic |
DatabaseTitleList | MEDLINE MEDLINE - Academic AGRICOLA |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Biology |
EISSN | 2092-9293 |
EndPage | 939 |
ExternalDocumentID | oai_kci_go_kr_ARTI_5967132 31062262 10_1007_s13258_019_00827_5 |
Genre | Research Support, Non-U.S. Gov't Journal Article |
GroupedDBID | --- -EM 06D 0R~ 0VY 1N0 203 29~ 2KG 30V 4.4 406 408 40D 5GY 67N 96X 9ZL AACDK AAHBH AAHNG AAIAL AAJBT AAJKR AANZL AARTL AASML AATNV AATVU AAUYE AAWCG AAYIU AAYQN AAYTO AAYZH AAZMS ABAKF ABDZT ABECU ABFTV ABJNI ABJOX ABKCH ABMQK ABPLI ABQBU ABSXP ABTEG ABTHY ABTKH ABTMW ABXPI ACAOD ACDTI ACGFS ACHSB ACKNC ACMDZ ACMLO ACOKC ACPIV ACPRK ACZOJ ADHHG ADHIR ADINQ ADKNI ADKPE ADRFC ADTPH ADURQ ADYFF ADZKW AEFQL AEGNC AEJHL AEJRE AEMSY AENEX AEOHA AEPYU AESKC AETCA AEVLU AEXYK AFBBN AFQWF AFWTZ AFZKB AGAYW AGDGC AGMZJ AGQEE AGQMX AGRTI AGWZB AGYKE AHAVH AHBYD AHYZX AIAKS AIGIU AIIXL AILAN AITGF AJRNO AJZVZ AKMHD ALFXC ALMA_UNASSIGNED_HOLDINGS AMKLP AMXSW AMYLF AMYQR ANMIH AOCGG AXYYD BGNMA CSCUP DDRTE DNIVK DPUIP DU5 EBLON EBS EIOEI EJD ESBYG FERAY FFXSO FIGPU FINBP FNLPD FRRFC FSGXE FYJPI GGCAI GGRSB GJIRD GQ6 GQ7 HMJXF HRMNR HZB I0C IAO IGS IHR IKXTQ IWAJR IXD J-C J0Z JBSCW JZLTJ KOV LLZTM M4Y NPVJJ NQJWS NU0 O9J PT4 R9I RLLFE ROL RSV S27 S3A S3B SBL SHX SISQX SJYHP SNE SNPRN SNX SOHCF SOJ SPISZ SRMVM SSLCW SSXJD STPWE T13 TSG U2A U9L UG4 UOJIU UTJUX UZXMN VC2 VFIZW W48 WK8 Z45 Z7U ZMTXR ZOVNA 2VQ AANXM AAPKM AARHV AAYXX ABBRH ABDBE ABFSG ACSTC AEBTG AEZWR AFDZB AFHIU AFLOW AFOHR AGJBK AHPBZ AHSBF AHWEU AIXLP AJBLW ATHPR AYFIA CITATION H13 HF~ HZ~ O9- S1Z CGR CUY CVF ECM EIF NPM ABRTQ 7X8 7S9 L.6 ACYCR |
ID | FETCH-LOGICAL-c442t-bd3cc037d5fb94018af820259e0b6ae79f1fe84674301475b4d329e2eb6baddb3 |
IEDL.DBID | U2A |
ISSN | 1976-9571 2092-9293 |
IngestDate | Sun Mar 09 07:52:27 EDT 2025 Fri Jul 11 10:17:46 EDT 2025 Fri Jul 11 10:06:00 EDT 2025 Fri Jul 25 11:18:05 EDT 2025 Wed Feb 19 02:30:49 EST 2025 Thu Apr 24 22:51:44 EDT 2025 Tue Jul 01 03:53:18 EDT 2025 Fri Feb 21 02:38:46 EST 2025 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 8 |
Keywords | Calling K-medoid clustering Batch effect Quality control Genome-wide association analysis |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c442t-bd3cc037d5fb94018af820259e0b6ae79f1fe84674301475b4d329e2eb6baddb3 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 https://doi.org/10.1007/s13258-019-00827-5 |
ORCID | 0000-0001-5751-5089 |
PMID | 31062262 |
PQID | 2264627351 |
PQPubID | 2044424 |
PageCount | 13 |
ParticipantIDs | nrf_kci_oai_kci_go_kr_ARTI_5967132 proquest_miscellaneous_2305177708 proquest_miscellaneous_2232133281 proquest_journals_2264627351 pubmed_primary_31062262 crossref_citationtrail_10_1007_s13258_019_00827_5 crossref_primary_10_1007_s13258_019_00827_5 springer_journals_10_1007_s13258_019_00827_5 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2019-08-01 |
PublicationDateYYYYMMDD | 2019-08-01 |
PublicationDate_xml | – month: 08 year: 2019 text: 2019-08-01 day: 01 |
PublicationDecade | 2010 |
PublicationPlace | Singapore |
PublicationPlace_xml | – name: Singapore – name: Korea (South) – name: Heidelberg |
PublicationTitle | Genes & genomics |
PublicationTitleAbbrev | Genes Genom |
PublicationTitleAlternate | Genes Genomics |
PublicationYear | 2019 |
Publisher | Springer Singapore Springer Nature B.V 한국유전학회 |
Publisher_xml | – name: Springer Singapore – name: Springer Nature B.V – name: 한국유전학회 |
References | MiclausKWolfingerRVegaSChiericiMFurlanelloCLambertCHongHZhangLYinSGoodsaidFBatch effects in the BRLMM genotype calling algorithm influence GWAS results for the Affymetrix 500 K arrayPharmacogenom J201010433634610.1038/tpj.2010.361:CAS:528:DC%2BC3cXpsFWktrg%3D BrowningBLYuZSimultaneous genotype calling and haplotype phasing improves genotype accuracy and reduces false-positive associations for genome-wide association studiesAm J Hum Genet200985684786110.1016/j.ajhg.2009.11.0041:CAS:528:DC%2BC3cXht1eqt7w%3D199310402790566 ChaiHSTherneauTMBaileyKRKocherJ-PASpatial normalization improves the quality of genotype calling for Affymetrix SNP 6.0 arraysBMC Bioinf201011135610.1186/1471-2105-11-3561:CAS:528:DC%2BC3cXos1aktrw%3D KruskalWHWallisWAUse of ranks in one-criterion variance analysisJ Am Stat Assoc19524726058362110.1080/01621459.1952.10483441 SpencerCCSuZDonnellyPMarchiniJDesigning genome-wide association studies: sample size, power, imputation, and the choice of genotyping chipPLoS Genet200955e100047710.1371/journal.pgen.10004771:CAS:528:DC%2BD1MXmsVyhu7s%3D194920152688469 LeekJTScharpfRBBravoHCSimchaDLangmeadBJohnsonWEGemanDBaggerlyKIrizarryRATackling the widespread and critical impact of batch effects in high-throughput dataNat Rev Genet2010111073373910.1038/nrg28251:CAS:528:DC%2BC3cXhtFyju7%2FK20838408 WinbladBPalmerKKivipeltoMJelicVFratiglioniLWahlundLONordbergABäckmanLAlbertMAlmkvistOMild cognitive impairment–beyond controversies, towards a consensus: report of the International Working Group on Mild Cognitive ImpairmentJ Intern Med2004256324024610.1111/j.1365-2796.2004.01380.x1:STN:280:DC%2BD2cvhvFCrsQ%3D%3D NishidaNKoikeATajimaAOgasawaraYIshibashiYUeharaYInoueITokunagaKEvaluating the performance of Affymetrix SNP Array 6.0 platform with 400 Japanese individualsBMC Genom20089143110.1186/1471-2164-9-4311:CAS:528:DC%2BD1cXhtlejtb%2FO Affymetrix I (2013) Axiom® genotyping solution data analysis guide. URLhttp://media.affymetrix.com/support/downloads/manuals/axiom_genotyping_solution_analysis_guide.pdf. Accessed 29 Mar 2016 RitchieMELiuRCarvalhoBSIrizarryRAComparing genotyping algorithms for Illumina's Infinium whole-genome SNP BeadChipsBMC Bioinformatics201110.1186/1471-2105-12-68213854243063825 McKhannGDrachmanDFolsteinMKatzmanRPriceDStadlanEMClinical diagnosis of Alzheimer’s disease: report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer’s DiseaseNeurology198434793994410.1212/WNL.34.7.9391:STN:280:DyaL2c3ks1altQ%3D%3D66108416610841 Dodge Y (2012) Statistical data analysis based on the L1-norm and related methods: Birkhäuser, Basel HaoKLiCRosenowCWongWHEstimation of genotype error rate using samples with pedigree information—an application on the GeneChip Mapping 10 K arrayGenomics200484462363010.1016/j.ygeno.2004.05.0031:CAS:528:DC%2BD2cXotVylsLc%3D15475239 CariasoMLennonGSNPedia: a wiki supporting personal genome annotation, interpretation and analysisNucl Acids Res201240D1D1308D131210.1093/nar/gkr7981:CAS:528:DC%2BC3MXhs12htbnE22140107 JamesGTests of linear hypotheses in univariate and multivariate analysis when the ratios of the population variances are unknownBiometrika1954411/2194310.2307/2333003 Pillai K (1985) Multivariate analysis of variance (MANOVA). Encyclop Stat Sci Affymetrix I (2015) SNPolisher User Guide (Version 1.5.2), pp 1–104. https://tools.thermofisher.com/content/sfs/manuals/SNPolisher_User_Guide.pdf. Accessed 24 April 2017 McKhannGDrachmanDFolsteinMKatzmanRPriceDStadlanEMClinical diagnosis of Alzheimer’s disease Report of the NINCDS-ADRDA Work Group* under the auspices of Department of Health and Human Services Task Force on Alzheimer’s DiseaseNeurology198434793910.1212/WNL.34.7.9391:STN:280:DyaL2c3ks1altQ%3D%3D66108416610841 AndersonCAPetterssonFHClarkeGMCardonLRMorrisAPZondervanKTData quality control in genetic case-control association studiesNat Protoc2010591564157310.1038/nprot.2010.1161:CAS:528:DC%2BC3cXht1aksLvN210851223025522 Scherer A (2009) Batch effects and noise in microarray experiments: sources and solutions, vol 868. Wiley MoskvinaVCraddockNHolmansPOwenMJO’DonovanMCEffects of differential genotyping error rate on the type I error probability of case-control studiesHum Hered2006611556410.1159/00009255316612103 HongHSuZGeWShiLPerkinsRFangHXuJChenJJHanTKaputJAssessing batch effects of genotype calling algorithm BRLMM for the Affymetrix GeneChip Human Mapping 500 K array set using 270 HapMap samplesBMC Bioinf200899S1710.1186/1471-2105-9-S9-S171:CAS:528:DC%2BD1cXhsFeis7nK 827_CR20 827_CR1 H Hong (827_CR9) 2008; 9 827_CR2 WH Kruskal (827_CR11) 1952; 47 ME Ritchie (827_CR24) 2011 827_CR7 HS Chai (827_CR6) 2010; 11 CA Anderson (827_CR3) 2010; 5 V Moskvina (827_CR18) 2006; 61 N Nishida (827_CR19) 2008; 9 CC Spencer (827_CR21) 2009; 5 M Cariaso (827_CR5) 2012; 40 K Miclaus (827_CR17) 2010; 10 JT Leek (827_CR12) 2010; 11 B Winblad (827_CR22) 2004; 256 BL Browning (827_CR4) 2009; 85 G McKhann (827_CR16) 1984; 34 G McKhann (827_CR15) 1984; 34 G James (827_CR10) 1954; 41 K Hao (827_CR8) 2004; 84 827_CR23 |
References_xml | – reference: SpencerCCSuZDonnellyPMarchiniJDesigning genome-wide association studies: sample size, power, imputation, and the choice of genotyping chipPLoS Genet200955e100047710.1371/journal.pgen.10004771:CAS:528:DC%2BD1MXmsVyhu7s%3D194920152688469 – reference: JamesGTests of linear hypotheses in univariate and multivariate analysis when the ratios of the population variances are unknownBiometrika1954411/2194310.2307/2333003 – reference: ChaiHSTherneauTMBaileyKRKocherJ-PASpatial normalization improves the quality of genotype calling for Affymetrix SNP 6.0 arraysBMC Bioinf201011135610.1186/1471-2105-11-3561:CAS:528:DC%2BC3cXos1aktrw%3D – reference: MiclausKWolfingerRVegaSChiericiMFurlanelloCLambertCHongHZhangLYinSGoodsaidFBatch effects in the BRLMM genotype calling algorithm influence GWAS results for the Affymetrix 500 K arrayPharmacogenom J201010433634610.1038/tpj.2010.361:CAS:528:DC%2BC3cXpsFWktrg%3D – reference: Affymetrix I (2013) Axiom® genotyping solution data analysis guide. URLhttp://media.affymetrix.com/support/downloads/manuals/axiom_genotyping_solution_analysis_guide.pdf. Accessed 29 Mar 2016 – reference: Affymetrix I (2015) SNPolisher User Guide (Version 1.5.2), pp 1–104. https://tools.thermofisher.com/content/sfs/manuals/SNPolisher_User_Guide.pdf. Accessed 24 April 2017 – reference: McKhannGDrachmanDFolsteinMKatzmanRPriceDStadlanEMClinical diagnosis of Alzheimer’s disease: report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer’s DiseaseNeurology198434793994410.1212/WNL.34.7.9391:STN:280:DyaL2c3ks1altQ%3D%3D66108416610841 – reference: HongHSuZGeWShiLPerkinsRFangHXuJChenJJHanTKaputJAssessing batch effects of genotype calling algorithm BRLMM for the Affymetrix GeneChip Human Mapping 500 K array set using 270 HapMap samplesBMC Bioinf200899S1710.1186/1471-2105-9-S9-S171:CAS:528:DC%2BD1cXhsFeis7nK – reference: HaoKLiCRosenowCWongWHEstimation of genotype error rate using samples with pedigree information—an application on the GeneChip Mapping 10 K arrayGenomics200484462363010.1016/j.ygeno.2004.05.0031:CAS:528:DC%2BD2cXotVylsLc%3D15475239 – reference: MoskvinaVCraddockNHolmansPOwenMJO’DonovanMCEffects of differential genotyping error rate on the type I error probability of case-control studiesHum Hered2006611556410.1159/00009255316612103 – reference: CariasoMLennonGSNPedia: a wiki supporting personal genome annotation, interpretation and analysisNucl Acids Res201240D1D1308D131210.1093/nar/gkr7981:CAS:528:DC%2BC3MXhs12htbnE22140107 – reference: NishidaNKoikeATajimaAOgasawaraYIshibashiYUeharaYInoueITokunagaKEvaluating the performance of Affymetrix SNP Array 6.0 platform with 400 Japanese individualsBMC Genom20089143110.1186/1471-2164-9-4311:CAS:528:DC%2BD1cXhtlejtb%2FO – reference: RitchieMELiuRCarvalhoBSIrizarryRAComparing genotyping algorithms for Illumina's Infinium whole-genome SNP BeadChipsBMC Bioinformatics201110.1186/1471-2105-12-68213854243063825 – reference: Scherer A (2009) Batch effects and noise in microarray experiments: sources and solutions, vol 868. Wiley – reference: KruskalWHWallisWAUse of ranks in one-criterion variance analysisJ Am Stat Assoc19524726058362110.1080/01621459.1952.10483441 – reference: BrowningBLYuZSimultaneous genotype calling and haplotype phasing improves genotype accuracy and reduces false-positive associations for genome-wide association studiesAm J Hum Genet200985684786110.1016/j.ajhg.2009.11.0041:CAS:528:DC%2BC3cXht1eqt7w%3D199310402790566 – reference: WinbladBPalmerKKivipeltoMJelicVFratiglioniLWahlundLONordbergABäckmanLAlbertMAlmkvistOMild cognitive impairment–beyond controversies, towards a consensus: report of the International Working Group on Mild Cognitive ImpairmentJ Intern Med2004256324024610.1111/j.1365-2796.2004.01380.x1:STN:280:DC%2BD2cvhvFCrsQ%3D%3D – reference: LeekJTScharpfRBBravoHCSimchaDLangmeadBJohnsonWEGemanDBaggerlyKIrizarryRATackling the widespread and critical impact of batch effects in high-throughput dataNat Rev Genet2010111073373910.1038/nrg28251:CAS:528:DC%2BC3cXhtFyju7%2FK20838408 – reference: Dodge Y (2012) Statistical data analysis based on the L1-norm and related methods: Birkhäuser, Basel – reference: McKhannGDrachmanDFolsteinMKatzmanRPriceDStadlanEMClinical diagnosis of Alzheimer’s disease Report of the NINCDS-ADRDA Work Group* under the auspices of Department of Health and Human Services Task Force on Alzheimer’s DiseaseNeurology198434793910.1212/WNL.34.7.9391:STN:280:DyaL2c3ks1altQ%3D%3D66108416610841 – reference: Pillai K (1985) Multivariate analysis of variance (MANOVA). Encyclop Stat Sci – reference: AndersonCAPetterssonFHClarkeGMCardonLRMorrisAPZondervanKTData quality control in genetic case-control association studiesNat Protoc2010591564157310.1038/nprot.2010.1161:CAS:528:DC%2BC3cXht1aksLvN210851223025522 – ident: 827_CR2 – volume: 9 start-page: S17 issue: 9 year: 2008 ident: 827_CR9 publication-title: BMC Bioinf doi: 10.1186/1471-2105-9-S9-S17 – volume: 11 start-page: 733 issue: 10 year: 2010 ident: 827_CR12 publication-title: Nat Rev Genet doi: 10.1038/nrg2825 – ident: 827_CR1 – volume: 61 start-page: 55 issue: 1 year: 2006 ident: 827_CR18 publication-title: Hum Hered doi: 10.1159/000092553 – volume: 5 start-page: e1000477 issue: 5 year: 2009 ident: 827_CR21 publication-title: PLoS Genet doi: 10.1371/journal.pgen.1000477 – volume: 5 start-page: 1564 issue: 9 year: 2010 ident: 827_CR3 publication-title: Nat Protoc doi: 10.1038/nprot.2010.116 – volume: 47 start-page: 583 issue: 260 year: 1952 ident: 827_CR11 publication-title: J Am Stat Assoc doi: 10.1080/01621459.1952.10483441 – ident: 827_CR20 – volume: 41 start-page: 19 issue: 1/2 year: 1954 ident: 827_CR10 publication-title: Biometrika doi: 10.2307/2333003 – volume: 85 start-page: 847 issue: 6 year: 2009 ident: 827_CR4 publication-title: Am J Hum Genet doi: 10.1016/j.ajhg.2009.11.004 – volume: 34 start-page: 939 issue: 7 year: 1984 ident: 827_CR15 publication-title: Neurology doi: 10.1212/WNL.34.7.939 – volume: 40 start-page: D1308 issue: D1 year: 2012 ident: 827_CR5 publication-title: Nucl Acids Res doi: 10.1093/nar/gkr798 – ident: 827_CR23 doi: 10.1002/9780470685983 – volume: 84 start-page: 623 issue: 4 year: 2004 ident: 827_CR8 publication-title: Genomics doi: 10.1016/j.ygeno.2004.05.003 – year: 2011 ident: 827_CR24 publication-title: BMC Bioinformatics doi: 10.1186/1471-2105-12-68 – volume: 9 start-page: 431 issue: 1 year: 2008 ident: 827_CR19 publication-title: BMC Genom doi: 10.1186/1471-2164-9-431 – volume: 11 start-page: 356 issue: 1 year: 2010 ident: 827_CR6 publication-title: BMC Bioinf doi: 10.1186/1471-2105-11-356 – volume: 34 start-page: 939 issue: 7 year: 1984 ident: 827_CR16 publication-title: Neurology doi: 10.1212/WNL.34.7.939 – ident: 827_CR7 – volume: 10 start-page: 336 issue: 4 year: 2010 ident: 827_CR17 publication-title: Pharmacogenom J doi: 10.1038/tpj.2010.36 – volume: 256 start-page: 240 issue: 3 year: 2004 ident: 827_CR22 publication-title: J Intern Med doi: 10.1111/j.1365-2796.2004.01380.x |
SSID | ssj0000314641 |
Score | 2.236834 |
Snippet | Background
In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is... In genetic analyses, the term 'batch effect' refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most... BackgroundIn genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is... BACKGROUND: In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is... Background In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is... |
SourceID | nrf proquest pubmed crossref springer |
SourceType | Open Website Aggregation Database Index Database Enrichment Source Publisher |
StartPage | 927 |
SubjectTerms | algorithms Alzheimer disease Alzheimer Disease - genetics Analysis of Variance Animal Genetics and Genomics Biomedical and Life Sciences Cognitive ability cognitive disorders Cognitive Dysfunction - genetics Genetic analysis Genetic Heterogeneity Genome-Wide Association Study - methods Genome-Wide Association Study - standards genotype Genotypes Genotyping Genotyping Techniques - methods Genotyping Techniques - standards Human Genetics Humans Life Sciences Microbial Genetics and Genomics Multivariate analysis patients Plant Genetics and Genomics Polymorphism, Single Nucleotide Quality control Research Article Single-nucleotide polymorphism Statistical analysis 생물학 |
Title | SNP genotype calling and quality control for multi-batch-based studies |
URI | https://link.springer.com/article/10.1007/s13258-019-00827-5 https://www.ncbi.nlm.nih.gov/pubmed/31062262 https://www.proquest.com/docview/2264627351 https://www.proquest.com/docview/2232133281 https://www.proquest.com/docview/2305177708 https://www.kci.go.kr/kciportal/ci/sereArticleSearch/ciSereArtiView.kci?sereArticleSearchBean.artiId=ART002492841 |
Volume | 41 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
ispartofPNX | Genes & Genomics, 2019, 41(8), , pp.927-939 |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LT9wwEB7xEFIvqAXahpdcxI1aSpw4jo-rii0FFSHBSnCy4kfaCpRFu8uBf89MHrtCPKSecsgkccbjmW_seQAcOhk7X8YaV5rOOGq_hFuVlByNja4cCohu-qf8Ps9PRtnptbzuksKmfbR7fyTZaOpFslsqJAVeaU52S3G5DKsSfXeS65EYzHdWqCB73rSsTNDWci1V0mXLvP6aZxZpuZ5Ur4HNFweljf0ZfoT1DjiyQTvTn2Ap1Buw1raSfNyE4eX5BaOCq7SnypDxlGbOytqzNm3ykXVB6QxRKmvCCLlFNfyXkx3zbNrGE27BaHh89eOEdz0SuMsyMePWp87FqfKyshp9paKskM3o04TY5mVQukqqQBgjI99JSZv5VOgggs0tqjabfoaVelyHr8CEyxPvZdDaF1mQyhI68WXqM20LH-sIkp5PxnUFxKmPxZ1ZlD4m3hrkrWl4a2QER_Nn7tvyGe9SHyD7za37Z6jqNV3_jM3txCC2_2WkztGjFhHs9rNjuvU2NZQOnCMSk0kE3-a3caXQ8UdZh_ED0aQCPXJRvEeTUs0ypeIigi_tzM_HjUA4x-_gAL73orAYwNs_tf1_5DvwQTTCSTK6CyuzyUPYQ9wzs_uwOvh5c3a834j7E_sF9bc |
linkProvider | Springer Nature |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlR3JbtQw9IlOheBC2QkUMIgbuMpix_GxqhimtB0h0ZHKyYqXABqUQbMc2q_ve1lmBJRKPeWQl8R5-7PfAvDOydj5MtYoaVpw1H4JtyopORobXTlkEN3MTzkZ56OJ-Hwmz7qisEWf7d4fSTaaelPslqWSEq80J7uluNyCbYExuBzA9v6nb0ebvRVqyZ43QysTtLZcS5V09TJXv-gPm7RVz6ur3M1_jkobCzTcgUm_9jbxZLq3Wto9d_FXW8eb_tx9uNe5pGy_5aEHcCvUD-F2O6Ty_BEMv46_MGrlSru1DElKBeysrD1rCzLPWZfuztD_ZU2CIreo4H9wspCeLdpMxccwGX48PRjxbvoCd0KkS2595lycKS8rqzEKK8oKCYjRUohtXgalq6QK5L0IisqUtMJnqQ5psLlFpWmzJzCoZ3V4Bix1eeK9DFr7QgSpLPk9vsy80LbwsY4g6fFvXNeanCZk_DKbpsqEH4P4MQ1-jIzg_fqZ321jjmuh3yJZzdT9NNRPm67fZ2Y6Nxg1HBqpc4zV0wh2e6qbTpIXhgqNc_TxZBLBm_VtlEE6WCnrMFsRTJZirJ8W18Fk1A1NqbiI4GnLUet1o4ud43dwAR967tgs4P8_9fxm4K_hzuj05NgcH46PXsDdtGE24rldGCznq_ASvaulfdUJ0yWO9RQg |
linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3dT9RAEJ8IRuOLEVGsIq7GN9xcd9vtdh-JcgGFCwlewtum-1EwmB45jgf-e2fa3h2Gj4SnPnTabmdmd36zOx8AX71KfahSgzPN5BxXP8GdFhVHY2Nqjwpi2v4ph6Nib5z_PFEnN7L422j3-ZFkl9NAVZqa2eAi1INl4lsmFQVhGU42THO1Ak9xORYU1DWWO4tdFirOXrTtKwXaXW6UFn3mzN2v-c86rTTT-i7geevQtLVFw1fwsgeRbKeT-ho8ic1reNa1lbxeh-Hx6IhR8VXaX2UoBEo5Z1UTWJdCec36AHWGiJW1IYXc4ZJ8xsmmBXbZxRa-gfFw9_f3Pd73S-A-z-WMu5B5n2Y6qNoZ9JvKqkaWo38TU1dUUZta1JHwRk5-lFYuD5k0UUZXOFzmXPYWVptJE98Bk74QIahoTCjzqLQjpBKqLOTGlSE1CYg5n6zvi4lTT4u_dlkGmXhrkbe25a1VCWwvnrnoSmk8SP0F2W_P_R9LFbDpejqx51OLOH_fKlOgdy0T2JxLx_Zz79JSanCBqEyJBD4vbuOsoaOQqomTK6LJJHrnsnyIJqP6ZVqnZQIbneQX40ZQXOB3cADf5qqwHMD9P_X-ceSf4PnRj6E92B_9-gAvZKunpK6bsDqbXsWPCIdmbqvV-H9qL_tS |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=SNP+genotype+calling+and+quality+control+for+multi-batch-based+studies&rft.jtitle=Genes+%26+genomics&rft.au=Seo%2C+Sujin&rft.au=Park%2C+Kyungtaek&rft.au=Lee%2C+Jang+Jae&rft.au=Choi%2C+Kyu+Yeong&rft.date=2019-08-01&rft.issn=2092-9293&rft.eissn=2092-9293&rft.volume=41&rft.issue=8&rft.spage=927&rft_id=info:doi/10.1007%2Fs13258-019-00827-5&rft.externalDBID=NO_FULL_TEXT |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1976-9571&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1976-9571&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1976-9571&client=summon |