SNP genotype calling and quality control for multi-batch-based studies

Background In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by...

Full description

Saved in:
Bibliographic Details
Published inGenes & genomics Vol. 41; no. 8; pp. 927 - 939
Main Authors Seo, Sujin, Park, Kyungtaek, Lee, Jang Jae, Choi, Kyu Yeong, Lee, Kun Ho, Won, Sungho
Format Journal Article
LanguageEnglish
Published Singapore Springer Singapore 01.08.2019
Springer Nature B.V
한국유전학회
Subjects
Online AccessGet full text
ISSN1976-9571
2092-9293
2092-9293
DOI10.1007/s13258-019-00827-5

Cover

Loading…
Abstract Background In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required. Methods In this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately. Results The proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer’s disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects. Conclusions We proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification.
AbstractList BackgroundIn genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required.MethodsIn this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately.ResultsThe proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer’s disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects.ConclusionsWe proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification.
In genetic analyses, the term 'batch effect' refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required. In this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately. The proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer's disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects. We proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification.
Background In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required. Methods In this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately. Results The proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer’s disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects. Conclusions We proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification. KCI Citation Count: 0
In genetic analyses, the term 'batch effect' refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required.BACKGROUNDIn genetic analyses, the term 'batch effect' refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required.In this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately.METHODSIn this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately.The proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer's disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects.RESULTSThe proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer's disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects.We proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification.CONCLUSIONSWe proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification.
BACKGROUND: In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required. METHODS: In this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately. RESULTS: The proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer’s disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects. CONCLUSIONS: We proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification.
Background In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most important step in quality control (QC) processes that precede analyses. Currently, batch effects are not appropriately controlled by statistics, and newer approaches are required. Methods In this report, we propose a new method to detect the heterogeneity of probe intensities among different batches and a procedure for calling genotypes and QC in the presence of a batch effect. First, we conducted a multivariate analysis of variance (MANOVA) to test the differences in probe intensities among batches. If heterogeneity is detected, subjects should be clustered using a K-medoid algorithm using the averages of the probe intensity measurements for each batch and the genotypes of subjects in different clusters should be called separately. Results The proposed method was used to assess genotyping data of 3619 subjects consisting of 1074 patients with Alzheimer’s disease, 296 with mild cognitive impairment (MCI), and 1153 controls. The proposed method improves the accuracy of called genotypes without the need to filter a lot of subjects and SNPs, and therefore is a reasonable approach for controlling batch effects. Conclusions We proposed a new strategy that detects batch effects with probe intensity measurement and calls genotypes in the presence of batch effects. The application of the proposed method to real data shows that it produces a balanced approach. Furthermore, the proposed method can be extended to various scenarios with a simple modification.
Author Lee, Jang Jae
Seo, Sujin
Lee, Kun Ho
Choi, Kyu Yeong
Won, Sungho
Park, Kyungtaek
Author_xml – sequence: 1
  givenname: Sujin
  surname: Seo
  fullname: Seo, Sujin
  organization: Department of Public Health Science, Graduate School of Public Health, Seoul National University
– sequence: 2
  givenname: Kyungtaek
  surname: Park
  fullname: Park, Kyungtaek
  organization: Interdisciplinary Program of Bioinformatics, College of National Sciences, Seoul National University
– sequence: 3
  givenname: Jang Jae
  surname: Lee
  fullname: Lee, Jang Jae
  organization: National Research Center for Dementia, Chosun University
– sequence: 4
  givenname: Kyu Yeong
  surname: Choi
  fullname: Choi, Kyu Yeong
  organization: National Research Center for Dementia, Chosun University, Premedical Science, School of Medicine, Chousn University
– sequence: 5
  givenname: Kun Ho
  surname: Lee
  fullname: Lee, Kun Ho
  email: leekho@chosun.ac.kr
  organization: National Research Center for Dementia, Chosun University, Department of Biomedical Science, College of Natural Sciences, Chosun University
– sequence: 6
  givenname: Sungho
  orcidid: 0000-0001-5751-5089
  surname: Won
  fullname: Won, Sungho
  email: sunghow@gmail.com
  organization: Department of Public Health Science, Graduate School of Public Health, Seoul National University, Interdisciplinary Program of Bioinformatics, College of National Sciences, Seoul National University, Institute of Health and Environment, Seoul National University
BackLink https://www.ncbi.nlm.nih.gov/pubmed/31062262$$D View this record in MEDLINE/PubMed
https://www.kci.go.kr/kciportal/ci/sereArticleSearch/ciSereArtiView.kci?sereArticleSearchBean.artiId=ART002492841$$DAccess content in National Research Foundation of Korea (NRF)
BookMark eNqFkU1PFTEUhhuDkQvyB1yYSdzoYrQ9_ZouCQEhIWoU102n07kW5raXtrO4_95eBmLCArvo2TzPSfu-R-ggxOAQekfwZ4Kx_JIJBd61mKgW4w5ky1-hFWAFrQJFD9CKKClaxSU5RCc53-J6KGGCkTfokBIsAASs0MWvbz-atQux7LausWaafFg3JgzN_WwmX3aNjaGkODVjTM1mnopve1Psn3pnNzS5zIN3-S16PZopu5PHeYx-X5zfnF2219-_Xp2dXreWMShtP1BrMZUDH3vFMOnM2AEGrhzuhXFSjWR0HROSUUyY5D0bKCgHrhe9GYaeHqNPy96QRn1nvY7GP8x11HdJn_68udJcCVnDqezHhd2meD-7XPTGZ-umyQQX56yBYk6klLj7PwoUCKXQkYp-eIbexjmF-ulKCSZAUr6n3j9Sc79xg94mvzFpp5-Cr0C3ADbFnJMbtfXFFL8P2_hJE6z3NeulZl1r1g81a15VeKY-bX9RoouUKxzWLv179gvWXy7Lto4
CitedBy_id crossref_primary_10_1093_cercor_bhac483
crossref_primary_10_1002_gepi_22586
crossref_primary_10_1038_s41598_022_07567_9
crossref_primary_10_1038_s41440_024_01784_7
crossref_primary_10_1093_bib_bbae389
crossref_primary_10_1016_j_anai_2021_11_022
crossref_primary_10_1016_j_jhazmat_2024_136574
crossref_primary_10_3803_EnM_2021_1241
crossref_primary_10_1038_s41398_024_02777_3
crossref_primary_10_4168_aair_2021_13_4_609
crossref_primary_10_4093_dmj_2021_0375
crossref_primary_10_1186_s12916_022_02723_4
crossref_primary_10_1016_j_envint_2024_108709
crossref_primary_10_1007_s10120_022_01285_x
crossref_primary_10_1038_s41598_022_24766_6
crossref_primary_10_1186_s13059_024_03288_6
crossref_primary_10_3389_fendo_2024_1375459
Cites_doi 10.1186/1471-2105-9-S9-S17
10.1038/nrg2825
10.1159/000092553
10.1371/journal.pgen.1000477
10.1038/nprot.2010.116
10.1080/01621459.1952.10483441
10.2307/2333003
10.1016/j.ajhg.2009.11.004
10.1212/WNL.34.7.939
10.1093/nar/gkr798
10.1002/9780470685983
10.1016/j.ygeno.2004.05.003
10.1186/1471-2105-12-68
10.1186/1471-2164-9-431
10.1186/1471-2105-11-356
10.1038/tpj.2010.36
10.1111/j.1365-2796.2004.01380.x
ContentType Journal Article
Copyright The Genetics Society of Korea 2019
Copyright Springer Nature B.V. 2019
Copyright_xml – notice: The Genetics Society of Korea 2019
– notice: Copyright Springer Nature B.V. 2019
DBID AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
7X8
7S9
L.6
ACYCR
DOI 10.1007/s13258-019-00827-5
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
MEDLINE - Academic
AGRICOLA
AGRICOLA - Academic
Korean Citation Index
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
AGRICOLA
AGRICOLA - Academic
DatabaseTitleList
MEDLINE

MEDLINE - Academic
AGRICOLA

Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 2092-9293
EndPage 939
ExternalDocumentID oai_kci_go_kr_ARTI_5967132
31062262
10_1007_s13258_019_00827_5
Genre Research Support, Non-U.S. Gov't
Journal Article
GroupedDBID ---
-EM
06D
0R~
0VY
1N0
203
29~
2KG
30V
4.4
406
408
40D
5GY
67N
96X
9ZL
AACDK
AAHBH
AAHNG
AAIAL
AAJBT
AAJKR
AANZL
AARTL
AASML
AATNV
AATVU
AAUYE
AAWCG
AAYIU
AAYQN
AAYTO
AAYZH
AAZMS
ABAKF
ABDZT
ABECU
ABFTV
ABJNI
ABJOX
ABKCH
ABMQK
ABPLI
ABQBU
ABSXP
ABTEG
ABTHY
ABTKH
ABTMW
ABXPI
ACAOD
ACDTI
ACGFS
ACHSB
ACKNC
ACMDZ
ACMLO
ACOKC
ACPIV
ACPRK
ACZOJ
ADHHG
ADHIR
ADINQ
ADKNI
ADKPE
ADRFC
ADTPH
ADURQ
ADYFF
ADZKW
AEFQL
AEGNC
AEJHL
AEJRE
AEMSY
AENEX
AEOHA
AEPYU
AESKC
AETCA
AEVLU
AEXYK
AFBBN
AFQWF
AFWTZ
AFZKB
AGAYW
AGDGC
AGMZJ
AGQEE
AGQMX
AGRTI
AGWZB
AGYKE
AHAVH
AHBYD
AHYZX
AIAKS
AIGIU
AIIXL
AILAN
AITGF
AJRNO
AJZVZ
AKMHD
ALFXC
ALMA_UNASSIGNED_HOLDINGS
AMKLP
AMXSW
AMYLF
AMYQR
ANMIH
AOCGG
AXYYD
BGNMA
CSCUP
DDRTE
DNIVK
DPUIP
DU5
EBLON
EBS
EIOEI
EJD
ESBYG
FERAY
FFXSO
FIGPU
FINBP
FNLPD
FRRFC
FSGXE
FYJPI
GGCAI
GGRSB
GJIRD
GQ6
GQ7
HMJXF
HRMNR
HZB
I0C
IAO
IGS
IHR
IKXTQ
IWAJR
IXD
J-C
J0Z
JBSCW
JZLTJ
KOV
LLZTM
M4Y
NPVJJ
NQJWS
NU0
O9J
PT4
R9I
RLLFE
ROL
RSV
S27
S3A
S3B
SBL
SHX
SISQX
SJYHP
SNE
SNPRN
SNX
SOHCF
SOJ
SPISZ
SRMVM
SSLCW
SSXJD
STPWE
T13
TSG
U2A
U9L
UG4
UOJIU
UTJUX
UZXMN
VC2
VFIZW
W48
WK8
Z45
Z7U
ZMTXR
ZOVNA
2VQ
AANXM
AAPKM
AARHV
AAYXX
ABBRH
ABDBE
ABFSG
ACSTC
AEBTG
AEZWR
AFDZB
AFHIU
AFLOW
AFOHR
AGJBK
AHPBZ
AHSBF
AHWEU
AIXLP
AJBLW
ATHPR
AYFIA
CITATION
H13
HF~
HZ~
O9-
S1Z
CGR
CUY
CVF
ECM
EIF
NPM
ABRTQ
7X8
7S9
L.6
ACYCR
ID FETCH-LOGICAL-c442t-bd3cc037d5fb94018af820259e0b6ae79f1fe84674301475b4d329e2eb6baddb3
IEDL.DBID U2A
ISSN 1976-9571
2092-9293
IngestDate Sun Mar 09 07:52:27 EDT 2025
Fri Jul 11 10:17:46 EDT 2025
Fri Jul 11 10:06:00 EDT 2025
Fri Jul 25 11:18:05 EDT 2025
Wed Feb 19 02:30:49 EST 2025
Thu Apr 24 22:51:44 EDT 2025
Tue Jul 01 03:53:18 EDT 2025
Fri Feb 21 02:38:46 EST 2025
IsPeerReviewed true
IsScholarly true
Issue 8
Keywords Calling
K-medoid clustering
Batch effect
Quality control
Genome-wide association analysis
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c442t-bd3cc037d5fb94018af820259e0b6ae79f1fe84674301475b4d329e2eb6baddb3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
https://doi.org/10.1007/s13258-019-00827-5
ORCID 0000-0001-5751-5089
PMID 31062262
PQID 2264627351
PQPubID 2044424
PageCount 13
ParticipantIDs nrf_kci_oai_kci_go_kr_ARTI_5967132
proquest_miscellaneous_2305177708
proquest_miscellaneous_2232133281
proquest_journals_2264627351
pubmed_primary_31062262
crossref_citationtrail_10_1007_s13258_019_00827_5
crossref_primary_10_1007_s13258_019_00827_5
springer_journals_10_1007_s13258_019_00827_5
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2019-08-01
PublicationDateYYYYMMDD 2019-08-01
PublicationDate_xml – month: 08
  year: 2019
  text: 2019-08-01
  day: 01
PublicationDecade 2010
PublicationPlace Singapore
PublicationPlace_xml – name: Singapore
– name: Korea (South)
– name: Heidelberg
PublicationTitle Genes & genomics
PublicationTitleAbbrev Genes Genom
PublicationTitleAlternate Genes Genomics
PublicationYear 2019
Publisher Springer Singapore
Springer Nature B.V
한국유전학회
Publisher_xml – name: Springer Singapore
– name: Springer Nature B.V
– name: 한국유전학회
References MiclausKWolfingerRVegaSChiericiMFurlanelloCLambertCHongHZhangLYinSGoodsaidFBatch effects in the BRLMM genotype calling algorithm influence GWAS results for the Affymetrix 500 K arrayPharmacogenom J201010433634610.1038/tpj.2010.361:CAS:528:DC%2BC3cXpsFWktrg%3D
BrowningBLYuZSimultaneous genotype calling and haplotype phasing improves genotype accuracy and reduces false-positive associations for genome-wide association studiesAm J Hum Genet200985684786110.1016/j.ajhg.2009.11.0041:CAS:528:DC%2BC3cXht1eqt7w%3D199310402790566
ChaiHSTherneauTMBaileyKRKocherJ-PASpatial normalization improves the quality of genotype calling for Affymetrix SNP 6.0 arraysBMC Bioinf201011135610.1186/1471-2105-11-3561:CAS:528:DC%2BC3cXos1aktrw%3D
KruskalWHWallisWAUse of ranks in one-criterion variance analysisJ Am Stat Assoc19524726058362110.1080/01621459.1952.10483441
SpencerCCSuZDonnellyPMarchiniJDesigning genome-wide association studies: sample size, power, imputation, and the choice of genotyping chipPLoS Genet200955e100047710.1371/journal.pgen.10004771:CAS:528:DC%2BD1MXmsVyhu7s%3D194920152688469
LeekJTScharpfRBBravoHCSimchaDLangmeadBJohnsonWEGemanDBaggerlyKIrizarryRATackling the widespread and critical impact of batch effects in high-throughput dataNat Rev Genet2010111073373910.1038/nrg28251:CAS:528:DC%2BC3cXhtFyju7%2FK20838408
WinbladBPalmerKKivipeltoMJelicVFratiglioniLWahlundLONordbergABäckmanLAlbertMAlmkvistOMild cognitive impairment–beyond controversies, towards a consensus: report of the International Working Group on Mild Cognitive ImpairmentJ Intern Med2004256324024610.1111/j.1365-2796.2004.01380.x1:STN:280:DC%2BD2cvhvFCrsQ%3D%3D
NishidaNKoikeATajimaAOgasawaraYIshibashiYUeharaYInoueITokunagaKEvaluating the performance of Affymetrix SNP Array 6.0 platform with 400 Japanese individualsBMC Genom20089143110.1186/1471-2164-9-4311:CAS:528:DC%2BD1cXhtlejtb%2FO
Affymetrix I (2013) Axiom® genotyping solution data analysis guide. URLhttp://media.affymetrix.com/support/downloads/manuals/axiom_genotyping_solution_analysis_guide.pdf. Accessed 29 Mar 2016
RitchieMELiuRCarvalhoBSIrizarryRAComparing genotyping algorithms for Illumina's Infinium whole-genome SNP BeadChipsBMC Bioinformatics201110.1186/1471-2105-12-68213854243063825
McKhannGDrachmanDFolsteinMKatzmanRPriceDStadlanEMClinical diagnosis of Alzheimer’s disease: report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer’s DiseaseNeurology198434793994410.1212/WNL.34.7.9391:STN:280:DyaL2c3ks1altQ%3D%3D66108416610841
Dodge Y (2012) Statistical data analysis based on the L1-norm and related methods: Birkhäuser, Basel
HaoKLiCRosenowCWongWHEstimation of genotype error rate using samples with pedigree information—an application on the GeneChip Mapping 10 K arrayGenomics200484462363010.1016/j.ygeno.2004.05.0031:CAS:528:DC%2BD2cXotVylsLc%3D15475239
CariasoMLennonGSNPedia: a wiki supporting personal genome annotation, interpretation and analysisNucl Acids Res201240D1D1308D131210.1093/nar/gkr7981:CAS:528:DC%2BC3MXhs12htbnE22140107
JamesGTests of linear hypotheses in univariate and multivariate analysis when the ratios of the population variances are unknownBiometrika1954411/2194310.2307/2333003
Pillai K (1985) Multivariate analysis of variance (MANOVA). Encyclop Stat Sci
Affymetrix I (2015) SNPolisher User Guide (Version 1.5.2), pp 1–104. https://tools.thermofisher.com/content/sfs/manuals/SNPolisher_User_Guide.pdf. Accessed 24 April 2017
McKhannGDrachmanDFolsteinMKatzmanRPriceDStadlanEMClinical diagnosis of Alzheimer’s disease Report of the NINCDS-ADRDA Work Group* under the auspices of Department of Health and Human Services Task Force on Alzheimer’s DiseaseNeurology198434793910.1212/WNL.34.7.9391:STN:280:DyaL2c3ks1altQ%3D%3D66108416610841
AndersonCAPetterssonFHClarkeGMCardonLRMorrisAPZondervanKTData quality control in genetic case-control association studiesNat Protoc2010591564157310.1038/nprot.2010.1161:CAS:528:DC%2BC3cXht1aksLvN210851223025522
Scherer A (2009) Batch effects and noise in microarray experiments: sources and solutions, vol 868. Wiley
MoskvinaVCraddockNHolmansPOwenMJO’DonovanMCEffects of differential genotyping error rate on the type I error probability of case-control studiesHum Hered2006611556410.1159/00009255316612103
HongHSuZGeWShiLPerkinsRFangHXuJChenJJHanTKaputJAssessing batch effects of genotype calling algorithm BRLMM for the Affymetrix GeneChip Human Mapping 500 K array set using 270 HapMap samplesBMC Bioinf200899S1710.1186/1471-2105-9-S9-S171:CAS:528:DC%2BD1cXhsFeis7nK
827_CR20
827_CR1
H Hong (827_CR9) 2008; 9
827_CR2
WH Kruskal (827_CR11) 1952; 47
ME Ritchie (827_CR24) 2011
827_CR7
HS Chai (827_CR6) 2010; 11
CA Anderson (827_CR3) 2010; 5
V Moskvina (827_CR18) 2006; 61
N Nishida (827_CR19) 2008; 9
CC Spencer (827_CR21) 2009; 5
M Cariaso (827_CR5) 2012; 40
K Miclaus (827_CR17) 2010; 10
JT Leek (827_CR12) 2010; 11
B Winblad (827_CR22) 2004; 256
BL Browning (827_CR4) 2009; 85
G McKhann (827_CR16) 1984; 34
G McKhann (827_CR15) 1984; 34
G James (827_CR10) 1954; 41
K Hao (827_CR8) 2004; 84
827_CR23
References_xml – reference: SpencerCCSuZDonnellyPMarchiniJDesigning genome-wide association studies: sample size, power, imputation, and the choice of genotyping chipPLoS Genet200955e100047710.1371/journal.pgen.10004771:CAS:528:DC%2BD1MXmsVyhu7s%3D194920152688469
– reference: JamesGTests of linear hypotheses in univariate and multivariate analysis when the ratios of the population variances are unknownBiometrika1954411/2194310.2307/2333003
– reference: ChaiHSTherneauTMBaileyKRKocherJ-PASpatial normalization improves the quality of genotype calling for Affymetrix SNP 6.0 arraysBMC Bioinf201011135610.1186/1471-2105-11-3561:CAS:528:DC%2BC3cXos1aktrw%3D
– reference: MiclausKWolfingerRVegaSChiericiMFurlanelloCLambertCHongHZhangLYinSGoodsaidFBatch effects in the BRLMM genotype calling algorithm influence GWAS results for the Affymetrix 500 K arrayPharmacogenom J201010433634610.1038/tpj.2010.361:CAS:528:DC%2BC3cXpsFWktrg%3D
– reference: Affymetrix I (2013) Axiom® genotyping solution data analysis guide. URLhttp://media.affymetrix.com/support/downloads/manuals/axiom_genotyping_solution_analysis_guide.pdf. Accessed 29 Mar 2016
– reference: Affymetrix I (2015) SNPolisher User Guide (Version 1.5.2), pp 1–104. https://tools.thermofisher.com/content/sfs/manuals/SNPolisher_User_Guide.pdf. Accessed 24 April 2017
– reference: McKhannGDrachmanDFolsteinMKatzmanRPriceDStadlanEMClinical diagnosis of Alzheimer’s disease: report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer’s DiseaseNeurology198434793994410.1212/WNL.34.7.9391:STN:280:DyaL2c3ks1altQ%3D%3D66108416610841
– reference: HongHSuZGeWShiLPerkinsRFangHXuJChenJJHanTKaputJAssessing batch effects of genotype calling algorithm BRLMM for the Affymetrix GeneChip Human Mapping 500 K array set using 270 HapMap samplesBMC Bioinf200899S1710.1186/1471-2105-9-S9-S171:CAS:528:DC%2BD1cXhsFeis7nK
– reference: HaoKLiCRosenowCWongWHEstimation of genotype error rate using samples with pedigree information—an application on the GeneChip Mapping 10 K arrayGenomics200484462363010.1016/j.ygeno.2004.05.0031:CAS:528:DC%2BD2cXotVylsLc%3D15475239
– reference: MoskvinaVCraddockNHolmansPOwenMJO’DonovanMCEffects of differential genotyping error rate on the type I error probability of case-control studiesHum Hered2006611556410.1159/00009255316612103
– reference: CariasoMLennonGSNPedia: a wiki supporting personal genome annotation, interpretation and analysisNucl Acids Res201240D1D1308D131210.1093/nar/gkr7981:CAS:528:DC%2BC3MXhs12htbnE22140107
– reference: NishidaNKoikeATajimaAOgasawaraYIshibashiYUeharaYInoueITokunagaKEvaluating the performance of Affymetrix SNP Array 6.0 platform with 400 Japanese individualsBMC Genom20089143110.1186/1471-2164-9-4311:CAS:528:DC%2BD1cXhtlejtb%2FO
– reference: RitchieMELiuRCarvalhoBSIrizarryRAComparing genotyping algorithms for Illumina's Infinium whole-genome SNP BeadChipsBMC Bioinformatics201110.1186/1471-2105-12-68213854243063825
– reference: Scherer A (2009) Batch effects and noise in microarray experiments: sources and solutions, vol 868. Wiley
– reference: KruskalWHWallisWAUse of ranks in one-criterion variance analysisJ Am Stat Assoc19524726058362110.1080/01621459.1952.10483441
– reference: BrowningBLYuZSimultaneous genotype calling and haplotype phasing improves genotype accuracy and reduces false-positive associations for genome-wide association studiesAm J Hum Genet200985684786110.1016/j.ajhg.2009.11.0041:CAS:528:DC%2BC3cXht1eqt7w%3D199310402790566
– reference: WinbladBPalmerKKivipeltoMJelicVFratiglioniLWahlundLONordbergABäckmanLAlbertMAlmkvistOMild cognitive impairment–beyond controversies, towards a consensus: report of the International Working Group on Mild Cognitive ImpairmentJ Intern Med2004256324024610.1111/j.1365-2796.2004.01380.x1:STN:280:DC%2BD2cvhvFCrsQ%3D%3D
– reference: LeekJTScharpfRBBravoHCSimchaDLangmeadBJohnsonWEGemanDBaggerlyKIrizarryRATackling the widespread and critical impact of batch effects in high-throughput dataNat Rev Genet2010111073373910.1038/nrg28251:CAS:528:DC%2BC3cXhtFyju7%2FK20838408
– reference: Dodge Y (2012) Statistical data analysis based on the L1-norm and related methods: Birkhäuser, Basel
– reference: McKhannGDrachmanDFolsteinMKatzmanRPriceDStadlanEMClinical diagnosis of Alzheimer’s disease Report of the NINCDS-ADRDA Work Group* under the auspices of Department of Health and Human Services Task Force on Alzheimer’s DiseaseNeurology198434793910.1212/WNL.34.7.9391:STN:280:DyaL2c3ks1altQ%3D%3D66108416610841
– reference: Pillai K (1985) Multivariate analysis of variance (MANOVA). Encyclop Stat Sci
– reference: AndersonCAPetterssonFHClarkeGMCardonLRMorrisAPZondervanKTData quality control in genetic case-control association studiesNat Protoc2010591564157310.1038/nprot.2010.1161:CAS:528:DC%2BC3cXht1aksLvN210851223025522
– ident: 827_CR2
– volume: 9
  start-page: S17
  issue: 9
  year: 2008
  ident: 827_CR9
  publication-title: BMC Bioinf
  doi: 10.1186/1471-2105-9-S9-S17
– volume: 11
  start-page: 733
  issue: 10
  year: 2010
  ident: 827_CR12
  publication-title: Nat Rev Genet
  doi: 10.1038/nrg2825
– ident: 827_CR1
– volume: 61
  start-page: 55
  issue: 1
  year: 2006
  ident: 827_CR18
  publication-title: Hum Hered
  doi: 10.1159/000092553
– volume: 5
  start-page: e1000477
  issue: 5
  year: 2009
  ident: 827_CR21
  publication-title: PLoS Genet
  doi: 10.1371/journal.pgen.1000477
– volume: 5
  start-page: 1564
  issue: 9
  year: 2010
  ident: 827_CR3
  publication-title: Nat Protoc
  doi: 10.1038/nprot.2010.116
– volume: 47
  start-page: 583
  issue: 260
  year: 1952
  ident: 827_CR11
  publication-title: J Am Stat Assoc
  doi: 10.1080/01621459.1952.10483441
– ident: 827_CR20
– volume: 41
  start-page: 19
  issue: 1/2
  year: 1954
  ident: 827_CR10
  publication-title: Biometrika
  doi: 10.2307/2333003
– volume: 85
  start-page: 847
  issue: 6
  year: 2009
  ident: 827_CR4
  publication-title: Am J Hum Genet
  doi: 10.1016/j.ajhg.2009.11.004
– volume: 34
  start-page: 939
  issue: 7
  year: 1984
  ident: 827_CR15
  publication-title: Neurology
  doi: 10.1212/WNL.34.7.939
– volume: 40
  start-page: D1308
  issue: D1
  year: 2012
  ident: 827_CR5
  publication-title: Nucl Acids Res
  doi: 10.1093/nar/gkr798
– ident: 827_CR23
  doi: 10.1002/9780470685983
– volume: 84
  start-page: 623
  issue: 4
  year: 2004
  ident: 827_CR8
  publication-title: Genomics
  doi: 10.1016/j.ygeno.2004.05.003
– year: 2011
  ident: 827_CR24
  publication-title: BMC Bioinformatics
  doi: 10.1186/1471-2105-12-68
– volume: 9
  start-page: 431
  issue: 1
  year: 2008
  ident: 827_CR19
  publication-title: BMC Genom
  doi: 10.1186/1471-2164-9-431
– volume: 11
  start-page: 356
  issue: 1
  year: 2010
  ident: 827_CR6
  publication-title: BMC Bioinf
  doi: 10.1186/1471-2105-11-356
– volume: 34
  start-page: 939
  issue: 7
  year: 1984
  ident: 827_CR16
  publication-title: Neurology
  doi: 10.1212/WNL.34.7.939
– ident: 827_CR7
– volume: 10
  start-page: 336
  issue: 4
  year: 2010
  ident: 827_CR17
  publication-title: Pharmacogenom J
  doi: 10.1038/tpj.2010.36
– volume: 256
  start-page: 240
  issue: 3
  year: 2004
  ident: 827_CR22
  publication-title: J Intern Med
  doi: 10.1111/j.1365-2796.2004.01380.x
SSID ssj0000314641
Score 2.236834
Snippet Background In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is...
In genetic analyses, the term 'batch effect' refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is the most...
BackgroundIn genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is...
BACKGROUND: In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is...
Background In genetic analyses, the term ‘batch effect’ refers to systematic differences caused by batch heterogeneity. Controlling this unintended effect is...
SourceID nrf
proquest
pubmed
crossref
springer
SourceType Open Website
Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 927
SubjectTerms algorithms
Alzheimer disease
Alzheimer Disease - genetics
Analysis of Variance
Animal Genetics and Genomics
Biomedical and Life Sciences
Cognitive ability
cognitive disorders
Cognitive Dysfunction - genetics
Genetic analysis
Genetic Heterogeneity
Genome-Wide Association Study - methods
Genome-Wide Association Study - standards
genotype
Genotypes
Genotyping
Genotyping Techniques - methods
Genotyping Techniques - standards
Human Genetics
Humans
Life Sciences
Microbial Genetics and Genomics
Multivariate analysis
patients
Plant Genetics and Genomics
Polymorphism, Single Nucleotide
Quality control
Research Article
Single-nucleotide polymorphism
Statistical analysis
생물학
Title SNP genotype calling and quality control for multi-batch-based studies
URI https://link.springer.com/article/10.1007/s13258-019-00827-5
https://www.ncbi.nlm.nih.gov/pubmed/31062262
https://www.proquest.com/docview/2264627351
https://www.proquest.com/docview/2232133281
https://www.proquest.com/docview/2305177708
https://www.kci.go.kr/kciportal/ci/sereArticleSearch/ciSereArtiView.kci?sereArticleSearchBean.artiId=ART002492841
Volume 41
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
ispartofPNX Genes & Genomics, 2019, 41(8), , pp.927-939
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LT9wwEB7xEFIvqAXahpdcxI1aSpw4jo-rii0FFSHBSnCy4kfaCpRFu8uBf89MHrtCPKSecsgkccbjmW_seQAcOhk7X8YaV5rOOGq_hFuVlByNja4cCohu-qf8Ps9PRtnptbzuksKmfbR7fyTZaOpFslsqJAVeaU52S3G5DKsSfXeS65EYzHdWqCB73rSsTNDWci1V0mXLvP6aZxZpuZ5Ur4HNFweljf0ZfoT1DjiyQTvTn2Ap1Buw1raSfNyE4eX5BaOCq7SnypDxlGbOytqzNm3ykXVB6QxRKmvCCLlFNfyXkx3zbNrGE27BaHh89eOEdz0SuMsyMePWp87FqfKyshp9paKskM3o04TY5mVQukqqQBgjI99JSZv5VOgggs0tqjabfoaVelyHr8CEyxPvZdDaF1mQyhI68WXqM20LH-sIkp5PxnUFxKmPxZ1ZlD4m3hrkrWl4a2QER_Nn7tvyGe9SHyD7za37Z6jqNV3_jM3txCC2_2WkztGjFhHs9rNjuvU2NZQOnCMSk0kE3-a3caXQ8UdZh_ED0aQCPXJRvEeTUs0ypeIigi_tzM_HjUA4x-_gAL73orAYwNs_tf1_5DvwQTTCSTK6CyuzyUPYQ9wzs_uwOvh5c3a834j7E_sF9bc
linkProvider Springer Nature
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlR3JbtQw9IlOheBC2QkUMIgbuMpix_GxqhimtB0h0ZHKyYqXABqUQbMc2q_ve1lmBJRKPeWQl8R5-7PfAvDOydj5MtYoaVpw1H4JtyopORobXTlkEN3MTzkZ56OJ-Hwmz7qisEWf7d4fSTaaelPslqWSEq80J7uluNyCbYExuBzA9v6nb0ebvRVqyZ43QysTtLZcS5V09TJXv-gPm7RVz6ur3M1_jkobCzTcgUm_9jbxZLq3Wto9d_FXW8eb_tx9uNe5pGy_5aEHcCvUD-F2O6Ty_BEMv46_MGrlSru1DElKBeysrD1rCzLPWZfuztD_ZU2CIreo4H9wspCeLdpMxccwGX48PRjxbvoCd0KkS2595lycKS8rqzEKK8oKCYjRUohtXgalq6QK5L0IisqUtMJnqQ5psLlFpWmzJzCoZ3V4Bix1eeK9DFr7QgSpLPk9vsy80LbwsY4g6fFvXNeanCZk_DKbpsqEH4P4MQ1-jIzg_fqZ321jjmuh3yJZzdT9NNRPm67fZ2Y6Nxg1HBqpc4zV0wh2e6qbTpIXhgqNc_TxZBLBm_VtlEE6WCnrMFsRTJZirJ8W18Fk1A1NqbiI4GnLUet1o4ud43dwAR967tgs4P8_9fxm4K_hzuj05NgcH46PXsDdtGE24rldGCznq_ASvaulfdUJ0yWO9RQg
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3dT9RAEJ8IRuOLEVGsIq7GN9xcd9vtdh-JcgGFCwlewtum-1EwmB45jgf-e2fa3h2Gj4SnPnTabmdmd36zOx8AX71KfahSgzPN5BxXP8GdFhVHY2Nqjwpi2v4ph6Nib5z_PFEnN7L422j3-ZFkl9NAVZqa2eAi1INl4lsmFQVhGU42THO1Ak9xORYU1DWWO4tdFirOXrTtKwXaXW6UFn3mzN2v-c86rTTT-i7geevQtLVFw1fwsgeRbKeT-ho8ic1reNa1lbxeh-Hx6IhR8VXaX2UoBEo5Z1UTWJdCec36AHWGiJW1IYXc4ZJ8xsmmBXbZxRa-gfFw9_f3Pd73S-A-z-WMu5B5n2Y6qNoZ9JvKqkaWo38TU1dUUZta1JHwRk5-lFYuD5k0UUZXOFzmXPYWVptJE98Bk74QIahoTCjzqLQjpBKqLOTGlSE1CYg5n6zvi4lTT4u_dlkGmXhrkbe25a1VCWwvnrnoSmk8SP0F2W_P_R9LFbDpejqx51OLOH_fKlOgdy0T2JxLx_Zz79JSanCBqEyJBD4vbuOsoaOQqomTK6LJJHrnsnyIJqP6ZVqnZQIbneQX40ZQXOB3cADf5qqwHMD9P_X-ceSf4PnRj6E92B_9-gAvZKunpK6bsDqbXsWPCIdmbqvV-H9qL_tS
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=SNP+genotype+calling+and+quality+control+for+multi-batch-based+studies&rft.jtitle=Genes+%26+genomics&rft.au=Seo%2C+Sujin&rft.au=Park%2C+Kyungtaek&rft.au=Lee%2C+Jang+Jae&rft.au=Choi%2C+Kyu+Yeong&rft.date=2019-08-01&rft.issn=2092-9293&rft.eissn=2092-9293&rft.volume=41&rft.issue=8&rft.spage=927&rft_id=info:doi/10.1007%2Fs13258-019-00827-5&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1976-9571&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1976-9571&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1976-9571&client=summon