Robust principal component analysis for accurate outlier sample detection in RNA-Seq data

High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimens...

Full description

Saved in:
Bibliographic Details
Published inBMC bioinformatics Vol. 21; no. 1; pp. 269 - 20
Main Authors Chen, Xiaoying, Zhang, Bo, Wang, Ting, Bonni, Azad, Zhao, Guoyan
Format Journal Article
LanguageEnglish
Published England BioMed Central Ltd 29.06.2020
BioMed Central
BMC
Subjects
Online AccessGet full text

Cover

Loading…
Abstract High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis. We report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes. rPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis.
AbstractList Background High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis. Results We report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes. Conclusions rPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis. Keywords: Robust principal component analysis, PcaGrid, PcaHubert, Outlier detection, RNA-seq, High-dimensional data, Anomaly detection
High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis.BACKGROUNDHigh throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis.We report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes.RESULTSWe report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes.rPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis.CONCLUSIONSrPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis.
Background High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis. Results We report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes. Conclusions rPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis.
High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis. We report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes. rPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis.
Abstract Background High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis. Results We report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes. Conclusions rPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis.
High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis. We report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes. rPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis.
ArticleNumber 269
Audience Academic
Author Bonni, Azad
Zhao, Guoyan
Wang, Ting
Chen, Xiaoying
Zhang, Bo
Author_xml – sequence: 1
  givenname: Xiaoying
  surname: Chen
  fullname: Chen, Xiaoying
– sequence: 2
  givenname: Bo
  surname: Zhang
  fullname: Zhang, Bo
– sequence: 3
  givenname: Ting
  surname: Wang
  fullname: Wang, Ting
– sequence: 4
  givenname: Azad
  surname: Bonni
  fullname: Bonni, Azad
– sequence: 5
  givenname: Guoyan
  orcidid: 0000-0001-5615-6774
  surname: Zhao
  fullname: Zhao, Guoyan
BackLink https://www.ncbi.nlm.nih.gov/pubmed/32600248$$D View this record in MEDLINE/PubMed
BookMark eNp9kktv1DAUhSNURNuBP8ACRWJTFim283I2SKOKx0gVSFNYsLJu7JvBoySe2g6i_547nVKaCqFIjmV_59i-95wmR6MbMUlecnbOuazeBi5k2WRMsIzlFZMZe5Kc8KLmmeCsPHowP05OQ9gyxmvJymfJcS4qxkQhT5Lva9dOIaY7b0dtd9Cn2g07OmeMKYzQ3wQb0s75FLSePERM3RR7iz4NMOx6TA1G1NG6MbVjuv68zK7wOjUQ4XnytIM-4Iu7_yL59uH914tP2eWXj6uL5WWmy6aOmQbZygoxl9B0LWu4qEosiqYAXqEQtSwNMx1tY8XQdNAYgabCtmsr4KibfJGsDr7GwVbROwbwN8qBVbcLzm8U-Gh1j4qTnGnsWGN4IY2GXOY1MmmwE7KtSvJ6d_DaTe2ARlMVPPQz0_nOaH-ojfup6lwUTSPI4OzOwLvrCUNUgw0a-x5GdFNQouANkzU1htDXj9CtmzyVfE8J6pwoCvGX2gA9wI6do3P13lQtKyEFRaDmRJ3_g6LP4GA1dbOztD4TvJkJiIn4K25gCkGtrtZz9tXDotxX40-ICBAHQHsXgsfuHuFM7ZOqDklVlFR1m1QaF4l8JNI2wj5IdHXb_0_6G8aD6-A
CitedBy_id crossref_primary_10_1111_acel_14093
crossref_primary_10_1371_journal_pbio_3002989
crossref_primary_10_1080_02664763_2022_2044018
crossref_primary_10_3390_rs16010187
crossref_primary_10_1016_j_euroneuro_2021_10_274
crossref_primary_10_1093_jn_nxac043
crossref_primary_10_3390_biology13110915
crossref_primary_10_4014_jmb_2012_12034
crossref_primary_10_1007_s11042_022_13285_1
crossref_primary_10_1016_j_trac_2024_117852
crossref_primary_10_1038_s41598_021_93250_4
crossref_primary_10_1186_s12859_024_05975_4
crossref_primary_10_1016_j_eswa_2024_126245
crossref_primary_10_1038_s41598_023_37521_2
crossref_primary_10_1111_gbb_12753
crossref_primary_10_3389_fgene_2022_788580
crossref_primary_10_1038_s41598_023_36134_z
crossref_primary_10_3389_fpls_2022_857535
crossref_primary_10_1016_j_scitotenv_2024_178288
crossref_primary_10_1038_s41698_022_00299_z
crossref_primary_10_1038_s41467_023_41352_0
crossref_primary_10_1371_journal_pgen_1010833
crossref_primary_10_3390_genes14020387
crossref_primary_10_1016_j_taap_2024_116865
crossref_primary_10_3390_math9080882
crossref_primary_10_55525_tjst_1293057
crossref_primary_10_1007_s13721_022_00364_4
crossref_primary_10_1016_j_xpro_2021_100539
crossref_primary_10_1038_s41467_024_48025_6
crossref_primary_10_1007_s12145_022_00869_6
crossref_primary_10_1016_j_ajhg_2024_10_019
crossref_primary_10_1007_s12035_025_04803_x
crossref_primary_10_1016_j_crfs_2023_100514
crossref_primary_10_1186_s13024_023_00638_z
crossref_primary_10_1111_mec_16220
crossref_primary_10_1016_j_jprot_2024_105178
crossref_primary_10_1038_s41598_020_79624_0
crossref_primary_10_15324_kjcls_2023_55_4_235
crossref_primary_10_1016_j_heliyon_2024_e41242
crossref_primary_10_1111_mec_17145
crossref_primary_10_3390_en14133951
crossref_primary_10_1084_jem_20231758
crossref_primary_10_1093_jas_skac019
crossref_primary_10_1371_journal_pone_0260119
crossref_primary_10_3390_molecules25184350
crossref_primary_10_2147_JIR_S469297
crossref_primary_10_1002_cyto_a_24921
crossref_primary_10_1371_journal_pone_0257356
crossref_primary_10_1093_nar_gkab1175
crossref_primary_10_1089_cmb_2022_0243
crossref_primary_10_3389_fmolb_2021_791331
Cites_doi 10.3389/fmicb.2016.00794
10.1016/j.ygeno.2010.01.003
10.1016/j.ins.2012.10.017
10.1016/j.neuron.2015.05.005
10.1093/bioinformatics/btn224
10.1093/bioinformatics/btt688
10.18637/jss.v032.i03
10.1186/s12859-016-1212-5
10.1200/JCO.2017.35.15_suppl.e13025
10.1111/j.1474-9726.2012.00857.x
10.1093/biostatistics/kxv027
10.1016/j.ydbio.2012.04.018
10.1186/gb-2013-14-9-r95
10.1214/088342307000000087
10.1093/bioinformatics/btu638
10.1093/bioinformatics/btv425
10.14806/ej.17.1.200
10.1101/gr.231357.117
10.1016/j.neuron.2006.03.034
10.1186/s12859-018-2149-7
10.1186/s13059-014-0550-8
10.2202/1544-6115.1426
10.1016/j.tibtech.2017.02.012
10.1016/j.chemolab.2007.01.004
10.1093/bioinformatics/btr026
10.1198/004017004000000563
10.1016/j.aca.2011.03.055
10.1093/bioinformatics/btm487
10.1093/bioinformatics/btv272
10.1242/dev.00182
10.1093/bioinformatics/btn647
10.1186/s13059-016-0881-8
10.1523/JNEUROSCI.0688-18.2018
10.1186/1752-0509-6-63
10.1016/j.microc.2012.03.028
10.1007/BF02595862
10.1093/bioinformatics/bts635
10.1093/bioinformatics/btx790
10.1038/nbt.4096
10.1088/1742-6596/705/1/012003
10.1038/nmeth.1226
ContentType Journal Article
Copyright COPYRIGHT 2020 BioMed Central Ltd.
2020. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
The Author(s) 2020
Copyright_xml – notice: COPYRIGHT 2020 BioMed Central Ltd.
– notice: 2020. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
– notice: The Author(s) 2020
DBID AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
ISR
3V.
7QO
7SC
7X7
7XB
88E
8AL
8AO
8FD
8FE
8FG
8FH
8FI
8FJ
8FK
ABUWG
AEUYN
AFKRA
ARAPS
AZQEC
BBNVY
BENPR
BGLVJ
BHPHI
CCPQU
DWQXO
FR3
FYUFA
GHDGH
GNUQQ
HCIFZ
JQ2
K7-
K9.
L7M
LK8
L~C
L~D
M0N
M0S
M1P
M7P
P5Z
P62
P64
PHGZM
PHGZT
PIMPY
PJZUB
PKEHL
PPXIY
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
Q9U
7X8
5PM
DOA
DOI 10.1186/s12859-020-03608-0
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
Gale In Context: Science
ProQuest Central (Corporate)
Biotechnology Research Abstracts
Computer and Information Systems Abstracts
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Medical Database (Alumni Edition)
Computing Database (Alumni Edition)
ProQuest Pharma Collection
Technology Research Database
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Natural Science Collection
ProQuest Hospital Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
ProQuest Central (Alumni)
ProQuest One Sustainability (subscription)
ProQuest Central UK/Ireland
Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
Biological Science Collection
ProQuest Central
Technology Collection
Natural Science Collection
ProQuest One Community College
ProQuest Central
Engineering Research Database
Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Central Student
SciTech Premium Collection
ProQuest Computer Science Collection
Computer Science Database
ProQuest Health & Medical Complete (Alumni)
Advanced Technologies Database with Aerospace
Biological Sciences
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
Computing Database
Health & Medical Collection (Alumni)
Medical Database
Biological Science Database
ProQuest Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
Biotechnology and BioEngineering Abstracts
ProQuest Central Premium
ProQuest One Academic
Publicly Available Content Database
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
ProQuest Central Basic
MEDLINE - Academic
PubMed Central (Full Participant titles)
Directory of Open Access Journals (DOAJ)
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Publicly Available Content Database
Computer Science Database
ProQuest Central Student
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
SciTech Premium Collection
ProQuest Central China
ProQuest One Applied & Life Sciences
ProQuest One Sustainability
Health Research Premium Collection
Natural Science Collection
Health & Medical Research Collection
Biological Science Collection
ProQuest Central (New)
ProQuest Medical Library (Alumni)
Advanced Technologies & Aerospace Collection
ProQuest Biological Science Collection
ProQuest One Academic Eastern Edition
ProQuest Hospital Collection
ProQuest Technology Collection
Health Research Premium Collection (Alumni)
Biological Science Database
ProQuest Hospital Collection (Alumni)
Biotechnology and BioEngineering Abstracts
ProQuest Health & Medical Complete
ProQuest One Academic UKI Edition
Engineering Research Database
ProQuest One Academic
ProQuest One Academic (New)
Technology Collection
Technology Research Database
Computer and Information Systems Abstracts – Academic
ProQuest One Academic Middle East (New)
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Natural Science Collection
ProQuest Pharma Collection
ProQuest Central
ProQuest Health & Medical Research Collection
Biotechnology Research Abstracts
Health and Medicine Complete (Alumni Edition)
ProQuest Central Korea
Advanced Technologies Database with Aerospace
ProQuest Computing
ProQuest Central Basic
ProQuest Computing (Alumni Edition)
ProQuest SciTech Collection
Computer and Information Systems Abstracts Professional
Advanced Technologies & Aerospace Database
ProQuest Medical Library
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList
MEDLINE - Academic
Publicly Available Content Database
MEDLINE



Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 4
  dbid: 8FG
  name: ProQuest Technology Collection
  url: https://search.proquest.com/technologycollection1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1471-2105
EndPage 20
ExternalDocumentID oai_doaj_org_article_1fa90cef09d148dca3837e08def28b65
PMC7324992
A628212871
32600248
10_1186_s12859_020_03608_0
Genre Journal Article
GrantInformation_xml – fundername: NIDA NIH HHS
  grantid: 5R25DA027995
– fundername: NHGRI NIH HHS
  grantid: R01 HG007175
– fundername: NIEHS NIH HHS
  grantid: U24 ES026699
– fundername: NHGRI NIH HHS
  grantid: U41 HG010972
– fundername: NIEHS NIH HHS
  grantid: U24ES026699
– fundername: NHGRI NIH HHS
  grantid: U41HG010972
– fundername: NIDA NIH HHS
  grantid: R25 DA027995
– fundername: NHGRI NIH HHS
  grantid: U01HG009391
– fundername: NCI NIH HHS
  grantid: U01 CA200060
– fundername: national institute of health
  grantid: NS041021
– fundername: ;
  grantid: R01HG007175; U01HG009391; U41HG010972
– fundername: ;
  grantid: U24ES026699
– fundername: ;
  grantid: 5R25DA027995
– fundername: ;
  grantid: NS041021
– fundername: ;
  grantid: Goldman Sachs Philanthropy Fund
GroupedDBID ---
0R~
23N
2WC
53G
5VS
6J9
7X7
88E
8AO
8FE
8FG
8FH
8FI
8FJ
AAFWJ
AAJSJ
AAKPC
AASML
AAYXX
ABDBF
ABUWG
ACGFO
ACGFS
ACIHN
ACIWK
ACPRK
ACUHS
ADBBV
ADMLS
ADUKV
AEAQA
AENEX
AEUYN
AFKRA
AFPKN
AFRAH
AHBYD
AHMBA
AHYZX
ALIPV
ALMA_UNASSIGNED_HOLDINGS
AMKLP
AMTXH
AOIJS
ARAPS
AZQEC
BAPOH
BAWUL
BBNVY
BCNDV
BENPR
BFQNJ
BGLVJ
BHPHI
BMC
BPHCQ
BVXVI
C6C
CCPQU
CITATION
CS3
DIK
DU5
DWQXO
E3Z
EAD
EAP
EAS
EBD
EBLON
EBS
EMB
EMK
EMOBN
ESX
F5P
FYUFA
GNUQQ
GROUPED_DOAJ
GX1
HCIFZ
HMCUK
HYE
IAO
ICD
IHR
INH
INR
ISR
ITC
K6V
K7-
KQ8
LK8
M1P
M48
M7P
MK~
ML0
M~E
O5R
O5S
OK1
OVT
P2P
P62
PGMZT
PHGZM
PHGZT
PIMPY
PQQKQ
PROAC
PSQYO
RBZ
RNS
ROL
RPM
RSV
SBL
SOJ
SV3
TR2
TUS
UKHRP
W2D
WOQ
WOW
XH6
XSB
CGR
CUY
CVF
ECM
EIF
NPM
PJZUB
PPXIY
PQGLB
PMFND
3V.
7QO
7SC
7XB
8AL
8FD
8FK
FR3
JQ2
K9.
L7M
L~C
L~D
M0N
P64
PKEHL
PQEST
PQUKI
PRINS
Q9U
7X8
5PM
PUEGO
ID FETCH-LOGICAL-c597t-ca8b86ee38a9fb091265e4494a16e22785d0dfe38e60edfa9d2ed6ebfb6a1ec93
IEDL.DBID M48
ISSN 1471-2105
IngestDate Wed Aug 27 01:23:57 EDT 2025
Thu Aug 21 14:36:56 EDT 2025
Mon Jul 21 09:41:11 EDT 2025
Fri Jul 25 10:39:46 EDT 2025
Tue Jun 17 21:38:53 EDT 2025
Tue Jun 10 20:28:13 EDT 2025
Fri Jun 27 04:49:48 EDT 2025
Mon Jul 21 05:33:02 EDT 2025
Thu Apr 24 22:53:08 EDT 2025
Tue Jul 01 03:38:30 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords High-dimensional data
Robust principal component analysis
RNA-seq
Outlier detection
Anomaly detection
PcaGrid
PcaHubert
Language English
License Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c597t-ca8b86ee38a9fb091265e4494a16e22785d0dfe38e60edfa9d2ed6ebfb6a1ec93
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ORCID 0000-0001-5615-6774
OpenAccessLink http://journals.scholarsportal.info/openUrl.xqy?doi=10.1186/s12859-020-03608-0
PMID 32600248
PQID 2424712442
PQPubID 44065
PageCount 20
ParticipantIDs doaj_primary_oai_doaj_org_article_1fa90cef09d148dca3837e08def28b65
pubmedcentral_primary_oai_pubmedcentral_nih_gov_7324992
proquest_miscellaneous_2419087147
proquest_journals_2424712442
gale_infotracmisc_A628212871
gale_infotracacademiconefile_A628212871
gale_incontextgauss_ISR_A628212871
pubmed_primary_32600248
crossref_primary_10_1186_s12859_020_03608_0
crossref_citationtrail_10_1186_s12859_020_03608_0
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2020-06-29
PublicationDateYYYYMMDD 2020-06-29
PublicationDate_xml – month: 06
  year: 2020
  text: 2020-06-29
  day: 29
PublicationDecade 2020
PublicationPlace England
PublicationPlace_xml – name: England
– name: London
PublicationTitle BMC bioinformatics
PublicationTitleAlternate BMC Bioinformatics
PublicationYear 2020
Publisher BioMed Central Ltd
BioMed Central
BMC
Publisher_xml – name: BioMed Central Ltd
– name: BioMed Central
– name: BMC
References V Todorov (3608_CR22) 2009; 32
GD Orvis (3608_CR25) 2012; 367
A Mortazavi (3608_CR3) 2008; 5
3608_CR7
GV Cohen Freue (3608_CR9) 2007; 23
P Manga (3608_CR17) 2016; 7
A Dobin (3608_CR32) 2013; 29
A Kauffmann (3608_CR11) 2010; 95
M Hubert (3608_CR37) 2008; 23
Y Liu (3608_CR16) 2014; 30
P Du (3608_CR14) 2008; 24
M Martin (3608_CR30) 2011; 17
M Gierlinski (3608_CR43) 2015; 31
S Yang (3608_CR13) 2007; 2
D Pan (3608_CR41) 2012; 11
A Conesa (3608_CR4) 2016; 17
C Croux (3608_CR36) 2007; 87
MB Lopes (3608_CR18) 2018; 19
N Locantore (3608_CR38) 1999; 8
J Stegmuller (3608_CR40) 2006; 50
3608_CR35
T Omura (3608_CR28) 2015; 86
P Filzmoser (3608_CR20) 2011; 705
AM Kenney (3608_CR39) 2003; 130
V Nygaard (3608_CR44) 2016; 17
3608_CR1
C Xu (3608_CR29) 2018; 28
R Schmieder (3608_CR31) 2011; 27
PJ Rousseeuw (3608_CR2) 2018; 8
MI Love (3608_CR34) 2014; 15
AD Shieh (3608_CR12) 2009; 8
A Kauffmann (3608_CR10) 2009; 25
SS Norton (3608_CR5) 2018; 34
P Filzmoser (3608_CR21) 2013; 245
MC Oldham (3608_CR8) 2012; 6
M Hubert (3608_CR19) 2005; 47
AC Frazee (3608_CR26) 2015; 31
3608_CR42
GA Merino (3608_CR6) 2016; 705
S Anders (3608_CR33) 2015; 31
Y Oytam (3608_CR45) 2016; 17
M Cláudia Pascoal (3608_CR24) 2010
WWB Goh (3608_CR46) 2017; 35
F Rapaport (3608_CR15) 2013; 14
WFD Rocha (3608_CR23) 2013; 109
3608_CR27
References_xml – volume: 7
  start-page: 794
  year: 2016
  ident: 3608_CR17
  publication-title: Front Microbiol
  doi: 10.3389/fmicb.2016.00794
– volume: 95
  start-page: 138
  issue: 3
  year: 2010
  ident: 3608_CR11
  publication-title: Genomics
  doi: 10.1016/j.ygeno.2010.01.003
– volume: 245
  start-page: 4
  year: 2013
  ident: 3608_CR21
  publication-title: Inform Sci
  doi: 10.1016/j.ins.2012.10.017
– volume: 86
  start-page: 1215
  issue: 5
  year: 2015
  ident: 3608_CR28
  publication-title: Neuron
  doi: 10.1016/j.neuron.2015.05.005
– volume: 2
  start-page: 351
  year: 2007
  ident: 3608_CR13
  publication-title: Cancer Inform
– volume: 24
  start-page: 1547
  issue: 13
  year: 2008
  ident: 3608_CR14
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btn224
– volume: 30
  start-page: 301
  issue: 3
  year: 2014
  ident: 3608_CR16
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btt688
– volume: 32
  start-page: 1
  issue: 3
  year: 2009
  ident: 3608_CR22
  publication-title: J Stat Softw
  doi: 10.18637/jss.v032.i03
– volume: 17
  start-page: 332
  issue: 1
  year: 2016
  ident: 3608_CR45
  publication-title: BMC Bioinformatics
  doi: 10.1186/s12859-016-1212-5
– ident: 3608_CR42
  doi: 10.1200/JCO.2017.35.15_suppl.e13025
– ident: 3608_CR7
– volume: 11
  start-page: 902
  issue: 5
  year: 2012
  ident: 3608_CR41
  publication-title: Aging Cell
  doi: 10.1111/j.1474-9726.2012.00857.x
– volume: 17
  start-page: 29
  issue: 1
  year: 2016
  ident: 3608_CR44
  publication-title: Biostatistics
  doi: 10.1093/biostatistics/kxv027
– volume-title: Detection of outliers using robust principal component analysis: a simulation study, vol. 77
  year: 2010
  ident: 3608_CR24
– volume: 367
  start-page: 25
  issue: 1
  year: 2012
  ident: 3608_CR25
  publication-title: Dev Biol
  doi: 10.1016/j.ydbio.2012.04.018
– volume: 14
  start-page: R95
  issue: 9
  year: 2013
  ident: 3608_CR15
  publication-title: Genome Biol
  doi: 10.1186/gb-2013-14-9-r95
– volume: 23
  start-page: 92
  issue: 1
  year: 2008
  ident: 3608_CR37
  publication-title: Stat Sci
  doi: 10.1214/088342307000000087
– ident: 3608_CR1
– volume: 31
  start-page: 166
  issue: 2
  year: 2015
  ident: 3608_CR33
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btu638
– volume: 31
  start-page: 3625
  issue: 22
  year: 2015
  ident: 3608_CR43
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btv425
– volume: 17
  start-page: 10
  year: 2011
  ident: 3608_CR30
  publication-title: EMBnet J
  doi: 10.14806/ej.17.1.200
– volume: 28
  start-page: 1097
  issue: 8
  year: 2018
  ident: 3608_CR29
  publication-title: Genome Res
  doi: 10.1101/gr.231357.117
– volume: 50
  start-page: 389
  issue: 3
  year: 2006
  ident: 3608_CR40
  publication-title: Neuron
  doi: 10.1016/j.neuron.2006.03.034
– volume: 19
  start-page: 168
  issue: 1
  year: 2018
  ident: 3608_CR18
  publication-title: BMC Bioinformatics
  doi: 10.1186/s12859-018-2149-7
– volume: 15
  start-page: 550
  issue: 12
  year: 2014
  ident: 3608_CR34
  publication-title: Genome Biol
  doi: 10.1186/s13059-014-0550-8
– volume: 8
  start-page: Article 13
  year: 2009
  ident: 3608_CR12
  publication-title: Stat Appl Genet Mol Biol
  doi: 10.2202/1544-6115.1426
– volume: 35
  start-page: 498
  issue: 6
  year: 2017
  ident: 3608_CR46
  publication-title: Trends Biotechnol
  doi: 10.1016/j.tibtech.2017.02.012
– volume: 87
  start-page: 218
  issue: 2
  year: 2007
  ident: 3608_CR36
  publication-title: Chemometr Intell Lab
  doi: 10.1016/j.chemolab.2007.01.004
– volume: 27
  start-page: 863
  issue: 6
  year: 2011
  ident: 3608_CR31
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btr026
– volume: 47
  start-page: 64
  issue: 1
  year: 2005
  ident: 3608_CR19
  publication-title: Technometrics
  doi: 10.1198/004017004000000563
– volume: 705
  start-page: 2
  issue: 1–2
  year: 2011
  ident: 3608_CR20
  publication-title: Anal Chim Acta
  doi: 10.1016/j.aca.2011.03.055
– volume: 23
  start-page: 3162
  issue: 23
  year: 2007
  ident: 3608_CR9
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btm487
– volume: 31
  start-page: 2778
  issue: 17
  year: 2015
  ident: 3608_CR26
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btv272
– volume: 130
  start-page: 15
  issue: 1
  year: 2003
  ident: 3608_CR39
  publication-title: Development
  doi: 10.1242/dev.00182
– volume: 25
  start-page: 415
  issue: 3
  year: 2009
  ident: 3608_CR10
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btn647
– volume: 17
  start-page: 13
  year: 2016
  ident: 3608_CR4
  publication-title: Genome Biol
  doi: 10.1186/s13059-016-0881-8
– ident: 3608_CR27
  doi: 10.1523/JNEUROSCI.0688-18.2018
– volume: 6
  start-page: 63
  year: 2012
  ident: 3608_CR8
  publication-title: BMC Syst Biol
  doi: 10.1186/1752-0509-6-63
– volume: 109
  start-page: 112
  year: 2013
  ident: 3608_CR23
  publication-title: Microchem J
  doi: 10.1016/j.microc.2012.03.028
– volume: 8
  start-page: 1
  issue: 1
  year: 1999
  ident: 3608_CR38
  publication-title: Test
  doi: 10.1007/BF02595862
– volume: 8
  start-page: 1
  issue: 2
  year: 2018
  ident: 3608_CR2
  publication-title: WIREs: Data Mining Knowl Discovery
– volume: 29
  start-page: 15
  issue: 1
  year: 2013
  ident: 3608_CR32
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bts635
– volume: 34
  start-page: 1488
  issue: 9
  year: 2018
  ident: 3608_CR5
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btx790
– ident: 3608_CR35
  doi: 10.1038/nbt.4096
– volume: 705
  start-page: 012003
  year: 2016
  ident: 3608_CR6
  publication-title: J Phys Conf Ser
  doi: 10.1088/1742-6596/705/1/012003
– volume: 5
  start-page: 621
  issue: 7
  year: 2008
  ident: 3608_CR3
  publication-title: Nat Methods
  doi: 10.1038/nmeth.1226
SSID ssj0017805
Score 2.537247
Snippet High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme...
Background High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition,...
Abstract Background High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data...
SourceID doaj
pubmedcentral
proquest
gale
pubmed
crossref
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
Enrichment Source
StartPage 269
SubjectTerms Algorithms
Analysis
Animals
Bias
Cerebellum
Cerebellum - metabolism
Chemometrics
Computer simulation
Data acquisition
Data analysis
Data points
Data processing
Datasets
Experiments
Female
Functional analysis
Gene expression
Gene sequencing
Genes
High-dimensional data
Information management
Male
Mice, Knockout
Modelling
Multivariate analysis
Outlier detection
Outliers (statistics)
PcaGrid
PcaHubert
Performance enhancement
Performance evaluation
Principal Component Analysis
Principal components analysis
Proto-Oncogene Proteins - genetics
Reverse Transcriptase Polymerase Chain Reaction
Reverse transcription
Ribonucleic acid
RNA
RNA sequencing
RNA-seq
RNA-Seq - methods
Robust principal component analysis
Sample variance
Samples
Simulation
Statistical analysis
Statistical methods
Transcription (Genetics)
SummonAdditionalLinks – databaseName: Directory of Open Access Journals (DOAJ)
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3Pi9UwEA6yIHgR1591V4kieJCyaZqXlxyf4rIK7uGtC-sppMlEF6Rvte3B_96ZtO_xiqAXr820tF9nMjPtzDeMvTJQyRBsLEGkpkSPH9Hm0Nxt0o0AjBBU5in4dK7PLtXHq8XV3qgvqgkb6YFH4E6q5K0IkISNGLnH4CmlAmEiJGkandlL0edtk6np_wEx9W9bZIw-6SriaSspVcIdm374z9xQZuv_c0_ec0rzgsk9D3R6j92dQke-Gm_5kN2C9j67PQ6T_PWAfVlvmqHr-c34-RwlqVx80-LFuJ-oRziGqNyHMBBBBKdiIPSKvPNEEcwj9Lkuq-XXLV-fr8oL-MGpgvQhuzx9__ndWTkNTigD5gd9GbxpjAaojbepQXSkXoBSVvlKA_W-LqKICZdBC4gIcJQQNTSp0b6CYOtH7KDF-3vCeFKeRkcoqOukEADvRTSaevdUwtCsLli1xdGFiVWchlt8dzm7MNqN2DvE3mXsnSjYm905NyOnxl-l39Lr2UkSH3Y-gFriJi1x_9KSgr2kl-uI8aKlkpqvfug69-Fi7VYas86KEseCvZ6E0gafIfipQwGRIJKsmeTxTBJNMsyXtzrkpi2hc9SHs6RoShbsxW6ZzqQytxY2A8lgfIYXUMuCPR5VbvfcNY0SkMoUbDlTxhkw85X2-lsmDF9i1GytfPo_kDxid2S2I11Ke8wO-p8DPMO4rG-eZxP8DXEdNXU
  priority: 102
  providerName: Directory of Open Access Journals
– databaseName: ProQuest Technology Collection
  dbid: 8FG
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3Nb9UwDI9gCIkL4pvCQAEhcUDV-pGXl5zQA_EYSOzwxqRxitLE2Sah9m1tD_z32G1eWYW0a-NUiRPHdmL_zNg7BXnhnPYpZKFKUeN7lDkUdx1klQFaCGLAKfhxJA9PxPfTxWm8cGtjWOXuTBwOat84uiM_oDSGJSmj4uP2MqWqUfS6Gkto3GZ3ctQ0FNKl1l-nVwTC698lyih50OaE1paSw4TnNj37z5TRgNn__8l8TTXNwyav6aH1A3Y_GpB8Na74Q3YL6kfs7lhS8s9j9mvTVH3b8e14iY6UFDTe1PgzbiMACUdDlVvneoKJ4BQShLqRt5aAgrmHbojOqvlFzTdHq_QYLjnFkT5hJ-svPz8fprF8QurQS-hSZ1WlJECprA4V2gWFXIAQWthcAmXALnzmAzaDzMAHq30BXkIVKmlzcLp8yvZqHN9zxoOwVEBCQFkGgQywNvNKUgafCGiglQnLd3w0LmKLU4mL32bwMZQ0I-8N8t4MvDdZwj5MfbYjssaN1J9oeSZKQsUePjRXZyYKmclxEpmDkGmPXh4Ok9xvyJSHUKhKLhL2lhbXEO5FTYE1Z7ZvW_PteGNWEn3PnNzHhL2PRKHBOTgb8xSQEwSVNaPcn1GiYLp5824PmXgwtObfNk7Ym6mZelKwWw1NTzRopeEPxDJhz8YtN827pIIChVAJW84244wx85b64nyADV-i7ax18eLmYb1k94pBQmRa6H2211318Artrq56PQjXX1qwK_s
  priority: 102
  providerName: ProQuest
Title Robust principal component analysis for accurate outlier sample detection in RNA-Seq data
URI https://www.ncbi.nlm.nih.gov/pubmed/32600248
https://www.proquest.com/docview/2424712442
https://www.proquest.com/docview/2419087147
https://pubmed.ncbi.nlm.nih.gov/PMC7324992
https://doaj.org/article/1fa90cef09d148dca3837e08def28b65
Volume 21
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3da9tADBf9YLCX0X177cJtDPYwvPnjcj4_jJGOZl2gYSQLZE_H-T66QnHaOIH1v59kO1nNSl9s8OnMnSxZki39BPBOujgxJrehi3wRosW3qHOo7rkXReTQQ-A1TsHZWJzO-Gjen-_Apt1Ry8DqztCO-knNlpcf_1zffEGF_1wrvBSfqphQ2EIKhPB9TL_zd2EfLVNGHQ3O-L-_CoTfvymcuXNexzjVGP7_v6lvmapuGuUtuzQ8gEetQ8kGjQQ8hh1XPoEHTYvJm6fwa7Io1tWKXTUf1ZGSksgXJd6M6RaQhKHjyrQxa4KNYJQihLaSVZqAg5l1qzpbq2QXJZuMB-HUXTPKK30Gs-HJz6-nYdtOITQYNaxCo2UhhXOp1Lkv0E9IRN9xnnMdC0cVsX0bWY_DTkTOep3bxFnhCl8IHTuTp89hr8T1vQTmuaaGEtylqefIAK0jKwVV9HGPDlsaQLzhozIt1ji1vLhUdcwhhWp4r5D3qua9igL4sJ1z1SBt3Et9TI9nS0ko2fWFxfJctUqnYtxEZJyPcotRHy6TwnEXSet8IgvRD-AtPVxFOBglJdqc63VVqe_TiRoIjEVjCicDeN8S-QXuwei2bgE5QdBZHcqjDiUqqukOb2RIbeRcUXVORj5WEsCb7TDNpOS30i3WRINeG96AZwG8aERuu--UGgwkXAaQdYSxw5juSHnxu4YRz9CXzvPk1f3LOoSHSa0hIkzyI9hbLdfuNfphq6IHu9k8w6McfuvB_mAwmo7wfHwy_jHp1d82erX6_QWZpDUX
linkProvider Scholars Portal
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwELaqIgQXxJtAAYNAHFDUxPF6kwNCy2PZpe0etq3Unoxjj0sllGybXaH-KX4jM3ksjZB66zWeRPZ4xjPjzHzD2JsUYmFt5kKIfB6ixXeoc6jumVd5BOghyBqnYG-mJofy-9HgaIP96WphKK2yOxPrg9qVlu7It6mMYUjGSHxcnIXUNYr-rnYtNBqx2IGL3xiyVR-mX3B_3wox_nrweRK2XQVCi87zMrQmzVMFkKQm8zmaS6EGIGUmTayACkMHLnIeh0FF4LzJnACnIPe5MjFYAl_CI_-GTNCSU2X6-Nv6rwX1B-gKc1K1XcWEDhdSgIZ2gtIMesav7hHwvyW4ZAr7aZqX7N74LrvTOqx81EjYPbYBxX12s2lhefGAHc_LfFUt-aK5tEdKSlIvC_wYNy3gCUfHmBtrVwRLwSkFCW0xrwwBE3MHyzobrOCnBZ_PRuE-nHHKW33IDq-FsY_YZoHze8K4l4YaVkhIEi-RAcZELlVUMSg9OoRJwOKOj9q2WObUUuOXrmOaVOmG9xp5r2ve6yhg79fvLBokjyupP9H2rCkJhbt-UJ6f6FapdYyLiCz4KHMYVeI0KdyHKHXgRZqrQcBe0-ZqwtkoKJHnxKyqSk_353qkMNaNKVwN2LuWyJe4BmvaugjkBEFz9Si3epR4ENj-cCdDuj2IKv1PbQL2aj1Mb1JyXQHlimjQK8QPyGHAHjcit153Qg0MhEwDNuwJY48x_ZHi9GcNUz5EXz3LxNOrp_WS3Zoc7O3q3els5xm7LWptUaHIttjm8nwFz9HnW-YvakXj7Md1a_Zfeg9rHw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Robust+principal+component+analysis+for+accurate+outlier+sample+detection+in+RNA-Seq+data&rft.jtitle=BMC+bioinformatics&rft.au=Chen%2C+Xiaoying&rft.au=Zhang%2C+Bo&rft.au=Wang%2C+Ting&rft.au=Azad+Bonni&rft.date=2020-06-29&rft.pub=BioMed+Central&rft.eissn=1471-2105&rft.volume=21&rft.spage=1&rft_id=info:doi/10.1186%2Fs12859-020-03608-0
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1471-2105&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1471-2105&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1471-2105&client=summon