Robust principal component analysis for accurate outlier sample detection in RNA-Seq data

High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimens...

Full description

Saved in:

Bibliographic Details
Published in	BMC bioinformatics Vol. 21; no. 1; pp. 269 - 20
Main Authors	Chen, Xiaoying, Zhang, Bo, Wang, Ting, Bonni, Azad, Zhao, Guoyan
Format	Journal Article
Language	English
Published	England BioMed Central Ltd 29.06.2020 BioMed Central BMC
Subjects	Algorithms Analysis Animals Bias Cerebellum Cerebellum - metabolism Chemometrics Computer simulation Data acquisition Data analysis Data points Data processing Datasets Experiments Female Functional analysis Gene expression Gene sequencing Genes High-dimensional data Information management Male Mice, Knockout Modelling Multivariate analysis Outlier detection Outliers (statistics) PcaGrid PcaHubert Performance enhancement Performance evaluation Principal Component Analysis Principal components analysis Proto-Oncogene Proteins - genetics Reverse Transcriptase Polymerase Chain Reaction Reverse transcription Ribonucleic acid RNA RNA sequencing RNA-seq RNA-Seq - methods Robust principal component analysis Sample variance Samples Simulation Statistical analysis Statistical methods Transcription (Genetics) High-dimensional data Robust principal component analysis RNA-seq Outlier detection Anomaly detection PcaGrid PcaHubert
Online Access	Get full text

Cover

Loading…

Abstract	High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis. We report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes. rPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis.
AbstractList	Background High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis. Results We report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes. Conclusions rPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis. Keywords: Robust principal component analysis, PcaGrid, PcaHubert, Outlier detection, RNA-seq, High-dimensional data, Anomaly detection High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis.BACKGROUNDHigh throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis.We report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes.RESULTSWe report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes.rPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis.CONCLUSIONSrPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis. Background High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis. Results We report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes. Conclusions rPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis. High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis. We report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes. rPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis. Abstract Background High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis. Results We report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes. Conclusions rPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis. High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme deviation of a sample from samples of the same treatment group may occur due to technical variation or true biological differences. The high-dimensionality of the data with few biological replicates make it challenging to accurately detect those samples, and this issue is not well studied in the literature currently. Robust statistics is a family of theories and techniques aim to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. Robust statistics have been widely used in multivariate data analysis for outlier detection in chemometrics and engineering. Here we apply robust statistics on RNA-seq data analysis. We report the use of two robust principal component analysis (rPCA) methods, PcaHubert and PcaGrid, to detect outlier samples in multiple simulated and real biological RNA-seq data sets with positive control outlier samples. PcaGrid achieved 100% sensitivity and 100% specificity in all the tests using positive control outliers with varying degrees of divergence. We applied rPCA methods and classical principal component analysis (cPCA) on an RNA-Seq data set profiling gene expression of the external granule layer in the cerebellum of control and conditional SnoN knockout mice. Both rPCA methods detected the same two outlier samples but cPCA failed to detect any. We performed differentially expressed gene detection before and after outlier removal as well as with and without batch effect modeling. We validated gene expression changes using quantitative reverse transcription PCR and used the result as reference to compare the performance of eight different data analysis strategies. Removing outliers without batch effect modeling performed the best in term of detecting biologically relevant differentially expressed genes. rPCA implemented in the PcaGrid function is an accurate and objective method to detect outlier samples. It is well suited for high-dimensional data with small sample sizes like RNA-seq data. Outlier removal can significantly improve the performance of differential gene detection and downstream functional analysis.
ArticleNumber	269
Audience	Academic
Author	Bonni, Azad Zhao, Guoyan Wang, Ting Chen, Xiaoying Zhang, Bo
Author_xml	– sequence: 1 givenname: Xiaoying surname: Chen fullname: Chen, Xiaoying – sequence: 2 givenname: Bo surname: Zhang fullname: Zhang, Bo – sequence: 3 givenname: Ting surname: Wang fullname: Wang, Ting – sequence: 4 givenname: Azad surname: Bonni fullname: Bonni, Azad – sequence: 5 givenname: Guoyan orcidid: 0000-0001-5615-6774 surname: Zhao fullname: Zhao, Guoyan
BackLink	https://www.ncbi.nlm.nih.gov/pubmed/32600248$$D View this record in MEDLINE/PubMed
BookMark	eNp9kktv1DAUhSNURNuBP8ACRWJTFim283I2SKOKx0gVSFNYsLJu7JvBoySe2g6i_547nVKaCqFIjmV_59i-95wmR6MbMUlecnbOuazeBi5k2WRMsIzlFZMZe5Kc8KLmmeCsPHowP05OQ9gyxmvJymfJcS4qxkQhT5Lva9dOIaY7b0dtd9Cn2g07OmeMKYzQ3wQb0s75FLSePERM3RR7iz4NMOx6TA1G1NG6MbVjuv68zK7wOjUQ4XnytIM-4Iu7_yL59uH914tP2eWXj6uL5WWmy6aOmQbZygoxl9B0LWu4qEosiqYAXqEQtSwNMx1tY8XQdNAYgabCtmsr4KibfJGsDr7GwVbROwbwN8qBVbcLzm8U-Gh1j4qTnGnsWGN4IY2GXOY1MmmwE7KtSvJ6d_DaTe2ARlMVPPQz0_nOaH-ojfup6lwUTSPI4OzOwLvrCUNUgw0a-x5GdFNQouANkzU1htDXj9CtmzyVfE8J6pwoCvGX2gA9wI6do3P13lQtKyEFRaDmRJ3_g6LP4GA1dbOztD4TvJkJiIn4K25gCkGtrtZz9tXDotxX40-ICBAHQHsXgsfuHuFM7ZOqDklVlFR1m1QaF4l8JNI2wj5IdHXb_0_6G8aD6-A
CitedBy_id	crossref_primary_10_1111_acel_14093 crossref_primary_10_1371_journal_pbio_3002989 crossref_primary_10_1080_02664763_2022_2044018 crossref_primary_10_3390_rs16010187 crossref_primary_10_1016_j_euroneuro_2021_10_274 crossref_primary_10_1093_jn_nxac043 crossref_primary_10_3390_biology13110915 crossref_primary_10_4014_jmb_2012_12034 crossref_primary_10_1007_s11042_022_13285_1 crossref_primary_10_1016_j_trac_2024_117852 crossref_primary_10_1038_s41598_021_93250_4 crossref_primary_10_1186_s12859_024_05975_4 crossref_primary_10_1016_j_eswa_2024_126245 crossref_primary_10_1038_s41598_023_37521_2 crossref_primary_10_1111_gbb_12753 crossref_primary_10_3389_fgene_2022_788580 crossref_primary_10_1038_s41598_023_36134_z crossref_primary_10_3389_fpls_2022_857535 crossref_primary_10_1016_j_scitotenv_2024_178288 crossref_primary_10_1038_s41698_022_00299_z crossref_primary_10_1038_s41467_023_41352_0 crossref_primary_10_1371_journal_pgen_1010833 crossref_primary_10_3390_genes14020387 crossref_primary_10_1016_j_taap_2024_116865 crossref_primary_10_3390_math9080882 crossref_primary_10_55525_tjst_1293057 crossref_primary_10_1007_s13721_022_00364_4 crossref_primary_10_1016_j_xpro_2021_100539 crossref_primary_10_1038_s41467_024_48025_6 crossref_primary_10_1007_s12145_022_00869_6 crossref_primary_10_1016_j_ajhg_2024_10_019 crossref_primary_10_1007_s12035_025_04803_x crossref_primary_10_1016_j_crfs_2023_100514 crossref_primary_10_1186_s13024_023_00638_z crossref_primary_10_1111_mec_16220 crossref_primary_10_1016_j_jprot_2024_105178 crossref_primary_10_1038_s41598_020_79624_0 crossref_primary_10_15324_kjcls_2023_55_4_235 crossref_primary_10_1016_j_heliyon_2024_e41242 crossref_primary_10_1111_mec_17145 crossref_primary_10_3390_en14133951 crossref_primary_10_1084_jem_20231758 crossref_primary_10_1093_jas_skac019 crossref_primary_10_1371_journal_pone_0260119 crossref_primary_10_3390_molecules25184350 crossref_primary_10_2147_JIR_S469297 crossref_primary_10_1002_cyto_a_24921 crossref_primary_10_1371_journal_pone_0257356 crossref_primary_10_1093_nar_gkab1175 crossref_primary_10_1089_cmb_2022_0243 crossref_primary_10_3389_fmolb_2021_791331
Cites_doi	10.3389/fmicb.2016.00794 10.1016/j.ygeno.2010.01.003 10.1016/j.ins.2012.10.017 10.1016/j.neuron.2015.05.005 10.1093/bioinformatics/btn224 10.1093/bioinformatics/btt688 10.18637/jss.v032.i03 10.1186/s12859-016-1212-5 10.1200/JCO.2017.35.15_suppl.e13025 10.1111/j.1474-9726.2012.00857.x 10.1093/biostatistics/kxv027 10.1016/j.ydbio.2012.04.018 10.1186/gb-2013-14-9-r95 10.1214/088342307000000087 10.1093/bioinformatics/btu638 10.1093/bioinformatics/btv425 10.14806/ej.17.1.200 10.1101/gr.231357.117 10.1016/j.neuron.2006.03.034 10.1186/s12859-018-2149-7 10.1186/s13059-014-0550-8 10.2202/1544-6115.1426 10.1016/j.tibtech.2017.02.012 10.1016/j.chemolab.2007.01.004 10.1093/bioinformatics/btr026 10.1198/004017004000000563 10.1016/j.aca.2011.03.055 10.1093/bioinformatics/btm487 10.1093/bioinformatics/btv272 10.1242/dev.00182 10.1093/bioinformatics/btn647 10.1186/s13059-016-0881-8 10.1523/JNEUROSCI.0688-18.2018 10.1186/1752-0509-6-63 10.1016/j.microc.2012.03.028 10.1007/BF02595862 10.1093/bioinformatics/bts635 10.1093/bioinformatics/btx790 10.1038/nbt.4096 10.1088/1742-6596/705/1/012003 10.1038/nmeth.1226
ContentType	Journal Article
Copyright	COPYRIGHT 2020 BioMed Central Ltd. 2020. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. The Author(s) 2020
Copyright_xml	– notice: COPYRIGHT 2020 BioMed Central Ltd. – notice: 2020. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. – notice: The Author(s) 2020
DBID	AAYXX CITATION CGR CUY CVF ECM EIF NPM ISR 3V. 7QO 7SC 7X7 7XB 88E 8AL 8AO 8FD 8FE 8FG 8FH 8FI 8FJ 8FK ABUWG AEUYN AFKRA ARAPS AZQEC BBNVY BENPR BGLVJ BHPHI CCPQU DWQXO FR3 FYUFA GHDGH GNUQQ HCIFZ JQ2 K7- K9. L7M LK8 L~C L~D M0N M0S M1P M7P P5Z P62 P64 PHGZM PHGZT PIMPY PJZUB PKEHL PPXIY PQEST PQGLB PQQKQ PQUKI PRINS Q9U 7X8 5PM DOA
DOI	10.1186/s12859-020-03608-0
DatabaseName	CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed Gale In Context: Science ProQuest Central (Corporate) Biotechnology Research Abstracts Computer and Information Systems Abstracts Health & Medical Collection ProQuest Central (purchase pre-March 2016) Medical Database (Alumni Edition) Computing Database (Alumni Edition) ProQuest Pharma Collection Technology Research Database ProQuest SciTech Collection ProQuest Technology Collection ProQuest Natural Science Collection ProQuest Hospital Collection Hospital Premium Collection (Alumni Edition) ProQuest Central (Alumni) (purchase pre-March 2016) ProQuest Central (Alumni) ProQuest One Sustainability (subscription) ProQuest Central UK/Ireland Advanced Technologies & Aerospace Collection ProQuest Central Essentials Biological Science Collection ProQuest Central Technology Collection Natural Science Collection ProQuest One Community College ProQuest Central Engineering Research Database Health Research Premium Collection Health Research Premium Collection (Alumni) ProQuest Central Student SciTech Premium Collection ProQuest Computer Science Collection Computer Science Database ProQuest Health & Medical Complete (Alumni) Advanced Technologies Database with Aerospace Biological Sciences Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional Computing Database Health & Medical Collection (Alumni) Medical Database Biological Science Database ProQuest Advanced Technologies & Aerospace Database ProQuest Advanced Technologies & Aerospace Collection Biotechnology and BioEngineering Abstracts ProQuest Central Premium ProQuest One Academic Publicly Available Content Database ProQuest Health & Medical Research Collection ProQuest One Academic Middle East (New) ProQuest One Health & Nursing ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China ProQuest Central Basic MEDLINE - Academic PubMed Central (Full Participant titles) Directory of Open Access Journals (DOAJ)
DatabaseTitle	CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) Publicly Available Content Database Computer Science Database ProQuest Central Student ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Essentials ProQuest Computer Science Collection Computer and Information Systems Abstracts SciTech Premium Collection ProQuest Central China ProQuest One Applied & Life Sciences ProQuest One Sustainability Health Research Premium Collection Natural Science Collection Health & Medical Research Collection Biological Science Collection ProQuest Central (New) ProQuest Medical Library (Alumni) Advanced Technologies & Aerospace Collection ProQuest Biological Science Collection ProQuest One Academic Eastern Edition ProQuest Hospital Collection ProQuest Technology Collection Health Research Premium Collection (Alumni) Biological Science Database ProQuest Hospital Collection (Alumni) Biotechnology and BioEngineering Abstracts ProQuest Health & Medical Complete ProQuest One Academic UKI Edition Engineering Research Database ProQuest One Academic ProQuest One Academic (New) Technology Collection Technology Research Database Computer and Information Systems Abstracts – Academic ProQuest One Academic Middle East (New) ProQuest Health & Medical Complete (Alumni) ProQuest Central (Alumni Edition) ProQuest One Community College ProQuest One Health & Nursing ProQuest Natural Science Collection ProQuest Pharma Collection ProQuest Central ProQuest Health & Medical Research Collection Biotechnology Research Abstracts Health and Medicine Complete (Alumni Edition) ProQuest Central Korea Advanced Technologies Database with Aerospace ProQuest Computing ProQuest Central Basic ProQuest Computing (Alumni Edition) ProQuest SciTech Collection Computer and Information Systems Abstracts Professional Advanced Technologies & Aerospace Database ProQuest Medical Library ProQuest Central (Alumni) MEDLINE - Academic
DatabaseTitleList	MEDLINE - Academic Publicly Available Content Database MEDLINE
Database_xml	– sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 2 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 3 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database – sequence: 4 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database
DeliveryMethod	fulltext_linktorsrc
Discipline	Biology
EISSN	1471-2105
EndPage	20
ExternalDocumentID	oai_doaj_org_article_1fa90cef09d148dca3837e08def28b65 PMC7324992 A628212871 32600248 10_1186_s12859_020_03608_0
Genre	Journal Article
GrantInformation_xml	– fundername: NIDA NIH HHS grantid: 5R25DA027995 – fundername: NHGRI NIH HHS grantid: R01 HG007175 – fundername: NIEHS NIH HHS grantid: U24 ES026699 – fundername: NHGRI NIH HHS grantid: U41 HG010972 – fundername: NIEHS NIH HHS grantid: U24ES026699 – fundername: NHGRI NIH HHS grantid: U41HG010972 – fundername: NIDA NIH HHS grantid: R25 DA027995 – fundername: NHGRI NIH HHS grantid: U01HG009391 – fundername: NCI NIH HHS grantid: U01 CA200060 – fundername: national institute of health grantid: NS041021 – fundername: ; grantid: R01HG007175; U01HG009391; U41HG010972 – fundername: ; grantid: U24ES026699 – fundername: ; grantid: 5R25DA027995 – fundername: ; grantid: NS041021 – fundername: ; grantid: Goldman Sachs Philanthropy Fund
GroupedDBID	--- 0R~ 23N 2WC 53G 5VS 6J9 7X7 88E 8AO 8FE 8FG 8FH 8FI 8FJ AAFWJ AAJSJ AAKPC AASML AAYXX ABDBF ABUWG ACGFO ACGFS ACIHN ACIWK ACPRK ACUHS ADBBV ADMLS ADUKV AEAQA AENEX AEUYN AFKRA AFPKN AFRAH AHBYD AHMBA AHYZX ALIPV ALMA_UNASSIGNED_HOLDINGS AMKLP AMTXH AOIJS ARAPS AZQEC BAPOH BAWUL BBNVY BCNDV BENPR BFQNJ BGLVJ BHPHI BMC BPHCQ BVXVI C6C CCPQU CITATION CS3 DIK DU5 DWQXO E3Z EAD EAP EAS EBD EBLON EBS EMB EMK EMOBN ESX F5P FYUFA GNUQQ GROUPED_DOAJ GX1 HCIFZ HMCUK HYE IAO ICD IHR INH INR ISR ITC K6V K7- KQ8 LK8 M1P M48 M7P MK~ ML0 M~E O5R O5S OK1 OVT P2P P62 PGMZT PHGZM PHGZT PIMPY PQQKQ PROAC PSQYO RBZ RNS ROL RPM RSV SBL SOJ SV3 TR2 TUS UKHRP W2D WOQ WOW XH6 XSB CGR CUY CVF ECM EIF NPM PJZUB PPXIY PQGLB PMFND 3V. 7QO 7SC 7XB 8AL 8FD 8FK FR3 JQ2 K9. L7M L~C L~D M0N P64 PKEHL PQEST PQUKI PRINS Q9U 7X8 5PM PUEGO
ID	FETCH-LOGICAL-c597t-ca8b86ee38a9fb091265e4494a16e22785d0dfe38e60edfa9d2ed6ebfb6a1ec93
IEDL.DBID	M48
ISSN	1471-2105
IngestDate	Wed Aug 27 01:23:57 EDT 2025 Thu Aug 21 14:36:56 EDT 2025 Mon Jul 21 09:41:11 EDT 2025 Fri Jul 25 10:39:46 EDT 2025 Tue Jun 17 21:38:53 EDT 2025 Tue Jun 10 20:28:13 EDT 2025 Fri Jun 27 04:49:48 EDT 2025 Mon Jul 21 05:33:02 EDT 2025 Thu Apr 24 22:53:08 EDT 2025 Tue Jul 01 03:38:30 EDT 2025
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Issue	1
Keywords	High-dimensional data Robust principal component analysis RNA-seq Outlier detection Anomaly detection PcaGrid PcaHubert
Language	English
License	Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c597t-ca8b86ee38a9fb091265e4494a16e22785d0dfe38e60edfa9d2ed6ebfb6a1ec93
Notes	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
ORCID	0000-0001-5615-6774
OpenAccessLink	http://journals.scholarsportal.info/openUrl.xqy?doi=10.1186/s12859-020-03608-0
PMID	32600248
PQID	2424712442
PQPubID	44065
PageCount	20
ParticipantIDs	doaj_primary_oai_doaj_org_article_1fa90cef09d148dca3837e08def28b65 pubmedcentral_primary_oai_pubmedcentral_nih_gov_7324992 proquest_miscellaneous_2419087147 proquest_journals_2424712442 gale_infotracmisc_A628212871 gale_infotracacademiconefile_A628212871 gale_incontextgauss_ISR_A628212871 pubmed_primary_32600248 crossref_primary_10_1186_s12859_020_03608_0 crossref_citationtrail_10_1186_s12859_020_03608_0
ProviderPackageCode	CITATION AAYXX
PublicationCentury	2000
PublicationDate	2020-06-29
PublicationDateYYYYMMDD	2020-06-29
PublicationDate_xml	– month: 06 year: 2020 text: 2020-06-29 day: 29
PublicationDecade	2020
PublicationPlace	England
PublicationPlace_xml	– name: England – name: London
PublicationTitle	BMC bioinformatics
PublicationTitleAlternate	BMC Bioinformatics
PublicationYear	2020
Publisher	BioMed Central Ltd BioMed Central BMC
Publisher_xml	– name: BioMed Central Ltd – name: BioMed Central – name: BMC
References	V Todorov (3608_CR22) 2009; 32 GD Orvis (3608_CR25) 2012; 367 A Mortazavi (3608_CR3) 2008; 5 3608_CR7 GV Cohen Freue (3608_CR9) 2007; 23 P Manga (3608_CR17) 2016; 7 A Dobin (3608_CR32) 2013; 29 A Kauffmann (3608_CR11) 2010; 95 M Hubert (3608_CR37) 2008; 23 Y Liu (3608_CR16) 2014; 30 P Du (3608_CR14) 2008; 24 M Martin (3608_CR30) 2011; 17 M Gierlinski (3608_CR43) 2015; 31 S Yang (3608_CR13) 2007; 2 D Pan (3608_CR41) 2012; 11 A Conesa (3608_CR4) 2016; 17 C Croux (3608_CR36) 2007; 87 MB Lopes (3608_CR18) 2018; 19 N Locantore (3608_CR38) 1999; 8 J Stegmuller (3608_CR40) 2006; 50 3608_CR35 T Omura (3608_CR28) 2015; 86 P Filzmoser (3608_CR20) 2011; 705 AM Kenney (3608_CR39) 2003; 130 V Nygaard (3608_CR44) 2016; 17 3608_CR1 C Xu (3608_CR29) 2018; 28 R Schmieder (3608_CR31) 2011; 27 PJ Rousseeuw (3608_CR2) 2018; 8 MI Love (3608_CR34) 2014; 15 AD Shieh (3608_CR12) 2009; 8 A Kauffmann (3608_CR10) 2009; 25 SS Norton (3608_CR5) 2018; 34 P Filzmoser (3608_CR21) 2013; 245 MC Oldham (3608_CR8) 2012; 6 M Hubert (3608_CR19) 2005; 47 AC Frazee (3608_CR26) 2015; 31 3608_CR42 GA Merino (3608_CR6) 2016; 705 S Anders (3608_CR33) 2015; 31 Y Oytam (3608_CR45) 2016; 17 M Cláudia Pascoal (3608_CR24) 2010 WWB Goh (3608_CR46) 2017; 35 F Rapaport (3608_CR15) 2013; 14 WFD Rocha (3608_CR23) 2013; 109 3608_CR27
References_xml	– volume: 7 start-page: 794 year: 2016 ident: 3608_CR17 publication-title: Front Microbiol doi: 10.3389/fmicb.2016.00794 – volume: 95 start-page: 138 issue: 3 year: 2010 ident: 3608_CR11 publication-title: Genomics doi: 10.1016/j.ygeno.2010.01.003 – volume: 245 start-page: 4 year: 2013 ident: 3608_CR21 publication-title: Inform Sci doi: 10.1016/j.ins.2012.10.017 – volume: 86 start-page: 1215 issue: 5 year: 2015 ident: 3608_CR28 publication-title: Neuron doi: 10.1016/j.neuron.2015.05.005 – volume: 2 start-page: 351 year: 2007 ident: 3608_CR13 publication-title: Cancer Inform – volume: 24 start-page: 1547 issue: 13 year: 2008 ident: 3608_CR14 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btn224 – volume: 30 start-page: 301 issue: 3 year: 2014 ident: 3608_CR16 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btt688 – volume: 32 start-page: 1 issue: 3 year: 2009 ident: 3608_CR22 publication-title: J Stat Softw doi: 10.18637/jss.v032.i03 – volume: 17 start-page: 332 issue: 1 year: 2016 ident: 3608_CR45 publication-title: BMC Bioinformatics doi: 10.1186/s12859-016-1212-5 – ident: 3608_CR42 doi: 10.1200/JCO.2017.35.15_suppl.e13025 – ident: 3608_CR7 – volume: 11 start-page: 902 issue: 5 year: 2012 ident: 3608_CR41 publication-title: Aging Cell doi: 10.1111/j.1474-9726.2012.00857.x – volume: 17 start-page: 29 issue: 1 year: 2016 ident: 3608_CR44 publication-title: Biostatistics doi: 10.1093/biostatistics/kxv027 – volume-title: Detection of outliers using robust principal component analysis: a simulation study, vol. 77 year: 2010 ident: 3608_CR24 – volume: 367 start-page: 25 issue: 1 year: 2012 ident: 3608_CR25 publication-title: Dev Biol doi: 10.1016/j.ydbio.2012.04.018 – volume: 14 start-page: R95 issue: 9 year: 2013 ident: 3608_CR15 publication-title: Genome Biol doi: 10.1186/gb-2013-14-9-r95 – volume: 23 start-page: 92 issue: 1 year: 2008 ident: 3608_CR37 publication-title: Stat Sci doi: 10.1214/088342307000000087 – ident: 3608_CR1 – volume: 31 start-page: 166 issue: 2 year: 2015 ident: 3608_CR33 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btu638 – volume: 31 start-page: 3625 issue: 22 year: 2015 ident: 3608_CR43 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btv425 – volume: 17 start-page: 10 year: 2011 ident: 3608_CR30 publication-title: EMBnet J doi: 10.14806/ej.17.1.200 – volume: 28 start-page: 1097 issue: 8 year: 2018 ident: 3608_CR29 publication-title: Genome Res doi: 10.1101/gr.231357.117 – volume: 50 start-page: 389 issue: 3 year: 2006 ident: 3608_CR40 publication-title: Neuron doi: 10.1016/j.neuron.2006.03.034 – volume: 19 start-page: 168 issue: 1 year: 2018 ident: 3608_CR18 publication-title: BMC Bioinformatics doi: 10.1186/s12859-018-2149-7 – volume: 15 start-page: 550 issue: 12 year: 2014 ident: 3608_CR34 publication-title: Genome Biol doi: 10.1186/s13059-014-0550-8 – volume: 8 start-page: Article 13 year: 2009 ident: 3608_CR12 publication-title: Stat Appl Genet Mol Biol doi: 10.2202/1544-6115.1426 – volume: 35 start-page: 498 issue: 6 year: 2017 ident: 3608_CR46 publication-title: Trends Biotechnol doi: 10.1016/j.tibtech.2017.02.012 – volume: 87 start-page: 218 issue: 2 year: 2007 ident: 3608_CR36 publication-title: Chemometr Intell Lab doi: 10.1016/j.chemolab.2007.01.004 – volume: 27 start-page: 863 issue: 6 year: 2011 ident: 3608_CR31 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btr026 – volume: 47 start-page: 64 issue: 1 year: 2005 ident: 3608_CR19 publication-title: Technometrics doi: 10.1198/004017004000000563 – volume: 705 start-page: 2 issue: 1–2 year: 2011 ident: 3608_CR20 publication-title: Anal Chim Acta doi: 10.1016/j.aca.2011.03.055 – volume: 23 start-page: 3162 issue: 23 year: 2007 ident: 3608_CR9 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btm487 – volume: 31 start-page: 2778 issue: 17 year: 2015 ident: 3608_CR26 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btv272 – volume: 130 start-page: 15 issue: 1 year: 2003 ident: 3608_CR39 publication-title: Development doi: 10.1242/dev.00182 – volume: 25 start-page: 415 issue: 3 year: 2009 ident: 3608_CR10 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btn647 – volume: 17 start-page: 13 year: 2016 ident: 3608_CR4 publication-title: Genome Biol doi: 10.1186/s13059-016-0881-8 – ident: 3608_CR27 doi: 10.1523/JNEUROSCI.0688-18.2018 – volume: 6 start-page: 63 year: 2012 ident: 3608_CR8 publication-title: BMC Syst Biol doi: 10.1186/1752-0509-6-63 – volume: 109 start-page: 112 year: 2013 ident: 3608_CR23 publication-title: Microchem J doi: 10.1016/j.microc.2012.03.028 – volume: 8 start-page: 1 issue: 1 year: 1999 ident: 3608_CR38 publication-title: Test doi: 10.1007/BF02595862 – volume: 8 start-page: 1 issue: 2 year: 2018 ident: 3608_CR2 publication-title: WIREs: Data Mining Knowl Discovery – volume: 29 start-page: 15 issue: 1 year: 2013 ident: 3608_CR32 publication-title: Bioinformatics doi: 10.1093/bioinformatics/bts635 – volume: 34 start-page: 1488 issue: 9 year: 2018 ident: 3608_CR5 publication-title: Bioinformatics doi: 10.1093/bioinformatics/btx790 – ident: 3608_CR35 doi: 10.1038/nbt.4096 – volume: 705 start-page: 012003 year: 2016 ident: 3608_CR6 publication-title: J Phys Conf Ser doi: 10.1088/1742-6596/705/1/012003 – volume: 5 start-page: 621 issue: 7 year: 2008 ident: 3608_CR3 publication-title: Nat Methods doi: 10.1038/nmeth.1226
SSID	ssj0017805
Score	2.537247
Snippet	High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition, extreme... Background High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data acquisition,... Abstract Background High throughput RNA sequencing is a powerful approach to study gene expression. Due to the complex multiple-steps protocols in data...
SourceID	doaj pubmedcentral proquest gale pubmed crossref
SourceType	Open Website Open Access Repository Aggregation Database Index Database Enrichment Source
StartPage	269
SubjectTerms	Algorithms Analysis Animals Bias Cerebellum Cerebellum - metabolism Chemometrics Computer simulation Data acquisition Data analysis Data points Data processing Datasets Experiments Female Functional analysis Gene expression Gene sequencing Genes High-dimensional data Information management Male Mice, Knockout Modelling Multivariate analysis Outlier detection Outliers (statistics) PcaGrid PcaHubert Performance enhancement Performance evaluation Principal Component Analysis Principal components analysis Proto-Oncogene Proteins - genetics Reverse Transcriptase Polymerase Chain Reaction Reverse transcription Ribonucleic acid RNA RNA sequencing RNA-seq RNA-Seq - methods Robust principal component analysis Sample variance Samples Simulation Statistical analysis Statistical methods Transcription (Genetics)
SummonAdditionalLinks	– databaseName: Directory of Open Access Journals (DOAJ) dbid: DOA link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3Pi9UwEA6yIHgR1591V4kieJCyaZqXlxyf4rIK7uGtC-sppMlEF6Rvte3B_96ZtO_xiqAXr820tF9nMjPtzDeMvTJQyRBsLEGkpkSPH9Hm0Nxt0o0AjBBU5in4dK7PLtXHq8XV3qgvqgkb6YFH4E6q5K0IkISNGLnH4CmlAmEiJGkandlL0edtk6np_wEx9W9bZIw-6SriaSspVcIdm374z9xQZuv_c0_ec0rzgsk9D3R6j92dQke-Gm_5kN2C9j67PQ6T_PWAfVlvmqHr-c34-RwlqVx80-LFuJ-oRziGqNyHMBBBBKdiIPSKvPNEEcwj9Lkuq-XXLV-fr8oL-MGpgvQhuzx9__ndWTkNTigD5gd9GbxpjAaojbepQXSkXoBSVvlKA_W-LqKICZdBC4gIcJQQNTSp0b6CYOtH7KDF-3vCeFKeRkcoqOukEADvRTSaevdUwtCsLli1xdGFiVWchlt8dzm7MNqN2DvE3mXsnSjYm905NyOnxl-l39Lr2UkSH3Y-gFriJi1x_9KSgr2kl-uI8aKlkpqvfug69-Fi7VYas86KEseCvZ6E0gafIfipQwGRIJKsmeTxTBJNMsyXtzrkpi2hc9SHs6RoShbsxW6ZzqQytxY2A8lgfIYXUMuCPR5VbvfcNY0SkMoUbDlTxhkw85X2-lsmDF9i1GytfPo_kDxid2S2I11Ke8wO-p8DPMO4rG-eZxP8DXEdNXU priority: 102 providerName: Directory of Open Access Journals – databaseName: ProQuest Technology Collection dbid: 8FG link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3Nb9UwDI9gCIkL4pvCQAEhcUDV-pGXl5zQA_EYSOzwxqRxitLE2Sah9m1tD_z32G1eWYW0a-NUiRPHdmL_zNg7BXnhnPYpZKFKUeN7lDkUdx1klQFaCGLAKfhxJA9PxPfTxWm8cGtjWOXuTBwOat84uiM_oDSGJSmj4uP2MqWqUfS6Gkto3GZ3ctQ0FNKl1l-nVwTC698lyih50OaE1paSw4TnNj37z5TRgNn__8l8TTXNwyav6aH1A3Y_GpB8Na74Q3YL6kfs7lhS8s9j9mvTVH3b8e14iY6UFDTe1PgzbiMACUdDlVvneoKJ4BQShLqRt5aAgrmHbojOqvlFzTdHq_QYLjnFkT5hJ-svPz8fprF8QurQS-hSZ1WlJECprA4V2gWFXIAQWthcAmXALnzmAzaDzMAHq30BXkIVKmlzcLp8yvZqHN9zxoOwVEBCQFkGgQywNvNKUgafCGiglQnLd3w0LmKLU4mL32bwMZQ0I-8N8t4MvDdZwj5MfbYjssaN1J9oeSZKQsUePjRXZyYKmclxEpmDkGmPXh4Ok9xvyJSHUKhKLhL2lhbXEO5FTYE1Z7ZvW_PteGNWEn3PnNzHhL2PRKHBOTgb8xSQEwSVNaPcn1GiYLp5824PmXgwtObfNk7Ym6mZelKwWw1NTzRopeEPxDJhz8YtN827pIIChVAJW84244wx85b64nyADV-i7ax18eLmYb1k94pBQmRa6H2211318Artrq56PQjXX1qwK_s priority: 102 providerName: ProQuest
Title	Robust principal component analysis for accurate outlier sample detection in RNA-Seq data
URI	https://www.ncbi.nlm.nih.gov/pubmed/32600248 https://www.proquest.com/docview/2424712442 https://www.proquest.com/docview/2419087147 https://pubmed.ncbi.nlm.nih.gov/PMC7324992 https://doaj.org/article/1fa90cef09d148dca3837e08def28b65
Volume	21
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3da9tADBf9YLCX0X177cJtDPYwvPnjcj4_jJGOZl2gYSQLZE_H-T66QnHaOIH1v59kO1nNSl9s8OnMnSxZki39BPBOujgxJrehi3wRosW3qHOo7rkXReTQQ-A1TsHZWJzO-Gjen-_Apt1Ry8DqztCO-knNlpcf_1zffEGF_1wrvBSfqphQ2EIKhPB9TL_zd2EfLVNGHQ3O-L-_CoTfvymcuXNexzjVGP7_v6lvmapuGuUtuzQ8gEetQ8kGjQQ8hh1XPoEHTYvJm6fwa7Io1tWKXTUf1ZGSksgXJd6M6RaQhKHjyrQxa4KNYJQihLaSVZqAg5l1qzpbq2QXJZuMB-HUXTPKK30Gs-HJz6-nYdtOITQYNaxCo2UhhXOp1Lkv0E9IRN9xnnMdC0cVsX0bWY_DTkTOep3bxFnhCl8IHTuTp89hr8T1vQTmuaaGEtylqefIAK0jKwVV9HGPDlsaQLzhozIt1ji1vLhUdcwhhWp4r5D3qua9igL4sJ1z1SBt3Et9TI9nS0ko2fWFxfJctUqnYtxEZJyPcotRHy6TwnEXSet8IgvRD-AtPVxFOBglJdqc63VVqe_TiRoIjEVjCicDeN8S-QXuwei2bgE5QdBZHcqjDiUqqukOb2RIbeRcUXVORj5WEsCb7TDNpOS30i3WRINeG96AZwG8aERuu--UGgwkXAaQdYSxw5juSHnxu4YRz9CXzvPk1f3LOoSHSa0hIkzyI9hbLdfuNfphq6IHu9k8w6McfuvB_mAwmo7wfHwy_jHp1d82erX6_QWZpDUX
linkProvider	Scholars Portal
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1Lb9QwELaqIgQXxJtAAYNAHFDUxPF6kwNCy2PZpe0etq3Unoxjj0sllGybXaH-KX4jM3ksjZB66zWeRPZ4xjPjzHzD2JsUYmFt5kKIfB6ixXeoc6jumVd5BOghyBqnYG-mJofy-9HgaIP96WphKK2yOxPrg9qVlu7It6mMYUjGSHxcnIXUNYr-rnYtNBqx2IGL3xiyVR-mX3B_3wox_nrweRK2XQVCi87zMrQmzVMFkKQm8zmaS6EGIGUmTayACkMHLnIeh0FF4LzJnACnIPe5MjFYAl_CI_-GTNCSU2X6-Nv6rwX1B-gKc1K1XcWEDhdSgIZ2gtIMesav7hHwvyW4ZAr7aZqX7N74LrvTOqx81EjYPbYBxX12s2lhefGAHc_LfFUt-aK5tEdKSlIvC_wYNy3gCUfHmBtrVwRLwSkFCW0xrwwBE3MHyzobrOCnBZ_PRuE-nHHKW33IDq-FsY_YZoHze8K4l4YaVkhIEi-RAcZELlVUMSg9OoRJwOKOj9q2WObUUuOXrmOaVOmG9xp5r2ve6yhg79fvLBokjyupP9H2rCkJhbt-UJ6f6FapdYyLiCz4KHMYVeI0KdyHKHXgRZqrQcBe0-ZqwtkoKJHnxKyqSk_353qkMNaNKVwN2LuWyJe4BmvaugjkBEFz9Si3epR4ENj-cCdDuj2IKv1PbQL2aj1Mb1JyXQHlimjQK8QPyGHAHjcit153Qg0MhEwDNuwJY48x_ZHi9GcNUz5EXz3LxNOrp_WS3Zoc7O3q3els5xm7LWptUaHIttjm8nwFz9HnW-YvakXj7Md1a_Zfeg9rHw
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Robust+principal+component+analysis+for+accurate+outlier+sample+detection+in+RNA-Seq+data&rft.jtitle=BMC+bioinformatics&rft.au=Chen%2C+Xiaoying&rft.au=Zhang%2C+Bo&rft.au=Wang%2C+Ting&rft.au=Azad+Bonni&rft.date=2020-06-29&rft.pub=BioMed+Central&rft.eissn=1471-2105&rft.volume=21&rft.spage=1&rft_id=info:doi/10.1186%2Fs12859-020-03608-0
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1471-2105&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1471-2105&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1471-2105&client=summon