Microarray Data Normalization and Robust Detection of Rhythmic Features

Data derived from microarray technologies are generally subject to various sources of noise and accordingly the raw data are pre-processed before formally analysed. Data normalization is a key pre-processing step when dealing with microarray experiments, such as circadian gene-expressions, since it...

Full description

Saved in:
Bibliographic Details
Published inMethods in molecular biology (Clifton, N.J.) Vol. 1986; p. 207
Main Authors Larriba, Yolanda, Rueda, Cristina, Fernández, Miguel A, Peddada, Shyamal D
Format Journal Article
LanguageEnglish
Published United States 01.01.2019
Subjects
Online AccessGet more information
ISSN1940-6029
DOI10.1007/978-1-4939-9442-7_9

Cover

Abstract Data derived from microarray technologies are generally subject to various sources of noise and accordingly the raw data are pre-processed before formally analysed. Data normalization is a key pre-processing step when dealing with microarray experiments, such as circadian gene-expressions, since it removes systematic variations across arrays. A wide variety of normalization methods are available in the literature. However, from our experience in the study of rhythmic expression patterns in oscillatory systems (e.g. cell-cycle, circadian clock), the choice of the normalization method may substantially impair the identification of rhythmic genes. Hence, the identification of a gene as rhythmic could be just as an artefact of how the data were normalized. Yet, gene rhythmicity detection is crucial in modern toxicological and pharmacological studies, thus a procedure to truly identify rhythmic genes that are robust to the choice of a normalization method is required.To perform the task of detecting rhythmic features, we propose a rhythmicity measure based on bootstrap methodology to robustly identify rhythmic genes in oscillatory systems. Although our methodology can be extended to any high-throughput experiment, in this chapter, we illustrate how to apply it to a publicly available circadian clock microarray gene-expression data and give full details (both statistical and computational) so that the methodology can be used in an easy way. We will show that the choice of normalization method has very little effect on the proposed methodology since the results derived from the bootstrap-based rhythmicity measure are highly rank correlated for any pair of normalization methods considered. This suggests, on the one hand, that the rhythmicity measure proposed is robust to the choice of the normalization method, and on the other hand, that gene rhythmicity detected using this measure is potentially not a mere artefact of the normalization method used. In this way the researcher using this methodology will be protected against the possible effect of different normalizations, as the conclusions obtained will not depend so strongly on them. Additionally, the described bootstrap methodology can also be employed as a tool to simulate gene-expression participating in an oscillatory system from a reference data set.
AbstractList Data derived from microarray technologies are generally subject to various sources of noise and accordingly the raw data are pre-processed before formally analysed. Data normalization is a key pre-processing step when dealing with microarray experiments, such as circadian gene-expressions, since it removes systematic variations across arrays. A wide variety of normalization methods are available in the literature. However, from our experience in the study of rhythmic expression patterns in oscillatory systems (e.g. cell-cycle, circadian clock), the choice of the normalization method may substantially impair the identification of rhythmic genes. Hence, the identification of a gene as rhythmic could be just as an artefact of how the data were normalized. Yet, gene rhythmicity detection is crucial in modern toxicological and pharmacological studies, thus a procedure to truly identify rhythmic genes that are robust to the choice of a normalization method is required.To perform the task of detecting rhythmic features, we propose a rhythmicity measure based on bootstrap methodology to robustly identify rhythmic genes in oscillatory systems. Although our methodology can be extended to any high-throughput experiment, in this chapter, we illustrate how to apply it to a publicly available circadian clock microarray gene-expression data and give full details (both statistical and computational) so that the methodology can be used in an easy way. We will show that the choice of normalization method has very little effect on the proposed methodology since the results derived from the bootstrap-based rhythmicity measure are highly rank correlated for any pair of normalization methods considered. This suggests, on the one hand, that the rhythmicity measure proposed is robust to the choice of the normalization method, and on the other hand, that gene rhythmicity detected using this measure is potentially not a mere artefact of the normalization method used. In this way the researcher using this methodology will be protected against the possible effect of different normalizations, as the conclusions obtained will not depend so strongly on them. Additionally, the described bootstrap methodology can also be employed as a tool to simulate gene-expression participating in an oscillatory system from a reference data set.
Author Fernández, Miguel A
Rueda, Cristina
Peddada, Shyamal D
Larriba, Yolanda
Author_xml – sequence: 1
  givenname: Yolanda
  surname: Larriba
  fullname: Larriba, Yolanda
  email: yolanda.larriba@uva.es
  organization: Departamento de Estadística e Investigación Operativa, Universidad de Valladolid, Valladolid, Spain. yolanda.larriba@uva.es
– sequence: 2
  givenname: Cristina
  surname: Rueda
  fullname: Rueda, Cristina
  organization: Departamento de Estadística e Investigación Operativa, Universidad de Valladolid, Valladolid, Spain
– sequence: 3
  givenname: Miguel A
  surname: Fernández
  fullname: Fernández, Miguel A
  organization: Departamento de Estadística e Investigación Operativa, Universidad de Valladolid, Valladolid, Spain
– sequence: 4
  givenname: Shyamal D
  surname: Peddada
  fullname: Peddada, Shyamal D
  organization: Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, USA
BackLink https://www.ncbi.nlm.nih.gov/pubmed/31115890$$D View this record in MEDLINE/PubMed
BookMark eNo1j8tKAzEUQIMo9qFfIEh-IJrnZO5SWluFqlB0PdzJJHakMymZzKJ-fcHH6sBZHDgzct7H3hNyI_id4Nzegy2ZYBoUMNBaMlvBGZkK0JwVXMKEzIbhi3NtldSXZKKEEKYEPiXrl9aliCnhkS4xI32NqcN9-425jT3FvqHbWI9DpkufvfuRMdDt7ph3XevoymMekx-uyEXA_eCv_zgnH6vH98UT27ytnxcPG_aplMnMOIuuVuDRSChc44WSpgwaShPKBkVRCwHSCagt98oEb6S2MijvpOKKSzknt7_dw1h3vqkOqe0wHav_I3kCUBJNww
ContentType Journal Article
DBID CGR
CUY
CVF
ECM
EIF
NPM
DOI 10.1007/978-1-4939-9442-7_9
DatabaseName Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
DatabaseTitle MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
DatabaseTitleList MEDLINE
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod no_fulltext_linktorsrc
Discipline Biology
EISSN 1940-6029
ExternalDocumentID 31115890
Genre Journal Article
GroupedDBID ---
29M
53G
ACGFS
ALMA_UNASSIGNED_HOLDINGS
CGR
CUY
CVF
ECM
EIF
F5P
NPM
P2P
RSU
SPO
UDS
WH7
ZGI
ID FETCH-LOGICAL-g335t-5c7acb39ea5296cde13258f4985f8da16b1192c19b70e35fe52472f3ec2303022
IngestDate Thu Jan 02 22:59:08 EST 2025
IsPeerReviewed false
IsScholarly true
Keywords Circadian genes
Pre-processing
Oscillatory systems
Rhythmicity
High-throughput technologies
Normalization
Microarray
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-g335t-5c7acb39ea5296cde13258f4985f8da16b1192c19b70e35fe52472f3ec2303022
PMID 31115890
ParticipantIDs pubmed_primary_31115890
PublicationCentury 2000
PublicationDate 2019-01-01
PublicationDateYYYYMMDD 2019-01-01
PublicationDate_xml – month: 01
  year: 2019
  text: 2019-01-01
  day: 01
PublicationDecade 2010
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle Methods in molecular biology (Clifton, N.J.)
PublicationTitleAlternate Methods Mol Biol
PublicationYear 2019
SSID ssj0047324
Score 2.1640115
Snippet Data derived from microarray technologies are generally subject to various sources of noise and accordingly the raw data are pre-processed before formally...
SourceID pubmed
SourceType Index Database
StartPage 207
SubjectTerms Algorithms
Cell Line, Tumor
Gene Expression Regulation
Humans
Oligonucleotide Array Sequence Analysis - methods
Statistics, Nonparametric
Title Microarray Data Normalization and Robust Detection of Rhythmic Features
URI https://www.ncbi.nlm.nih.gov/pubmed/31115890
Volume 1986
hasFullText
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LbxMxELZSEKgXRMsbWvnArdoqu7Zj-4gqoKqaCpVWKqfIz6ZSk1Rh9xB-BT-Zsb3epIFWwGUV2dmVtd_425nxPBB6T7Vj4bipGAjjCsrdoNCMVUXpvTDUllLb4NAfngwOz-nRBbvo9X6uRC01td43P_6YV_I_qMIY4BqyZP8B2e6hMAC_AV-4AsJw_SuMhyGaTs3nagHo1SocwkzUdZtZmVIPZ7r5XgOr1M5k3fB0vKjHISQ-qH_NvA0izF2dYkfpGCQ7yZ1z93KlpnTi6-ucpHW0v-JIOIaFXOmoi36bhXjJjvBPG2fj-EFklGk3EZzY8aS-zJ7s4dVl466XDtYvzlqVbv46XqhJIOlVR0XIjeocFS6RqwypBv12YZl9pbhFoKkH7m_EvozlAJNXEllISsE0GMnVfwM6N5OINQEKZyI1Ir1_dq3adp7aQBucB6I_Cd6f9GWnHLTPrnJVKk68tppN9Dg_Yc1OifrK2VP0pDU08IckNVuo56bb6FFqPbp4hj4vZQcH2cG3ZAcDfDjJDu5kB888zrKDs-w8R-efPp4dHBZtU43ikhBWF8xwZTSRToUTd2NdSSomPJWCeWFVOdAlKP0GNinvO8K8YxXllSfOgLFKQON7gR5MZ1P3CmFLPWcVodSKPlVcwLYW3irPrPYlrcrX6GV6A6ObVDlllN_Nmztn3qLNpey8Qw89bFW3A3pfrXcjGL8AKz1T5Q
linkProvider National Library of Medicine
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Microarray+Data+Normalization+and+Robust+Detection+of+Rhythmic+Features&rft.jtitle=Methods+in+molecular+biology+%28Clifton%2C+N.J.%29&rft.au=Larriba%2C+Yolanda&rft.au=Rueda%2C+Cristina&rft.au=Fern%C3%A1ndez%2C+Miguel+A&rft.au=Peddada%2C+Shyamal+D&rft.date=2019-01-01&rft.eissn=1940-6029&rft.volume=1986&rft.spage=207&rft_id=info:doi/10.1007%2F978-1-4939-9442-7_9&rft_id=info%3Apmid%2F31115890&rft_id=info%3Apmid%2F31115890&rft.externalDocID=31115890