Microarray Data Normalization and Robust Detection of Rhythmic Features
Data derived from microarray technologies are generally subject to various sources of noise and accordingly the raw data are pre-processed before formally analysed. Data normalization is a key pre-processing step when dealing with microarray experiments, such as circadian gene-expressions, since it...
Saved in:
Published in | Methods in molecular biology (Clifton, N.J.) Vol. 1986; p. 207 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
United States
01.01.2019
|
Subjects | |
Online Access | Get more information |
ISSN | 1940-6029 |
DOI | 10.1007/978-1-4939-9442-7_9 |
Cover
Abstract | Data derived from microarray technologies are generally subject to various sources of noise and accordingly the raw data are pre-processed before formally analysed. Data normalization is a key pre-processing step when dealing with microarray experiments, such as circadian gene-expressions, since it removes systematic variations across arrays. A wide variety of normalization methods are available in the literature. However, from our experience in the study of rhythmic expression patterns in oscillatory systems (e.g. cell-cycle, circadian clock), the choice of the normalization method may substantially impair the identification of rhythmic genes. Hence, the identification of a gene as rhythmic could be just as an artefact of how the data were normalized. Yet, gene rhythmicity detection is crucial in modern toxicological and pharmacological studies, thus a procedure to truly identify rhythmic genes that are robust to the choice of a normalization method is required.To perform the task of detecting rhythmic features, we propose a rhythmicity measure based on bootstrap methodology to robustly identify rhythmic genes in oscillatory systems. Although our methodology can be extended to any high-throughput experiment, in this chapter, we illustrate how to apply it to a publicly available circadian clock microarray gene-expression data and give full details (both statistical and computational) so that the methodology can be used in an easy way. We will show that the choice of normalization method has very little effect on the proposed methodology since the results derived from the bootstrap-based rhythmicity measure are highly rank correlated for any pair of normalization methods considered. This suggests, on the one hand, that the rhythmicity measure proposed is robust to the choice of the normalization method, and on the other hand, that gene rhythmicity detected using this measure is potentially not a mere artefact of the normalization method used. In this way the researcher using this methodology will be protected against the possible effect of different normalizations, as the conclusions obtained will not depend so strongly on them. Additionally, the described bootstrap methodology can also be employed as a tool to simulate gene-expression participating in an oscillatory system from a reference data set. |
---|---|
AbstractList | Data derived from microarray technologies are generally subject to various sources of noise and accordingly the raw data are pre-processed before formally analysed. Data normalization is a key pre-processing step when dealing with microarray experiments, such as circadian gene-expressions, since it removes systematic variations across arrays. A wide variety of normalization methods are available in the literature. However, from our experience in the study of rhythmic expression patterns in oscillatory systems (e.g. cell-cycle, circadian clock), the choice of the normalization method may substantially impair the identification of rhythmic genes. Hence, the identification of a gene as rhythmic could be just as an artefact of how the data were normalized. Yet, gene rhythmicity detection is crucial in modern toxicological and pharmacological studies, thus a procedure to truly identify rhythmic genes that are robust to the choice of a normalization method is required.To perform the task of detecting rhythmic features, we propose a rhythmicity measure based on bootstrap methodology to robustly identify rhythmic genes in oscillatory systems. Although our methodology can be extended to any high-throughput experiment, in this chapter, we illustrate how to apply it to a publicly available circadian clock microarray gene-expression data and give full details (both statistical and computational) so that the methodology can be used in an easy way. We will show that the choice of normalization method has very little effect on the proposed methodology since the results derived from the bootstrap-based rhythmicity measure are highly rank correlated for any pair of normalization methods considered. This suggests, on the one hand, that the rhythmicity measure proposed is robust to the choice of the normalization method, and on the other hand, that gene rhythmicity detected using this measure is potentially not a mere artefact of the normalization method used. In this way the researcher using this methodology will be protected against the possible effect of different normalizations, as the conclusions obtained will not depend so strongly on them. Additionally, the described bootstrap methodology can also be employed as a tool to simulate gene-expression participating in an oscillatory system from a reference data set. |
Author | Fernández, Miguel A Rueda, Cristina Peddada, Shyamal D Larriba, Yolanda |
Author_xml | – sequence: 1 givenname: Yolanda surname: Larriba fullname: Larriba, Yolanda email: yolanda.larriba@uva.es organization: Departamento de Estadística e Investigación Operativa, Universidad de Valladolid, Valladolid, Spain. yolanda.larriba@uva.es – sequence: 2 givenname: Cristina surname: Rueda fullname: Rueda, Cristina organization: Departamento de Estadística e Investigación Operativa, Universidad de Valladolid, Valladolid, Spain – sequence: 3 givenname: Miguel A surname: Fernández fullname: Fernández, Miguel A organization: Departamento de Estadística e Investigación Operativa, Universidad de Valladolid, Valladolid, Spain – sequence: 4 givenname: Shyamal D surname: Peddada fullname: Peddada, Shyamal D organization: Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, USA |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/31115890$$D View this record in MEDLINE/PubMed |
BookMark | eNo1j8tKAzEUQIMo9qFfIEh-IJrnZO5SWluFqlB0PdzJJHakMymZzKJ-fcHH6sBZHDgzct7H3hNyI_id4Nzegy2ZYBoUMNBaMlvBGZkK0JwVXMKEzIbhi3NtldSXZKKEEKYEPiXrl9aliCnhkS4xI32NqcN9-425jT3FvqHbWI9DpkufvfuRMdDt7ph3XevoymMekx-uyEXA_eCv_zgnH6vH98UT27ytnxcPG_aplMnMOIuuVuDRSChc44WSpgwaShPKBkVRCwHSCagt98oEb6S2MijvpOKKSzknt7_dw1h3vqkOqe0wHav_I3kCUBJNww |
ContentType | Journal Article |
DBID | CGR CUY CVF ECM EIF NPM |
DOI | 10.1007/978-1-4939-9442-7_9 |
DatabaseName | Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed |
DatabaseTitle | MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) |
DatabaseTitleList | MEDLINE |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database |
DeliveryMethod | no_fulltext_linktorsrc |
Discipline | Biology |
EISSN | 1940-6029 |
ExternalDocumentID | 31115890 |
Genre | Journal Article |
GroupedDBID | --- 29M 53G ACGFS ALMA_UNASSIGNED_HOLDINGS CGR CUY CVF ECM EIF F5P NPM P2P RSU SPO UDS WH7 ZGI |
ID | FETCH-LOGICAL-g335t-5c7acb39ea5296cde13258f4985f8da16b1192c19b70e35fe52472f3ec2303022 |
IngestDate | Thu Jan 02 22:59:08 EST 2025 |
IsPeerReviewed | false |
IsScholarly | true |
Keywords | Circadian genes Pre-processing Oscillatory systems Rhythmicity High-throughput technologies Normalization Microarray |
Language | English |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-g335t-5c7acb39ea5296cde13258f4985f8da16b1192c19b70e35fe52472f3ec2303022 |
PMID | 31115890 |
ParticipantIDs | pubmed_primary_31115890 |
PublicationCentury | 2000 |
PublicationDate | 2019-01-01 |
PublicationDateYYYYMMDD | 2019-01-01 |
PublicationDate_xml | – month: 01 year: 2019 text: 2019-01-01 day: 01 |
PublicationDecade | 2010 |
PublicationPlace | United States |
PublicationPlace_xml | – name: United States |
PublicationTitle | Methods in molecular biology (Clifton, N.J.) |
PublicationTitleAlternate | Methods Mol Biol |
PublicationYear | 2019 |
SSID | ssj0047324 |
Score | 2.1640115 |
Snippet | Data derived from microarray technologies are generally subject to various sources of noise and accordingly the raw data are pre-processed before formally... |
SourceID | pubmed |
SourceType | Index Database |
StartPage | 207 |
SubjectTerms | Algorithms Cell Line, Tumor Gene Expression Regulation Humans Oligonucleotide Array Sequence Analysis - methods Statistics, Nonparametric |
Title | Microarray Data Normalization and Robust Detection of Rhythmic Features |
URI | https://www.ncbi.nlm.nih.gov/pubmed/31115890 |
Volume | 1986 |
hasFullText | |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LbxMxELZSEKgXRMsbWvnArdoqu7Zj-4gqoKqaCpVWKqfIz6ZSk1Rh9xB-BT-Zsb3epIFWwGUV2dmVtd_425nxPBB6T7Vj4bipGAjjCsrdoNCMVUXpvTDUllLb4NAfngwOz-nRBbvo9X6uRC01td43P_6YV_I_qMIY4BqyZP8B2e6hMAC_AV-4AsJw_SuMhyGaTs3nagHo1SocwkzUdZtZmVIPZ7r5XgOr1M5k3fB0vKjHISQ-qH_NvA0izF2dYkfpGCQ7yZ1z93KlpnTi6-ucpHW0v-JIOIaFXOmoi36bhXjJjvBPG2fj-EFklGk3EZzY8aS-zJ7s4dVl466XDtYvzlqVbv46XqhJIOlVR0XIjeocFS6RqwypBv12YZl9pbhFoKkH7m_EvozlAJNXEllISsE0GMnVfwM6N5OINQEKZyI1Ir1_dq3adp7aQBucB6I_Cd6f9GWnHLTPrnJVKk68tppN9Dg_Yc1OifrK2VP0pDU08IckNVuo56bb6FFqPbp4hj4vZQcH2cG3ZAcDfDjJDu5kB888zrKDs-w8R-efPp4dHBZtU43ikhBWF8xwZTSRToUTd2NdSSomPJWCeWFVOdAlKP0GNinvO8K8YxXllSfOgLFKQON7gR5MZ1P3CmFLPWcVodSKPlVcwLYW3irPrPYlrcrX6GV6A6ObVDlllN_Nmztn3qLNpey8Qw89bFW3A3pfrXcjGL8AKz1T5Q |
linkProvider | National Library of Medicine |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Microarray+Data+Normalization+and+Robust+Detection+of+Rhythmic+Features&rft.jtitle=Methods+in+molecular+biology+%28Clifton%2C+N.J.%29&rft.au=Larriba%2C+Yolanda&rft.au=Rueda%2C+Cristina&rft.au=Fern%C3%A1ndez%2C+Miguel+A&rft.au=Peddada%2C+Shyamal+D&rft.date=2019-01-01&rft.eissn=1940-6029&rft.volume=1986&rft.spage=207&rft_id=info:doi/10.1007%2F978-1-4939-9442-7_9&rft_id=info%3Apmid%2F31115890&rft_id=info%3Apmid%2F31115890&rft.externalDocID=31115890 |