Balancing false positives and false negatives for the detection of differential expression in malignancies

A basic problem of microarray data analysis is to identify genes whose expression is affected by the distinction between malignancies with different properties. These genes are said to be differentially expressed. Differential expression can be detected by selecting the genes with P-values (derived...

Full description

Saved in:
Bibliographic Details
Published inBritish journal of cancer Vol. 91; no. 6; pp. 1160 - 1165
Main Authors DE SMET, F, MOREAU, Y, ENGELEN, K, TIMMERMAN, D, VERGOTE, I, DE MOOR, B
Format Journal Article
LanguageEnglish
Published Basingstoke Nature Publishing Group 13.09.2004
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A basic problem of microarray data analysis is to identify genes whose expression is affected by the distinction between malignancies with different properties. These genes are said to be differentially expressed. Differential expression can be detected by selecting the genes with P-values (derived using an appropriate hypothesis test) below a certain rejection level. This selection, however, is not possible without accepting some false positives and negatives since the two sets of P-values, associated with the genes whose expression is and is not affected by the distinction between the different malignancies, overlap. We describe a procedure for the study of differential expression in microarray data based on receiver-operating characteristic curves. This approach can be useful to select a rejection level that balances the number of false positives and negatives and to assess the degree of overlap between the two sets of P-values. Since this degree of overlap characterises the balance that can be reached between the number of false positives and negatives, this quantity can be seen as a quality measure of microarray data with respect to the detection of differential expression. As an example, we apply our method to data sets studying acute leukaemia.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
On leave at the Center for Biological Sequence Analysis, BioCentrum, Danish Technical University, Building 208, DK-2800 Lyngby, Denmark
ISSN:0007-0920
1532-1827
DOI:10.1038/sj.bjc.6602140