Application of Simulated Annealing to the Biclustering of Gene Expression Data

In a gene expression data matrix, a bicluster is a submatrix of genes and conditions that exhibits a high correlation of expression activity across both rows and columns. The problem of locating the most significant bicluster has been shown to be NP-complete. Heuristic approaches such as Cheng and C...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on information technology in biomedicine Vol. 10; no. 3; pp. 519 - 525
Main Authors Bryan, K., Cunningham, P., Bolshakova, N.
Format Journal Article
LanguageEnglish
Published United States IEEE 01.07.2006
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In a gene expression data matrix, a bicluster is a submatrix of genes and conditions that exhibits a high correlation of expression activity across both rows and columns. The problem of locating the most significant bicluster has been shown to be NP-complete. Heuristic approaches such as Cheng and Church's greedy node deletion algorithm have been previously employed. It is to be expected that stochastic search techniques such as evolutionary algorithms or simulated annealing might improve upon such greedy techniques. In this paper we show that an approach based on simulated annealing is well suited to this problem, and we present a comparative evaluation of simulated annealing and node deletion on a variety of datasets. We show that simulated annealing discovers more significant biclusters in many cases. Furthermore, we also test the ability of our technique to locate biologically verifiable biclusters within an annotated set of genes
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1089-7771
1558-0032
DOI:10.1109/TITB.2006.872073