An a posteriori strategy for enhancing gene discovery in anonymous cDNA microarray experiments

Motivation: Because of the high cost of sequencing, the bulk of gene discovery is performed using anonymous cDNA microarrays. Though the clones on such arrays are easier and cheaper to construct and utilize than unigene and oligonucleotide arrays, they are there in proportion to their corresponding...

Full description

Saved in:

Bibliographic Details
Published in	Bioinformatics Vol. 20; no. 11; pp. 1721 - 1727
Main Authors	Anderssen, R. S., Wu, Y., Dolferus, R., Saunders, I.
Format	Journal Article
Language	English
Published	Oxford Oxford University Press 22.07.2004 Oxford Publishing Limited (England)
Subjects	Algorithms Biological and medical sciences Fundamental and applied biological sciences. Psychology Gene Expression Profiling - methods General aspects Genetic Variation In Situ Hybridization, Fluorescence - methods Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) Microscopy, Fluorescence - methods Models, Genetic Models, Statistical Oligonucleotide Array Sequence Analysis - methods Reproducibility of Results Sample Size Sensitivity and Specificity Sequence Analysis, DNA - methods Complementary DNA Gene Bioinformatics
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Motivation: Because of the high cost of sequencing, the bulk of gene discovery is performed using anonymous cDNA microarrays. Though the clones on such arrays are easier and cheaper to construct and utilize than unigene and oligonucleotide arrays, they are there in proportion to their corresponding gene expression activity in the tissue being examined. The associated redundancy will be there in any pool of possibly interesting differentially expressed clones identified in a microarray experiment for subsequent sequencing and investigation. An a posteriori sampling strategy is proposed to enhance gene discovery by reducing the impact of the redundancy in the identified pool. Results: The proposed strategy exploits the fact that individual genes that are highly expressed in a tissue are more likely to be present as a number of spots in an anonymous library and, as a direct consequence, are also likely to give higher fluorescence intensity responses when present in a probe in a cDNA microarray experiment. Consequently, spots that respond with low intensities will have a lower redundancy and so should be sequenced in preference to those with the highest intensities. The proposed method, which formalizes how the fluorescence intensity of a spot should be assessed, is validated using actual microarray data, where the sequences of all the clones in the identified pool had been previously determined. For such validations, the concept of a repeat plot is introduced. It is also utilized to visualize and examine different measures for the characterization of fluorescence intensity. In addition, as confirmatory evidence, sequencing from the lowest to the highest intensities in a pool, with all the sequences known, is compared graphically with their random sequencing. The results establish that, in general, the opportunity for gene discovery is enhanced by avoiding the pooling of different biological libraries (because their construction will have involved different hybridization episodes) and concentrating on the clones with lower fluorescence intensities.
Bibliography:	local:bth152 Contact: Bob.Anderssen@csiro.au ark:/67375/HXZ-NJV8KZC3-2 istex:BDF7181F02A1F7D717D174BC2ED12735161859B4 ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 ObjectType-Article-1 ObjectType-Feature-2
ISSN:	1367-4803 1460-2059 1367-4811
DOI:	10.1093/bioinformatics/bth152