Improved analytical methods for microarray-based genome-composition analysis

Bibliographic Details
Published in Genome Biology Vol. 3; no. 11; pp. RESEARCH0065 - 402
Main Authors Kim, Charles C; Joyce, Elizabeth A; Chan, Kaman; Falkow, Stanley
Format Journal Article
Language English
Published England BioMed Central Ltd 29.10.2002
BioMed Central
Summary: Whereas genome sequencing has given us high-resolution pictures of many different species of bacteria, microarrays provide a means of obtaining information on genome composition for many strains of a given species. Genome-composition analysis using microarrays, or 'genomotyping', can be used to categorize genes into 'present' and 'divergent' categories based on the level of hybridization signal. This typically involves selecting a signal value that is used as a cutoff to discriminate present (high signal) and divergent (low signal) genes. Current methodology uses empirical determination of cutoffs for classification into these categories, but this methodology is subject to several problems that can result in the misclassification of many genes. We describe a method that depends on the shape of the signal-ratio distribution and does not require empirical determination of a cutoff. Moreover, the cutoff is determined on an array-to-array basis, accounting for variation in strain composition and hybridization quality. The algorithm also provides an estimate of the probability that any given gene is present, which provides a measure of confidence in the categorical assignments. Many genes previously classified as present using static methods are in fact divergent on the basis of microarray signal; this is corrected by our algorithm. We have reassigned hundreds of genes from previous genomotyping studies of Helicobacter pylori and Campylobacter jejuni strains, and expect that the algorithm should be widely applicable to genomotyping data.
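The summary does not spell out the algorithm, so the following Python sketch is only one plausible reading of "a cutoff that depends on the shape of the signal-ratio distribution", not the authors' published method. It assumes that the main peak of one array's log signal ratios comes from present genes, that this peak is roughly normal, and that values above the peak are free of divergent genes. The function names (`estimate_presence`, `classify`), the bin width, the 0.5 threshold, and the simulated data are all hypothetical choices made for illustration.

```python
import math
import random
from collections import Counter

def estimate_presence(log_ratios, bin_width=0.1):
    """Return (mode, sigma, epp) for one array's log signal ratios.

    epp maps a histogram bin index to an estimated probability that a
    gene in that bin is present (assumption: normal main peak).
    """
    # Histogram of the log ratios for this array only.
    counts = Counter(math.floor(x / bin_width) for x in log_ratios)
    # Locate the main peak using counts smoothed over 5 adjacent bins.
    def smoothed(b):
        return sum(counts.get(b + k, 0) for k in range(-2, 3))
    peak_bin = max(counts, key=smoothed)
    mode = (peak_bin + 0.5) * bin_width
    # Estimate the spread from values at or above the peak, which are
    # assumed to come from present genes only.
    upper = [x - mode for x in log_ratios if x >= mode]
    sigma = math.sqrt(sum(d * d for d in upper) / len(upper))
    if sigma == 0:
        raise ValueError("cannot estimate spread of the main peak")
    # Mirror the upper half to model the full present-gene peak.
    n_present = 2 * len(upper)
    epp = {}
    for b, observed in counts.items():
        centre = (b + 0.5) * bin_width
        z = (centre - mode) / sigma
        density = math.exp(-0.5 * z * z) / (sigma * math.sqrt(2 * math.pi))
        expected = n_present * density * bin_width
        # Share of the observed bin count explained by the normal model.
        epp[b] = min(1.0, expected / observed)
    return mode, sigma, epp

def classify(x, mode, epp, bin_width=0.1, threshold=0.5):
    """Call a gene 'present' or 'divergent' from its log ratio x."""
    if x >= mode:
        return "present"
    b = math.floor(x / bin_width)
    return "present" if epp.get(b, 0.0) >= threshold else "divergent"

# Simulated array (hypothetical data): 900 present, 100 divergent genes.
rng = random.Random(0)
present = [rng.gauss(0.0, 0.3) for _ in range(900)]
divergent = [rng.gauss(-2.5, 0.5) for _ in range(100)]
mode, sigma, epp = estimate_presence(present + divergent)
calls = Counter(classify(x, mode, epp) for x in present + divergent)
print(mode, sigma, dict(calls))
```

Because `estimate_presence` is run separately on each array, the effective cutoff moves with that array's distribution instead of being fixed in advance, and the per-bin value in `epp` plays the role of a confidence measure for each categorical call.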
ISSN: 1474-760X
1465-6906
1465-6914
DOI: 10.1186/gb-2002-3-11-research0065