Analysis of the occurrence of promoter-sites in DNA
We show that the occurrence and homology score (1) of promoter-sites in DNA depends upon the base composition of the DNA. We used simple probability theory to calculate the mean homology score expected for all promoter-sites that had a specific match in the canonical hexamers. By using the square ro...
Saved in:
Published in | Nucleic acids research Vol. 14; no. 1; pp. 109 - 126 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
England
Oxford University Press
10.01.1986
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | We show that the occurrence and homology score (1) of promoter-sites in DNA depends upon the base composition of the DNA. We used simple probability theory to calculate the mean homology score expected for all promoter-sites that had a specific match in the canonical hexamers. By using the square root of this mean score as a measure of significance, we objectively classify all promoter-sites which are reported. We tested the theoretical approach in two ways. First, we used the program (PROMSEARCH) to analyze approximately 150,000 base pairs of random sequence DNA with different base compositions and we found excellent agreement with the theoretical predictions. Our second test was the analysis of a number of sequences drawn from the GENBANK DNA sequence database. We have analyzed 20 bacterial and bacteriophage sequences, which consisted of at least one operon, for promoter-sites. We found no absolute preference for promoter-sites within noncoding regions. We show the results analyzing the phages lambda, T7 and fd, and the E. coli lac operon. The major known promoters in these sequences were all found correctly. We discuss the question of the location of a number of minor promoter-sites and show how PROMSEARCH can be used to help identify the correct location of the promoter. This approach can be applied to the search for any DNA site and should allow greater objectivity when comparing DNA sequences for meaningful subsequences. |
---|---|
Bibliography: | istex:1C375920F451896035AD4F842C97900E4A28435C ark:/67375/HXZ-XG8QW8C7-G ArticleID:14.1.109 Present address: Department of Molecular Genetics and Cell Biology, University of Chicago, 920 E. 58th St., Chicago, IL 60637, USA ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 0305-1048 1362-4962 |
DOI: | 10.1093/nar/14.1.109 |