Assessing Transcription Factor Motif Drift from Noisy Decoy Sequences

Genome scale identification of transcription factor binding sites (TFBS) is fundamental to understanding the complexities of mRNA expression at both the cell and organismal levels. While high-throughput experimental methods provide associations between transcription factors and the genes they regula...

Full description

Saved in:
Bibliographic Details
Published inGenome Informatics Vol. 16; no. 1; pp. 59 - 67
Main Authors Reddy, Timothy E., Lisi, Charles De, Shakhnovich, Boris E.
Format Journal Article
LanguageEnglish
Published Japan Japanese Society for Bioinformatics 2005
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Genome scale identification of transcription factor binding sites (TFBS) is fundamental to understanding the complexities of mRNA expression at both the cell and organismal levels. While high-throughput experimental methods provide associations between transcription factors and the genes they regulate under a specified experimental condition, computational methods are still required to pinpoint the exact location of binding. Moreover, since the binding site is an intrinsic property of the promoter region, computational methods are in principle more general than condition dependent experimental methods. Computational identification of TFBSs is complicated in at least two different ways. First, transcription factors bind a heterogeneous distribution of sites and therefore have a distribution of affinities. Second, the set of sequences for which a common site is to be determined do not all have a site for the TF of interest. In this paper, we evaluate the robustness of TFBS identification with respect to both effects. We show addition of upstream regions that do not have the TFBS destroy the specificity of the predicted binding site. We also propose a method to calculate the distance between position weight matrices that can be used to measure “drift” from the canonical binding site. The results presented here could be useful in developing future transcription factor binding site identification algorithms.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0919-9454
2185-842X
DOI:10.11234/gi1990.16.59