High-resolution DNA-binding specificity analysis of yeast transcription factors

Transcription factors (TFs) regulate the expression of genes through sequence-specific interactions with DNA-binding sites. However, despite recent progress in identifying in vivo TF binding sites by microarray readout of chromatin immunoprecipitation (ChIP-chip), nearly half of all known yeast TFs...

Full description

Saved in:
Bibliographic Details
Published inGenome Research Vol. 19; no. 4; pp. 556 - 566
Main Authors Zhu, Cong, Byers, Kelsey J R P, McCord, Rachel Patton, Shi, Zhenwei, Berger, Michael F, Newburger, Daniel E, Saulrieta, Katrina, Smith, Zachary, Shah, Mita V, Radhakrishnan, Mathangi, Philippakis, Anthony A, Hu, Yanhui, De Masi, Federico, Pacek, Marcin, Rolfs, Andreas, Murthy, Tal, Labaer, Joshua, Bulyk, Martha L
Format Journal Article
LanguageEnglish
Published United States Cold Spring Harbor Laboratory Press 01.04.2009
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Transcription factors (TFs) regulate the expression of genes through sequence-specific interactions with DNA-binding sites. However, despite recent progress in identifying in vivo TF binding sites by microarray readout of chromatin immunoprecipitation (ChIP-chip), nearly half of all known yeast TFs are of unknown DNA-binding specificities, and many additional predicted TFs remain uncharacterized. To address these gaps in our knowledge of yeast TFs and their cis regulatory sequences, we have determined high-resolution binding profiles for 89 known and predicted yeast TFs, over more than 2.3 million gapped and ungapped 8-bp sequences ("k-mers"). We report 50 new or significantly different direct DNA-binding site motifs for yeast DNA-binding proteins and motifs for eight proteins for which only a consensus sequence was previously known; in total, this corresponds to over a 50% increase in the number of yeast DNA-binding proteins with experimentally determined DNA-binding specificities. Among other novel regulators, we discovered proteins that bind the PAC (Polymerase A and C) motif (GATGAG) and regulate ribosomal RNA (rRNA) transcription and processing, core cellular processes that are constituent to ribosome biogenesis. In contrast to earlier data types, these comprehensive k-mer binding data permit us to consider the regulatory potential of genomic sequence at the individual word level. These k-mer data allowed us to reannotate in vivo TF binding targets as direct or indirect and to examine TFs' potential effects on gene expression in approximately 1,700 environmental and cellular conditions. These approaches could be adapted to identify TFs and cis regulatory elements in higher eukaryotes.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
These authors contributed equally to this work.
ISSN:1088-9051
1549-5469
1549-5477
DOI:10.1101/gr.090233.108