AlleleSeq: analysis of allele‐specific expression and binding in a network framework
To study allele‐specific expression (ASE) and binding (ASB), that is, differences between the maternally and paternally derived alleles, we have developed a computational pipeline (AlleleSeq). Our pipeline initially constructs a diploid personal genome sequence (and corresponding personalized gene a...
Saved in:
Published in | Molecular systems biology Vol. 7; no. 1; pp. 522 - n/a |
---|---|
Main Authors | , , , , , , , , , , , , , |
Format | Journal Article |
Language | English |
Published |
London
Nature Publishing Group UK
02.08.2011
John Wiley & Sons, Ltd EMBO Press Nature Publishing Group Springer Nature |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | To study allele‐specific expression (ASE) and binding (ASB), that is, differences between the maternally and paternally derived alleles, we have developed a computational pipeline (AlleleSeq). Our pipeline initially constructs a diploid personal genome sequence (and corresponding personalized gene annotation) using genomic sequence variants (SNPs, indels, and structural variants), and then identifies allele‐specific events with significant differences in the number of mapped reads between maternal and paternal alleles. There are many technical challenges in the construction and alignment of reads to a personal diploid genome sequence that we address, for example, bias of reads mapping to the reference allele. We have applied AlleleSeq to variation data for NA12878 from the 1000 Genomes Project as well as matched, deeply sequenced RNA‐Seq and ChIP‐Seq data sets generated for this purpose. In addition to observing fairly widespread allele‐specific behavior within individual functional genomic data sets (including results consistent with X‐chromosome inactivation), we can study the interaction between ASE and ASB. Furthermore, we investigate the coordination between ASE and ASB from multiple transcription factors events using a regulatory network framework. Correlation analyses and network motifs show mostly coordinated ASB and ASE.
Software was developed for building a personal diploid genome sequence, and determining sites of allele‐specific binding and expression (AlleleSeq).
This computational pipeline was used to analyze variation data, and deeply sequenced RNA‐Seq and ChIP‐Seq datasets, for individual NA12878 from the 1000 Genomes Project.
The interaction between allele‐specific binding and allele‐specific expression are investigated, revealing clear coordination. |
---|---|
Bibliography: | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 ObjectType-Article-1 ObjectType-Feature-2 These authors contributed equally to this work |
ISSN: | 1744-4292 1744-4292 |
DOI: | 10.1038/msb.2011.54 |