CLIP-based prediction of mammalian microRNA binding sites

Prediction and validation of microRNA (miRNA) targets are essential for understanding functions of miRNAs in gene regulation. Crosslinking immunoprecipitation (CLIP) allows direct identification of a huge number of Argonaute-bound target sequences that contain miRNA binding sites. By analysing data...

Full description

Saved in:
Bibliographic Details
Published inNucleic acids research Vol. 41; no. 14; p. e138
Main Authors Liu, Chaochun, Mallick, Bibekanand, Long, Dang, Rennie, William A, Wolenc, Adam, Carmack, C Steven, Ding, Ye
Format Journal Article
LanguageEnglish
Published England Oxford University Press 01.08.2013
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Prediction and validation of microRNA (miRNA) targets are essential for understanding functions of miRNAs in gene regulation. Crosslinking immunoprecipitation (CLIP) allows direct identification of a huge number of Argonaute-bound target sequences that contain miRNA binding sites. By analysing data from CLIP studies, we identified a comprehensive list of sequence, thermodynamic and target structure features that are essential for target binding by miRNAs in the 3' untranslated region (3' UTR), coding sequence (CDS) region and 5' untranslated region (5' UTR) of target messenger RNA (mRNA). The total energy of miRNA:target hybridization, a measure of target structural accessibility, is the only essential feature common for both seed and seedless sites in all three target regions. Furthermore, evolutionary conservation is an important discriminating feature for both seed and seedless sites. These features enabled us to develop novel statistical models for the predictions of both seed sites and broad classes of seedless sites. Through both intra-dataset validation and inter-dataset validation, our approach showed major improvements over established algorithms for predicting seed sites and a class of seedless sites. Furthermore, we observed good performance from cross-species validation, suggesting that our prediction framework can be valuable for broad application to other mammalian species and beyond. Transcriptome-wide binding site predictions enabled by our approach will greatly complement the available CLIP data, which only cover small fractions of transcriptomes and known miRNAs due to non-detectable levels of expression. Software and database tools based on the prediction models have been developed and are available through Sfold web server at http://sfold.wadsworth.org.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors.
Present addresses: Bibekanand Mallick, RNA Biology and Functional Genomics Laboratory, Department of Life Science, National Institute of Technology, Rourkela-769008, Odisha, India.
Dang Long, Biotechnology Department, Faculty of Chemistry, Danang University of Science and Technology, 54 Nguyen Luong Bang St., Danang, Vietnam.
ISSN:0305-1048
1362-4962
DOI:10.1093/nar/gkt435