Human microRNA prediction through a probabilistic co-learning model of sequence and structure

MicroRNAs (miRNAs) are small regulatory RNAs of ∼22 nt. Although hundreds of miRNAs have been identified through experimental complementary DNA cloning methods and computational efforts, previous approaches could detect only abundantly expressed miRNAs or close homologs of previously identified miRN...

Full description

Saved in:
Bibliographic Details
Published inNucleic acids research Vol. 33; no. 11; pp. 3570 - 3581
Main Authors Nam, Jin-Wu, Shin, Ki-Roo, Han, Jinju, Lee, Yoontae, Kim, V. Narry, Zhang, Byoung-Tak
Format Journal Article
LanguageEnglish
Published England Oxford University Press 01.01.2005
Oxford Publishing Limited (England)
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:MicroRNAs (miRNAs) are small regulatory RNAs of ∼22 nt. Although hundreds of miRNAs have been identified through experimental complementary DNA cloning methods and computational efforts, previous approaches could detect only abundantly expressed miRNAs or close homologs of previously identified miRNAs. Here, we introduce a probabilistic co-learning model for miRNA gene finding, ProMiR, which simultaneously considers the structure and sequence of miRNA precursors (pre-miRNAs). On 5-fold cross-validation with 136 referenced human datasets, the efficiency of the classification shows 73% sensitivity and 96% specificity. When applied to genome screening for novel miRNAs on human chromosomes 16, 17, 18 and 19, ProMiR effectively searches distantly homologous patterns over diverse pre-miRNAs, detecting at least 23 novel miRNA gene candidates. Importantly, the miRNA gene candidates do not demonstrate clear sequence similarity to the known miRNA genes. By quantitative PCR followed by RNA interference against Drosha, we experimentally confirmed that 9 of the 23 representative candidate genes express transcripts that are processed by the miRNA biogenesis enzyme Drosha in HeLa cells, indicating that ProMiR may successfully predict miRNA genes with at least 40% accuracy. Our study suggests that the miRNA gene family may be more abundant than previously anticipated, and confer highly extensive regulatory networks on eukaryotic cells.
Bibliography:To whom correspondence should be addressed. Tel: +82 2 880 1847; Fax: +82 2 875 2240; Email: btzhang@bi.snu.ac.kr
ark:/67375/HXZ-XNT86PRL-X
local:gki668
istex:3215DA0D1F1E2FB5A0B7B736B886F0F82AB2BC6E
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ObjectType-Undefined-3
ISSN:0305-1048
1362-4962
DOI:10.1093/nar/gki668