Identification and Characterization of miRNA Transcriptome in Asiatic Cotton ( Gossypium arboreum ) Using High Throughput Sequencing

MicroRNAs (miRNAs) are small 20-24nt molecules that have been well studied over the past decade due to their important regulatory roles in different cellular processes. The mature sequences are more conserved across vast phylogenetic scales than their precursors and some are conserved within entire...

Full description

Saved in:
Bibliographic Details
Published inFrontiers in plant science Vol. 8; p. 969
Main Authors Farooq, Muhammad, Mansoor, Shahid, Guo, Hui, Amin, Imran, Chee, Peng W, Azim, M Kamran, Paterson, Andrew H
Format Journal Article
LanguageEnglish
Published Switzerland Frontiers Media S.A 15.06.2017
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:MicroRNAs (miRNAs) are small 20-24nt molecules that have been well studied over the past decade due to their important regulatory roles in different cellular processes. The mature sequences are more conserved across vast phylogenetic scales than their precursors and some are conserved within entire kingdoms, hence, their loci and function can be predicted by homology searches. Different studies have been performed to elucidate miRNAs using prediction methods but due to complex regulatory mechanisms or false positive predictions, not all of them express in reality and sometimes computationally predicted mature transcripts differ from the actual expressed ones. With the availability of a complete genome sequence of , it is important to annotate the genome for both coding and non-coding regions using high confidence transcript evidence, for this cotton species that is highly resistant to various biotic and abiotic stresses. Here we have analyzed the small RNA transcriptome of leaves and provided genome annotation of miRNAs with evidence from miRNA/miRNA transcripts. A total of 446 miRNAs clustered into 224 miRNA families were found, among which 48 families are conserved in other plants and 176 are novel. Four short RNA libraries were used to shortlist best predictions based on high reads per million. The size, origin, copy numbers and transcript depth of all miRNAs along with their isoforms and targets has been reported. The highest gene copy number was observed for gar-miR7504 followed by gar-miR166, gar-miR8771, gar-miR156, and gar-miR7484. Altogether, 1274 target genes were found in that are enriched for 216 KEGG pathways. The resultant genomic annotations are provided in UCSC, BED format.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Reviewed by: Claus Jüurgen Scholz, University of Bonn, Germany; Chao Cheng, Dartmouth College, United States
This article was submitted to Bioinformatics and Computational Biology, a section of the journal Frontiers in Plant Science
Edited by: Alessandro Laganà, Icahn School of Medicine at Mount Sinai, United States
ISSN:1664-462X
1664-462X
DOI:10.3389/fpls.2017.00969