Identification and bioinformatics analysis of pseudogenes from whole genome sequence of Phaeodactylum tricornutum

Bibliographic Details
Published in Chinese science bulletin Vol. 58; no. 9; pp. 1010-1017
Main Authors Ji, ChangMian; Huang, AiYou; Liu, WenLing; Pan, GuangHua; Wang, GuangCe
Format Journal Article
Language English
Published Heidelberg Springer-Verlag 01.03.2013
SP Science China Press
Summary: Pseudogenes share sequence similarities with functional genes, but in general they have lost their protein-coding ability. The identification of pseudogenes is a very important step in genome annotation. Phaeodactylum tricornutum is a marine diatom that is rich in polyunsaturated fatty acids (PUFAs). The genome of P. tricornutum has been completely sequenced. To identify pseudogenes in P. tricornutum, we developed a pipeline to discover and characterize pseudogenes. We identified a total of 1654 'true' processed pseudogenes, 714 duplicated pseudogenes and 4729 pseudogene fragments. The results of the bioinformatics analysis indicated that the genome sequence of P. tricornutum contained many pseudogenes and pseudogene fragments.
Keywords: Phaeodactylum tricornutum; pseudogene; pseudogene fragment; bioinformatics
11-1785/N
JI ChangMian, HUANG AiYou, LIU WenLing, PAN GuangHua & WANG GuangCe
1 Tianjin Key Laboratory of Marine Resources and Chemistry, College of Marine Science and Engineering, Tianjin University of Science and Technology, Tianjin 300457, China
2 Institute of Oceanology, Chinese Academy of Sciences (IOCAS), Qingdao 266071, China
3 Graduate University of Chinese Academy of Sciences, Beijing 100049, China
http://dx.doi.org/10.1007/s11434-012-5174-3
ISSN: 1001-6538, 1861-9541
DOI: 10.1007/s11434-012-5174-3