From tradition to innovation: conventional and deep learning frameworks in genome annotation

Abstract Following the milestone success of the Human Genome Project, the ‘Encyclopedia of DNA Elements (ENCODE)’ initiative was launched in 2003 to unearth information about the numerous functional elements within the genome. This endeavor coincided with the emergence of numerous novel technologies...

Full description

Saved in:

Bibliographic Details
Published in	Briefings in bioinformatics Vol. 25; no. 3
Main Authors	Chen, Zhaojia, Ain, Noor ul, Zhao, Qian, Zhang, Xingtan
Format	Journal Article
Language	English
Published	England Oxford University Press 27.03.2024 Oxford Publishing Limited (England)
Subjects	Algorithms Annotations Bibliographic information Bioinformatics Deep learning Encyclopaedias Encyclopedias Experimental methods Functionals Gene sequencing Genes Genomes Human Genome Project Machine learning Nucleotide sequence Ribonucleic acid RNA deep learning genome sequence bioinformatic genome annotation RNA-Seq technology
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Abstract Following the milestone success of the Human Genome Project, the ‘Encyclopedia of DNA Elements (ENCODE)’ initiative was launched in 2003 to unearth information about the numerous functional elements within the genome. This endeavor coincided with the emergence of numerous novel technologies, accompanied by the provision of vast amounts of whole-genome sequences, high-throughput data such as ChIP-Seq and RNA-Seq. Extracting biologically meaningful information from this massive dataset has become a critical aspect of many recent studies, particularly in annotating and predicting the functions of unknown genes. The core idea behind genome annotation is to identify genes and various functional elements within the genome sequence and infer their biological functions. Traditional wet-lab experimental methods still rely on extensive efforts for functional verification. However, early bioinformatics algorithms and software primarily employed shallow learning techniques; thus, the ability to characterize data and features learning was limited. With the widespread adoption of RNA-Seq technology, scientists from the biological community began to harness the potential of machine learning and deep learning approaches for gene structure prediction and functional annotation. In this context, we reviewed both conventional methods and contemporary deep learning frameworks, and highlighted novel perspectives on the challenges arising during annotation underscoring the dynamic nature of this evolving scientific landscape.
Bibliography:	ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-3 content type line 23 ObjectType-Review-1
ISSN:	1467-5463 1477-4054
DOI:	10.1093/bib/bbae138