Combining DNA and protein alignments to improve genome annotation with LiftOn

As the number and variety of assembled genomes continue to grow, the number of annotated genomes is falling behind, particularly for eukaryotes. DNA-based mapping tools help to address this challenge, but they are only able to transfer annotation between closely related species. Here we introduce Li...

Full description

Saved in:
Bibliographic Details
Published inGenome research Vol. 35; no. 2; pp. 311 - 325
Main Authors Chao, Kuan-Hao, Heinz, Jakob M., Hoh, Celine, Mao, Alan, Shumate, Alaina, Pertea, Mihaela, Salzberg, Steven L.
Format Journal Article
LanguageEnglish
Published United States Cold Spring Harbor Laboratory Press 01.02.2025
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:As the number and variety of assembled genomes continue to grow, the number of annotated genomes is falling behind, particularly for eukaryotes. DNA-based mapping tools help to address this challenge, but they are only able to transfer annotation between closely related species. Here we introduce LiftOn, a homology-based software tool that integrates DNA and protein alignments to enhance the accuracy of genome-scale annotation and to allow mapping between relatively distant species. LiftOn's protein-centric algorithm considers both types of alignments, chooses optimal open reading frames, resolves overlapping gene loci, and finds additional gene copies when they exist. LiftOn can reliably transfer annotation between genomes representing members of the same species, as we demonstrate on human, mouse, honeybee, rice, and Arabidopsis thaliana . It can further map annotation effectively across species pairs as far apart as mouse and rat or Drosophila melanogaster and Drosophila erecta .
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1088-9051
1355-8382
1549-5469
1549-5469
1469-9001
DOI:10.1101/gr.279620.124