Long-read sequencing transcriptome quantification with lr-kallisto

RNA abundance quantification has become routine and affordable thanks to high-throughput "short-read" technologies that provide accurate molecule counts at the gene level. Similarly accurate and affordable quantification of definitive full-length, transcript isoforms has remained a stubbor...

Full description

Saved in:

Bibliographic Details
Published in	bioRxiv
Main Authors	Loving, Rebekah, Sullivan, Delaney K, Reese, Fairlie, Rebboah, Elisabeth, Sakr, Jasmine, Rezaie, Narges, Liang, Heidi Y, Filimban, Ghassan, Kawauchi, Shimako, Oakes, Conrad, Trout, Diane, Williams, Brian A, MacGregor, Grant, Wold, Barbara J, Mortazavi, Ali, Pachter, Lior
Format	Journal Article
Language	English
Published	United States 09.09.2024
Online Access	Get full text

Cover

Loading…

More Information
Summary:	RNA abundance quantification has become routine and affordable thanks to high-throughput "short-read" technologies that provide accurate molecule counts at the gene level. Similarly accurate and affordable quantification of definitive full-length, transcript isoforms has remained a stubborn challenge, despite its obvious biological significance across a wide range of problems. "Long-read" sequencing platforms now produce data-types that can, in principle, drive routine definitive isoform quantification. However some particulars of contemporary long-read datatypes, together with isoform complexity and genetic variation, present bioinformatic challenges. We show here, using ONT data, that fast and accurate quantification of long-read data is possible and that it is improved by exome capture. To perform quantifications we developed lr-kallisto, which adapts the kallisto bulk and single-cell RNA-seq quantification methods for long-read technologies.
Bibliography:	ObjectType-Working Paper/Pre-Print-3 ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	2692-8205 2692-8205
DOI:	10.1101/2024.07.19.604364