Experimental and Computational Methods for Allelic Imbalance Analysis from Single-Nucleus RNA-seq Data

Single-cell RNA-seq (scRNA-seq) is emerging as a powerful tool for understanding gene function across diverse cells. Recently, this has included the use of allele-specific expression (ASE) analysis to better understand how variation in the human genome affects RNA expression at the single-cell level...

Full description

Saved in:
Bibliographic Details
Published inbioRxiv
Main Authors Simmons, Sean K, Adiconis, Xian, Haywood, Nathan, Parker, Jacob, Lin, Zechuan, Liao, Zhixiang, Tuncali, Idil, Al'Khafaji, Aziz, Shin, Asa, Jagadeesh, Karthik, Gosik, Kirk, Gatzen, Michael, Smith, Jonathan T, El Kodsi, Daniel N, Kuras, Yuliya, Baecher-Allan, Clare, Serrano, Geidy E, Beach, Thomas G, Garimella, Kiran, Rozenblatt-Rosen, Orit, Regev, Aviv, Dong, Xianjun, Scherzer, Clemens, Levin, Joshua Z
Format Journal Article
LanguageEnglish
Published United States 16.08.2024
Online AccessGet full text

Cover

Loading…
More Information
Summary:Single-cell RNA-seq (scRNA-seq) is emerging as a powerful tool for understanding gene function across diverse cells. Recently, this has included the use of allele-specific expression (ASE) analysis to better understand how variation in the human genome affects RNA expression at the single-cell level. We reasoned that because intronic reads are more prevalent in single-nucleus RNA-Seq (snRNA-Seq), and introns are under lower purifying selection and thus enriched for genetic variants, that snRNA-seq should facilitate single-cell analysis of ASE. Here we demonstrate how experimental and computational choices can improve the results of allelic imbalance analysis. We explore how experimental choices, such as RNA source, read length, sequencing depth, genotyping, etc., impact the power of ASE-based methods. We developed a new suite of computational tools to process and analyze scRNA-seq and snRNA-seq for ASE. As hypothesized, we extracted more ASE information from reads in intronic regions than those in exonic regions and show how read length can be set to increase power. Additionally, hybrid selection improved our power to detect allelic imbalance in genes of interest. We also explored methods to recover allele-specific isoform expression levels from both long- and short-read snRNA-seq. To further investigate ASE in the context of human disease, we applied our methods to a Parkinson's disease cohort of 94 individuals and show that ASE analysis had more power than eQTL analysis to identify significant SNP/gene pairs in our direct comparison of the two methods. Overall, we provide an end-to-end experimental and computational approach for future studies.
Bibliography:ObjectType-Working Paper/Pre-Print-3
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:2692-8205
2692-8205
DOI:10.1101/2024.08.13.607784