Neglecting the impact of normalization in semi-synthetic RNA-seq data simulations generates artificial false positives

A recent study reported exaggerated false positives by popular differential expression methods when analyzing large population samples. We reproduce the differential expression analysis simulation results and identify a caveat in the data generation process. Data not truly generated under the null h...

Full description

Saved in:
Bibliographic Details
Published inGenome Biology Vol. 25; no. 1; pp. 281 - 5
Main Authors Hejblum, Boris P, Ba, Kalidou, Thiébaut, Rodolphe, Agniel, Denis
Format Journal Article
LanguageEnglish
Published England BioMed Central 30.10.2024
BMC
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A recent study reported exaggerated false positives by popular differential expression methods when analyzing large population samples. We reproduce the differential expression analysis simulation results and identify a caveat in the data generation process. Data not truly generated under the null hypothesis led to incorrect comparisons of benchmark methods. We provide corrected simulation results that demonstrate the good performance of dearseq and argue against the superiority of the Wilcoxon rank-sum test as suggested in the previous study.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
ObjectType-Commentary-3
content type line 23
ISSN:1474-760X
1474-7596
1474-760X
DOI:10.1186/s13059-024-03231-9