September 2024 A flexible model for correlated count data, with application to multicondition differential expression analyses of single-cell RNA sequencing data
Yusha Liu, Peter Carbonetto, Michihiro Takahama, Adam Gruenbaum, Dongyue Xie, Nicolas Chevrier, Matthew Stephens
Author Affiliations +
Ann. Appl. Stat. 18(3): 2551-2575 (September 2024). DOI: 10.1214/24-AOAS1894

Abstract

Detecting differences in gene expression is an important part of single-cell RNA sequencing experiments, and many statistical methods have been developed for this aim. Most differential expression analyses focus on comparing expression between two groups (e.g., treatment vs. control). But there is increasing interest in multicondition differential expression analyses in which expression is measured in many conditions and the aim is to accurately detect and estimate expression differences in all conditions. We show that directly modeling single-cell RNA-seq counts in all conditions simultaneously, while also inferring how expression differences are shared across conditions, leads to greatly improved performance for detecting and estimating expression differences compared to existing methods. We illustrate the potential of this new approach by analyzing data from a single-cell experiment studying the effects of cytokine stimulation on gene expression. We call our new method “Poisson multivariate adaptive shrinkage,” and it is implemented in an R package available at https://github.com/stephenslab/poisson.mash.alpha.

Funding Statement

This work was supported by the NHGRI and NIAID at the National Institutes of Health under award numbers R01HG002585, DP2AI145100 and U01AI160418, and by a grant from the Leona M. and Harry B. Helmsley Charitable Trust as part of the Gut Cell Atlas initiative.

Acknowledgments

We thank Abhishek Sarkar, Yuxin Zou and Jason Willwerscheid for their invaluable advice. We also thank the staff at the Research Computing Center for providing the high-performance computing resources used to implement the numerical experiments.

Citation

Download Citation

Yusha Liu. Peter Carbonetto. Michihiro Takahama. Adam Gruenbaum. Dongyue Xie. Nicolas Chevrier. Matthew Stephens. "A flexible model for correlated count data, with application to multicondition differential expression analyses of single-cell RNA sequencing data." Ann. Appl. Stat. 18 (3) 2551 - 2575, September 2024. https://doi.org/10.1214/24-AOAS1894

Information

Received: 1 November 2022; Revised: 1 March 2024; Published: September 2024
First available in Project Euclid: 5 August 2024

Digital Object Identifier: 10.1214/24-AOAS1894

Keywords: differential expression analysis , Empirical Bayes , multivariate count data , Poisson , single-cell RNA sequencing , variational inference

Rights: Copyright © 2024 Institute of Mathematical Statistics

Vol.18 • No. 3 • September 2024
Back to Top