Scalable Amortized GPLVMs for Single Cell Transcriptomics Data

Published: 04 Mar 2024, Last Modified: 07 May 2024MLGenX 2024 PosterEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Probabilistic dimensionality reduction, single-cell RNA sequencing, scVI, Gaussian Process Latent Variable Models (GPLVMs)
TL;DR: This paper introduces a Bayesian GPLVM for single-cell RNA-seq that matches top models in visualization and effectively manages cell-cycle and batch effects for improved latent structure interpretation.
Abstract: Dimensionality reduction is crucial for analyzing large-scale single-cell RNA-seq data. Gaussian Process Latent Variable Models (GPLVMs) offer an interpretable dimensionality reduction method, but current scalable models lack effectiveness in clustering cell types. We introduce an improved model, the amortized stochastic variational Bayesian GPLVM (BGPLVM), tailored for single-cell RNA-seq with specialized encoder, kernel, and likelihood designs. This model matches the performance of the leading single-cell variational inference (scVI) approach on synthetic and real-world COVID datasets and effectively incorporates cell-cycle and batch information to reveal more interpretable latent structures as we demonstrate on an innate immunity dataset.
Submission Number: 45
Loading