Benchmarking Dimensionality Reduction Techniques for Spatial Transcriptomics

Published: 12 Oct 2025, Last Modified: 06 May 2026CrossrefEveryoneRevisionsCC BY-SA 4.0
Abstract: We introduce a unified framework for evaluating dimensionality reduction techniques in spatial transcriptomics beyond standard PCA approaches. We benchmark six methods—PCA, NMF, autoencoder, VAE, and two hybrid embeddings—on a cholangiocarcinoma Xenium dataset, systematically varying latent dimensions (k=5-40) and clustering resolutions (ρ=0.1-1.2). Each configuration is evaluated using complementary metrics including reconstruction error, explained variance, cluster cohesion, and two novel biologically-motivated measures: Cluster Marker Coherence (CMC) and Marker Exclusion Rate (MER). Our results demonstrate distinct performance profiles: PCA provides a fast baseline, NMF maximizes marker enrichment, VAE balances reconstruction and interpretability, while autoencoders occupy a middle ground. We demonstrate how MER-guided reassignment improves biological fidelity across all methods, with CMC scores improving by up to 12% on average. This framework enables principled selection of dimensionality reduction methods tailored to specific transcriptomics analyses.
Loading