CHAMMI-75: pre-training multi-channel models with heterogeneous microscopy images

Vidit Agrawal; John Peters; Tyler N. Thompson; Mohammad Vali Sanian; Chau Pham; Nikita Moshkov; Arshad Kazi; Aditya Pillai; Jack Freeman; Byunguk Kang; Samouil L. Farhi; Ernest Fraenkel; Ron M. Stewart; Lassi Paavolainen; Bryan A. Plummer; Juan C. Caicedo

CHAMMI-75: pre-training multi-channel models with heterogeneous microscopy images

Vidit Agrawal, John Peters, Tyler N. Thompson, Mohammad Vali Sanian, Chau Pham, Nikita Moshkov, Arshad Kazi, Aditya Pillai, Jack Freeman, Byunguk Kang, Samouil L. Farhi, Ernest Fraenkel, Ron M. Stewart, Lassi Paavolainen, Bryan A. Plummer, Juan C. Caicedo

Published: 26 Jan 2026, Last Modified: 11 Feb 2026ICLR 2026 PosterEveryoneRevisionsBibTeXCC BY 4.0

Keywords: microscopy, representation learning, multi-channel imaging, self-supervised learning, biology

TL;DR: Dataset for pre-training multi-channel imaging models in microscopy and benchmarks in cellular biology applications

Abstract: Quantifying cell morphology using images and machine learning has proven to be a powerful tool to study the response of cells to treatments. However, the models used to quantify cellular morphology are typically trained with a single microscopy imaging type and under controlled experimental conditions. This results in specialized models that cannot be reused across biological studies because the technical specifications do not match (e.g., different number of channels), or because the target experimental conditions are out of distribution. Here, we present CHAMMI-75, a dataset of heterogeneous, multi-channel microscopy images with 2.8M multi-channel images from 75 diverse biological studies. We curated this resource from publicly available sources to investigate cellular morphology models that are channel-adaptive and can process any microscopy image type. Our experiments show that training with CHAMMI-75 can improve performance in multi-channel bioimaging tasks, opening the way to create the next generation of cellular morphology models for biological studies.

Primary Area: datasets and benchmarks

Submission Number: 9717

Loading