Scaling Channel-Adaptive Self-Supervised Learning

Alice V. De Lorenci; Seung Eun Yi; Théo Moutakanni; Piotr Bojanowski; Camille Couprie; Juan C. Caicedo; Wolfgang Maximilian Anton Pernice

Published: 22 Jun 2025, Last Modified: 24 Jun 2025. Accepted by TMLR. License: CC BY 4.0.

Abstract: Recent advances in self-supervised pre-training of foundation models for natural images have made them a popular choice for various visual systems and applications. Self-supervised strategies are also promising in non-RGB scientific imaging domains such as biological, medical, and satellite imagery, but their broader application is hampered by heterogeneity in channel composition and semantics between relevant datasets: two datasets may contain different numbers of channels, and these may reveal distinct aspects of an object or scene. Recent works on channel-adaptive strategies report substantial advantages for those that account for variable channel compositions without sacrificing the ability to jointly encode channels; yet how these strategies behave at scale remains unclear. Here we show that, surprisingly, when trained across large-scale datasets, independent encoding of channels outperforms joint-encoding methods by a substantial margin. We validate this result through an extensive set of experiments on various datasets, from cell microscopy to geospatial imagery. Our DINO BoC approach sets a new state of the art across challenging benchmarks, including generalization to out-of-distribution tasks and unseen channel combinations at test time. We will open source the code, along with model weights that constitute a new general-purpose feature extractor for fluorescence microscopy.
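As a rough illustration of what "independent encoding of channels" can mean in practice, the sketch below applies one shared single-channel encoder to each channel separately and concatenates the per-channel features. It is not the paper's implementation: the random linear map standing in for the encoder, the concatenation step, and all names (`make_channel_encoder`, `encode_independently`, `patch_dim`, `feat_dim`) are assumptions made for this example only.

```python
import random


def make_channel_encoder(patch_dim, feat_dim, seed=0):
    """Hypothetical stand-in for a shared single-channel encoder.

    A fixed random linear map is used here purely for illustration;
    a real model would be a trained network.
    """
    rng = random.Random(seed)
    weights = [[rng.gauss(0.0, 1.0) for _ in range(patch_dim)]
               for _ in range(feat_dim)]

    def encode(channel):
        # `channel` is a flat list of pixel values for ONE channel.
        return [sum(w * x for w, x in zip(row, channel)) for row in weights]

    return encode


def encode_independently(image, encode):
    """Encode each channel with the same encoder, then concatenate.

    Concatenation is an assumed aggregation choice for this sketch.
    """
    features = []
    for channel in image:
        features.extend(encode(channel))
    return features


encode = make_channel_encoder(patch_dim=16, feat_dim=4)

# Two toy "images" with different channel counts (3 and 5 channels).
rng = random.Random(1)
three_channel = [[rng.random() for _ in range(16)] for _ in range(3)]
five_channel = three_channel + [[rng.random() for _ in range(16)]
                                for _ in range(2)]

feats3 = encode_independently(three_channel, encode)
feats5 = encode_independently(five_channel, encode)
print(len(feats3), len(feats5))
```

Because the encoder in this sketch only ever sees one channel at a time, the same weights apply to both toy images; only the length of the concatenated output changes with the number of channels, which is one way an unseen channel combination could be handled at test time.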

Submission Length: Regular submission (no more than 12 pages of main content)

Previous TMLR Submission Url: https://openreview.net/forum?id=Ftxr3Wt6DP

Changes Since Last Submission: This is the camera-ready version, with two changes: (1) a figure summarizing the results has been added, as promised in the rebuttal; (2) a link to the code is now included in the conclusion. The link does not work yet; it will work in July at the latest.

Code: https://github.com/facebookresearch/dinov2/blob/main/docs/README_CHANNEL_ADAPTIVE_DINO.md

Assigned Action Editor: ~Gustavo_Carneiro1

Submission Number: 4288
