WSI-GT: Pseudo-Label Guided Graph Transformer for Whole-Slide Histology

Zhongao Sun; Alexander Khvostikov; Andrey Krylov; Ilya Mikhailov; Pavel G. Malkov

WSI-GT: Pseudo-Label Guided Graph Transformer for Whole-Slide Histology

Zhongao Sun, Alexander Khvostikov, Andrey Krylov, Ilya Mikhailov, Pavel G. Malkov

19 Sept 2025 (modified: 24 Nov 2025)ICLR 2026 Conference Withdrawn SubmissionEveryoneRevisionsBibTeXCC BY 4.0

Keywords: graph transformers, pseudo-labeling, digital pathology, whole-slide images, patch classification, over-smoothing mitigation

TL;DR: WSI-GT is a pseudo-label guided graph transformer for classifying patches in 100k×100k whole-slide histology images, combining graph convolutions and intra-class attention to achieve state-of-the-art accuracy

Abstract: Whole-slide histology images (WSIs) can exceed 100k × 100k pixels, making direct pixel-level segmentation infeasible and requiring patch-level classification as a practical alternative. However, most approaches either treat patches independently, ignoring spatial and biological context, or rely on deep graph models that oversmooth, leading to loss of critical tissue details. We present WSI-GT (Pseudo-Label Guided Graph Transformer), a simple yet effective architecture that addresses these challenges. WSI-GT combines a lightweight local graph convolution block for neighborhood feature aggregation with a pseudo-label guided attention mechanism that preserves intra-class variability and mitigates oversmoothing. To cope with sparse annotations, we introduce an area-weighted sampling strategy that balances class representation while maintaining tissue topology. WSI-GT achieves a Macro F1 of 0.95 on PATH-DT-MSU WSS2v2, improving by up to 3 percentage points over tile-based CNNs and by about 2 points over strong graph baselines. It further generalizes well to the Placenta benchmark and standard graph node classification datasets, highlighting both clinical relevance and broader applicability. These results position WSI-GT as a practical and scalable solution for graph-based learning on extremely large images.

Supplementary Material: zip

Primary Area: learning on graphs and other geometries & topologies

Submission Number: 17788

Loading