\section{Datasets}\label{ap:datasets}

\paragraph{PanNuke} 
The PanNuke dataset \cite{gamper2020pannuke} is a large-scale collection of H\&E stained histopathology images derived from 19 tissue types. It comprises 7,904 patches, each sized $256\times256$ pixels, extracted from WSIs from The Cancer Genome Atlas (TCGA) at a magnification of $40\times$. Within this dataset, there are 189,744 labeled nuclei classified into five classes: neoplastic, inflammatory, connective, necrosis, and epithelial.

\paragraph{CoNSeP}
The CoNSeP dataset \cite{graham2019hover} focuses on H\&E colorectal adenocarcinoma samples. It comprises 41 patches, each 1000x1000 pixels in size, extracted from WSI at a magnification of $40\times$. The dataset encompasses various regions such as stromal, glandular, muscular, collagen, adipose, and tumorous areas and its nuclei are grouped into five classes: inflammatory, epithelial, spindle-shaped and miscellaneous.

\paragraph{Ki-67} Additionally, we employ a custom Ki-67 dataset \cite{anglada2024dualunet}, developed within the DigiPatICS project \cite{digipatics}, comprising 52 annotated tiles (each of size $1024\times1024$ pixels) extracted from Ki-67-stained WSIs at a magnification of $40\times$. Sourced from four patients exhibiting different proliferation levels, each tile is accompanied by cell-level annotations that include segmentation masks and cell classes (positive, negative, or non-epithelial). This dataset is not publicly available, and the weights of the models trained on it will not be released.
