\section{Discussion and Conclusions}\label{sec:discussion}

This study introduces DualU-Net, a streamlined architecture for cell classification and segmentation in histopathology, developed to handle multiple staining protocols, including H\&E and Ki-67. Our goal is to demonstrate that \textit{Two Heads are Enough}, challenging the necessity of HoVer-Net's three-decoder paradigm, yet still tackling the same overall task of cell nuclei classification and instance segmentation. Although state-of-the-art models have widely adopted a three-decoder setup, DualU-Net consolidates its functionality by carefully weighting the background class in the loss function and adopting Gaussian-based density maps for centroid estimation. This makes the NP branch redundant and  provides a faster, more intuitive alternative to HoVer-Net’s HV representation. 

Our results show that DualU-Net achieves comparable classification and detection performance to state-of-the-art models across multiple stains, while reducing architectural complexity, improving computational efficiency, and increasing robustness to color variations. Although slightly lower segmentation scores have been observed, they can be attributed to two factors (more details in Appendix~\ref{ap:GT-examples}):
\begin{enumerate}[label=(\roman*)]
    \item Watershed-based segmentation, where our centroid-based approach, unlike boundary-focused methods, occasionally leads to non-smooth or irregular contours due to the inherent nature of the watershed algorithm (see Fig.~\ref{fig:errors}, bottom).
    \item Ground truth inconsistencies (see Fig.~\ref{fig:errors}, top), notably oversegmentation in CoNSeP and missing cell annotations in PanNuke, directly affect the learning process of our center detection head by introducing errors in the Gaussian map generation (the foundation of our watershed algorithm). Consequently, these issues have a stronger impact on our segmentation metrics than approaches not driven by centroid-based segmentation.
\end{enumerate}
 
Moreover, since segmentation is primarily a visualization tool, the qualitative results shown in Fig.~\ref{fig:qualitative_comparison} confirm for this aim an equivalent performance to state-of-the-art methods. Finally, our results indicate that ConvNeXt does not provide significant improvements over ResNeXt, reinforcing the efficiency of the original backbone.

DualU-Net significantly reduces inference time compared to HoVer-Net, making it more practical for real-world deployment. On CoNSeP, we process images \texttimes2.5 faster, and on PanNuke, we achieve a \texttimes5.1 speed-up. Additionally, DualU-Net is more computationally efficient than CellViT and NuLite. Despite having more parameters, it surpasses NuLite-S in efficiency. These improvements highlight the effectiveness of our approach in reducing computational complexity without sacrificing segmentation and classification accuracy.

Stain variations present a well-known challenge in histopathology, as differences in staining protocols and scanning devices can significantly impact model performance. Our controlled color perturbation experiments on CoNSeP confirm that DualU-Net exhibits lower variance in classification, detection and segmentation scores compared to HoVer-Net.

In conclusion, DualU-Net eliminates the need for a third decoder head, achieving classification and detection performance comparable to state-of-the-art models, along with competitive segmentation, while enhancing inference efficiency and robustness to color variations. These advantages make it well-suited for clinical deployment, where speed and efficiency are crucial. Furthermore, DualU-Net has been successfully integrated into the DigiPatICS project~\cite{digipatics} and deployed in eight hospitals within the Institut Català de la Salut de Catalunya, highlighting its real-world impact.  Future work will focus on exploring lighter models, such as ConvNeXt-Tiny~\cite{Liu_2022_ConvNext}, to further enhance computational efficiency.
