Advancements in the field of machine learning (ML) have allowed for the automation of medical data analysis, with several advanced methods performing as good as humans on a range of tasks (\emph{e.g.}, \citealt{ dalca2018unsupervised, chen2018efficient}). 
Example problems where deep learning (DL) methods have produce significant impact include detection and classification of tumours \cite{coudray2018classification, kamnitsas2017ensembles, budinska2013gene, kalaiselvi2014new}, identification of new biomarkers in high-dimensional data \cite{budinska2013gene, ravi2015detection, rathi2011sparse}, among others. 

\begin{figure}[!th]
    \centering
    \includegraphics[height=2.55cm]{images/real_img_Fluo-N2DL-HeLa.png}\hfill
    \includegraphics[height=2.55cm]{images/real_img_Fluo-N2DH-SIM+.png}\hfill
    \includegraphics[height=2.55cm]{images/real_img_Fluo-N2DH-GOWT1.png}\hfill
    \includegraphics[height=2.55cm]{images/real_img_PhC-C2DL-PSC.png}\hfill
    \includegraphics[height=2.55cm]{images/real_img_DIC-C2DH-HeLa.png}
    \caption{Example images of (a) Fluo-N2DL-HeLa (b) Fluo-N2DH-SIM\plussmall~(c) Fluo-N2DH-GOWT1 (d) PhC-C2DL-PSC (e) DIC-C2DH-HeLa datasets from the cell tracking challenge \cite{cellTrackingChallenge}.}
    \label{fig:real_imgs}
\end{figure}

The problem of medical data analysis becomes more challenging at the micrometer scale (\emph{e.g.} pancreatic stem cells or cell nuclei), where it is difficult to visualise and process the data \cite{coudray2018classification, saltz2018spatial}. Cell distinctive shapes and morphological traits are convoluted with low resolution artefacts, noise components from the microscope scanning device and varying lighting conditions \cite{swiderska2019learning}. Figure \ref{fig:real_imgs} shows example images of cells for five different datasets from the IEEE International Symposium on Biomedical Imaging (ISBI) cell tracking challenge \cite{cellTrackingChallenge}. As can be seen, there are various challenges such as low signal-to-noise ratio, poor illumination, clutter of cells and occlusion that make it difficult to accurately track and segment the cells with a naked eye.

In addition to the challenges outlined in Figure \ref{fig:real_imgs}, biological cells also fail to conform to a predefined shape and there can be several shifting movement patterns that are are hard to analyse or detect \cite{saltz2018spatial}. 
Examples of such behaviour are shown in \mbox{Figure \ref{fig:mitosis_collission_example}} where a parent cell gets split into two daughter cells (Mitosis) or two cells collide and appear as a single cell (Collision). %Also, there can be cells outside the visual area which appear in the later frames, or cells within the frame that eventually disappear, referred to as cell death.

% Recently, several machine learning (ML) approaches have been proposed for the segmentation and tracking of cells. 
Some cell segmentation and tracking approaches include constructing temporal trajectories for cells \cite{yang2005cell}, spatial correlation using Delaunay graphs \cite{nath2006cell}, and watershed deconvolution with morphological operators \cite{sharif2012red}. Due to the large noise components of medical images, ML methods mediating dataset-specific properties such as uneven illumination and lack of pixel-value normalisation, tend to define many morphological dependent conditions for every cell type \cite{sharif2012red, lux2019dic}. However, this process is sensitive to outliers and along with volatile positional changes, it makes tracking each individual cell over time more difficult. This is because a cell in the previous frame might change completely in the next.


The U-Net neural network (NN) introduced by \citet{unet} is among the most successful NN architectures in biomedical image segmentation for tasks such as tumour detection and living cell segmentation \cite{heller2019state, falk2019u}. Due to its U-shape and essential residual connections, U-Net has achieved state-of-the-art results in many challenges \cite{li2018h, dubost2017gp}. However, despite its high performance, even U-Net struggles on images containing significant movements of cells and changes in their morphology, and in particular, fails to reliably detect cells that split or die (leave the field of view of the scanning device) \cite{christ2016automatic}. \citet{lux2019dic} combined U-Net with watershed deconvolution  \cite{kachouie2008watershed} and demonstrated improved performance on cell tracking datasets\footnote{\label{note1}\href{http://celltrackingchallenge.net/}{\textit{http://celltrackingchallenge.net/}}}. However, this approach involves tuning of several parameters, \emph{i.e.}, several data-specific intermediate processing steps relating to cell size, erosion, staining, and other morphological traits so as to acquire reasonable cell shapes. \citet{zhou2019joint} proposed using two U-Net architectures, one for segmenting cells and the other for detecting their centroids. Due to learning data specific network weights for cell detection, this approach is more resilient to outliers, however, it does not suggest a robust way to detect smaller cells that are falsely segmented as a single cell.

\citet{lux2019dic} used static area overlap for building the correspondence of cell trajectory in subsequent frames.  Another approach uses level sets to follow the evolution of cells \cite{cellTrackingChallenge}. However, due to the fluidic and erratic nature of cells, neither of these algorithms are able to model their real movement patterns. Siamese networks, first introduced in the work of \citet{bromley1994signature} and adapted for Siamese Instance Search Tracking (SINT) by \citet{tao2016siamese}, have shown to excel in generic object tracking \cite{siamfc}. Due to their robustness in object matching under appearance variations, siamese methods have been useful in segmenting medical images \cite{spitzer2018improving}. 

% Modified for R4 novelty
In this paper, we introduce a re-segmentation approach, which relies on siamese matching-based trackers, combined with U-Net and the watershed deconvolution method, to track cells and model cell collisions, mitosis and apoptosis. Compared to the original siamese trackers, as in the work of \citet{tao2016siamese} and \citet{siamfc} proposed on natural image videos, we model cell behavioural patterns in order to correctly track cells. Preliminary results related to this research were recently reported by us in \cite{gupta2019tracking}. Compared to the previous cell tracking methods, such as in the work of \citet{magnusson2016segmentation} and \citet{lux2019dic}, we generalise tracking to an approach independent of cell-type, predicting cell displacement across frames.

Our proposed Tracking-Assisted Segmentation (TAS) can easily model and detect the unstable biological cell movement activities such as mitosis and cell collisions that lead to false predictions. Further, through the use of the watershed algorithm, false predictions can be re-segmented to improve the overall cell tracking performance. The contributions of this paper can be summarised as follows.
\begin{itemize}
    \setlength\itemsep{0em}
    \item We augment segmentation by siamese tracking for improved temporal correspondence and re-segmentation of erroneous predictions.
    \item Our approach is more robust to morphology variations and explicitly models rare events such as mitosis, apoptosis and cell collisions. 
    % Modified for R1 outperforming in 3 datasets
    \item Our approach generalises well to different datasets outperforming published state-of-the-art segmentation methods for biological cells on three benchmark datasets \footnote{For the DIC-C2DH-HeLa dataset, we achieved second place at the online ISBI cell tracking competition but the published approach of the team at first place, \cite{lux2019dic}, is still outperformed by our method.}.
\end{itemize}
% Modified for R1 clinical application
Our method can improve the autonomous segmentation of biological cells with the goal of inspecting multiple patient images in parallel, speed up diagnosis and ease the workload of doctors. Our approach indicates a more robust method of cell segmentation which will reduce the number of cells incorrectly detected and improve the accuracy and performance of such automatic systems.
