\documentclass[usenames,dvipsnames]{midl} % usenames and dvipsnames arguments needed for the xcolor package
%\documentclass[anon]{midl} % Anonymized submission

% The following packages will be automatically loaded:
% jmlr, amsmath, amssymb, natbib, graphicx, url, algorithm2e
% ifoddpage, relsize and probably more
% make sure they are installed with your latex distribution

\usepackage{mwe}
% \jmlrvolume{-- Under Review}
\jmlryear{2020}
\jmlrworkshop{Full Paper -- MIDL 2020}
% \editors{Under Review for MIDL 2020}

\usepackage{float}
\usepackage{graphicx}
\usepackage{caption}
% \usepackage{subcaption}
\usepackage{xcolor}
\usepackage{tikz}
\usepackage[normalem]{ulem}

\def\plussmall{\raisebox{0.5ex}{\tiny\textbf{+}}}

\title[Siamese Tracking of Cell Behaviour Patterns]{Siamese Tracking of Cell Behaviour Patterns}

\midlauthor{\Name{Andreas Panteli\midljointauthortext{Contributed equally}} \Email{andreas.panteli@student.uva.nl} \\
	\Name{Deepak K. Gupta\midlotherjointauthor} \Email{d.k.gupta@uva.nl} \\
	\Name{Nathan de Bruijn} \Email{nathanldebruijn@gmail.com} \\
	\Name{Efstratios Gavves} \Email{e.gavves@uva.nl} \\ 
	\addr Informatics Institute, University of Amsterdam, The Netherlands 
}

\begin{document}

\maketitle

\begin{abstract}
Tracking and segmentation of biological cells in video sequences is a challenging problem, especially due to the similarity of the cells and high levels of inherent noise. Most machine learning-based approaches lack robustness and suffer from sensitivity to less prominent events such as mitosis, apoptosis and cell collisions. Due to the large variance in medical image characteristics, most approaches are dataset-specific and do not generalise well on other datasets. 

In this paper, we propose a simple end-to-end cascade neural architecture that can effectively model the movement behaviour of biological cells and predict collision and mitosis events. Our approach uses U-Net for an initial segmentation which is then improved through processing by a siamese tracker capable of matching each cell along the temporal axis. By facilitating the re-segmentation of collided and mitotic cells, our method demonstrates its capability to handle volatile trajectories and unpredictable cell locations while being invariant to cell morphology. We demonstrate that our tracking approach achieves state-of-the-art results on  PhC-C2DL-PSC and Fluo-N2DH-SIM\plussmall~datasets and ranks second on the DIC-C2DH-HeLa dataset of the cell tracking challenge benchmarks. 
\end{abstract}

\begin{keywords}
Cell tracking, re-segmentation, mitosis, cell collision, Siamese tracker
\end{keywords}

\section{Introduction}
\input{introduction}


\section{Methodology}
\label{sec:methodology}
\input{method}

\section{Experiments} 
\subsection{Data}
\input{data} 

\subsection{Results}

The performance of our methods is evaluated using the same measures as described in \citet{cellTrackingChallenge}. For segmentation, the average of detection (DET) and segmentation (SEG) metrics, expressed as OP$_{CSB}= \frac{1}{2}$(DET+SEG), is used. Further, for tracking, the average of SEG and tracking (TRA) measures, expressed as  OP$_{CTB} = \frac{1}{2}$(SEG+TRA), is used.

Table \ref{tab:results3} provides the performance scores for the three variants of TAS. As expected, the performance of s-TAS is the best among the three proposed variants, while that of g-TAS is the lowest. Comparing the published works of \citet{zhou2019joint} and \citet{lux2019dic} with the TAS variants, on the {Fluo-N2DH-SIM}\plussmall~dataset, shows that our g-TAS approach outperforms one of the baselines set by \citet{zhou2019joint}. The intermediate approach, i-TAS, further improves the performance, primarily due to the more precise boundary refining of ellipsoid shapes. For the PhC-C2DL-PSC and DIC-C2DH-HeLa datasets, g-TAS exhibits a significant drop in performance. This is due to overfitting mainly because of lower number of training samples compared to the Fluo-N2DH-SIM set; when all data is used to train one model. In the i-TAS and s-TAS variants however, the U-Net is trained on each dataset separately which improves performance significantly and the s-TAS variant outperforms both baselines.

We further compare the performance of our approach with the top 3 submissions, state-of-art methods, on the leaderboard of ISBI cell tracking challenge\textsuperscript{\ref{note1}}. Approaches in this challenge are precisely tuned to the type of cell morphology and image properties in order to achieve competitive performance. Table \ref{tab:results2} presents the results of our method compared to the other approaches. The detailed results, including all metrics, are shown in Table \ref{tab:results21} in Appendix \ref{sec:appendixA}. As can be seen, our approach outperforms all other methods for datasets PhC-C2DL-PSC and Fluo-N2DH-SIM\plussmall, and ranks second for the DIC-C2DH-HeLa. 

An interesting observation is that for the tracking metric (TRA) for the DIC-C2DH-HeLa dataset, \ref{tab:results21}, our approach outperforms the rest with a value of $0.955$. The cells in DIC-C2DH-HeLa lack easily discernible boundaries compared to other datasets and also lack intense movement activities. This is primarily the reason that this dataset does not benefit from our modelling of cell behaviour. The minor shift of the TRA metric can be explained by the explicit collision detection method which the method of \citet{lux2019dic} lacks. In datasets Fluo-N2DH-SIM\plussmall~and PhC-C2DL-PSC, highly active cells are better detected by our tracking method. This is clearly evident in PhC-C2DL-PSC, where up to 1000 cells are present in each frame, compared to a maximum of around 70 in the Fluo-N2DH-SIM\plussmall. Collisions and mitotic events are more evident with increased spatial displacement which constitutes our method an ideal tracker and re-segmentation. Due to this reason, our method outperforms the previous state-of-art-approaches by a larger margin on this dataset.


% Modified for R4 table 1
\begin{table}
    \centering
    \begin{tabular}{l | c  c | c c | c c |}
      & \multicolumn{2}{|c|}{DIC-C2DH-HeLa} & \multicolumn{2}{|c|}{{Fluo-N2DH-SIM}\plussmall} & \multicolumn{2}{|c|}{PhC-C2DL-PSC}\\\hline
      Method  & OP$_{CSB}$& OP$_{CTB}$ & OP$_{CSB}$& OP$_{CTB}$  & OP$_{CSB}$& OP$_{CTB}$ \\\hline
    \citet{zhou2019joint} & - & - & 0.861 & 0.860 & 0.806 & 0.801 \\
    \citet{lux2019dic} &  0.894 & - & - & - & - & - \\
    g-TAS & 0.077 & 0.076 & {0.872}  & {0.870} & 0.315 & 0.308 \\
    i-TAS & 0.880 & 0.874 & {0.895}  & {0.893} & 0.717 & 0.704 \\
    s-TAS & \textbf{0.905} & \textbf{0.904} & \textbf{0.897} & \textbf{0.896}  & \textbf{0.846} & \textbf{0.843} \\
    \end{tabular}
    \caption{OP$_{CSB}$ and OP$_{CTB}$ results of the three different initial segmentation approaches, as described in section \ref{sec:methodology} for the published state-of-the-art approaches.}
    \label{tab:results3}
\end{table}

\begin{table}
    \centering
    \begin{tabular}{p{4cm} | c  c | c  c | c  c |}
      & \multicolumn{2}{|c|}{{DIC-C2DH-HeLa}} & \multicolumn{2}{|c|}{{Fluo-N2DH-SIM}\plussmall} & \multicolumn{2}{|c|}{{PhC-C2DL-PSC}}\\\hline
      Method & OP$_{CSB}$& OP$_{CTB}$ & OP$_{CSB}$& OP$_{CTB}$ & OP$_{CSB}$& OP$_{CTB}$\\\hline
    %  \citet{zhou2019joint} & - & - & 0.861 & 0.860 & 0.806 & 0.801\\
    %  \citet{lux2019dic} & 0.894 & - & - & - & - & - \\\hline
     ISBI CTC\textsuperscript{\ref{note1}} 3rd entry & \textcolor{Apricot}{0.884} & \textcolor{Apricot}{0.848} & \textcolor{Apricot}{0.887} & \textcolor{Apricot}{0.882} & \textcolor{Violet}{0.808} & \textcolor{BrickRed}{0.804}\\
     ISBI CTC\textsuperscript{\ref{note1}} 2nd entry & \textcolor{Cyan}{0.895} & \textcolor{Cyan}{0.894} & \textcolor{GreenYellow}{0.890} & \textcolor{GreenYellow}{0.889} & \textcolor{BrickRed}{0.809} & \textcolor{Violet}{0.804}\\
     ISBI CTC\textsuperscript{\ref{note1}} 1st entry & \textcolor{Goldenrod}{\textbf{0.912}} & \textcolor{Goldenrod}{\textbf{0.909}} & \textcolor{Goldenrod}{0.896} & \textcolor{Goldenrod}{0.895} & \textcolor{Goldenrod}{0.841} & \textcolor{Goldenrod}{0.836} \\\hline
    s-TAS & 0.905 & 0.904 & \textbf{0.897} & \textbf{0.896}  & \textbf{0.846} & \textbf{0.843} \\
    \end{tabular}
    \caption{OP$_{CSB}$ and OP$_{CTB}$ scores, as of 30th of January for the entries to the ISBI Cell Tracking Competition (CTC)\textsuperscript{\ref{note1}} \cite{cellTrackingChallenge}. \textcolor{Goldenrod}{MU-Lux-CZ} the same team as in the work of \citet{lux2019dic}, \textcolor{Cyan}{ND-US}, \textcolor{Apricot}{BGU-IL} the same team as in the work of \citet{zhou2019joint}, \textcolor{BrickRed}{CVUT-CZ}, \textcolor{Violet}{HD-Hau-GE}, \textcolor{GreenYellow}{UVA-NL} (our earlier submission of this work).
    }
    \label{tab:results2}
\end{table}

\begin{figure}[t]
\centering
\begin{tikzpicture}
\node[inner sep=0pt] (russell) at (0,0){\includegraphics[width=0.55\linewidth]{images/collision_6.png}};
\node at (0,1.5) {\scalebox{0.9}[1.0]{\footnotesize \textbf{Siamese Matching}}};
\node at (2.8,-1.2) {\footnotesize \textbf{Re-segmentation}};
\node at (-4.7, 2.5) {\textbf{t-2}};
\node at (-4.7, 0) {\textbf{t-1}};
\node at (-4.7, -2.7) {\textbf{t}};

\end{tikzpicture}
	\caption{Schematic representation of cell collision detection using a Siamese tracker, and re-segmentation of the detected cells using watershed approach. The two cells, far apart in $(t-2)^{\text{th}}$ frame, collide in the $t^{\text{th}}$ frame, and are wrongly segmented as a single cell. Through siamese tracking between $(t-1)^{\text{th}}$ and $t^{\text{th}}$ frames, the collision event is identified, and applying the watershed approach over the $t^{\text{th}}$ frame, using the cell centroids of frame $(t-1)^{\text{th}}$, helps to correct the segmentation.}
	\label{fig:collision}
\end{figure}

The code used to produce these results is available at \\ \href{https://gitlab.com/Baggsy/cell_tracking_2019}{\textit{https://gitlab.com/Baggsy/cell\_tracking\_2019}}

\section{Discussion}
\label{sec:Discussion}
% Modified for R2 marginal differences
Since our approach enhances the tracking capabilities of segmentation focused methods, it is important to identify which aspects of the tracker contribute most to the improvements reported earlier. Table \ref{tab:results4} presents an ablation study involving s-TAS being studied with respect to the collision and mitosis detection modules. As can be seen, both collision and mitosis modelling steps are an integral part of the approach, since there exists a clear dependence on them. Removing the re-segmentation correction completely results in lower performance values. 

However, it is important to note that the performance variation between different model variants are not very large, with a mean standard deviation value of $0.0169$. Upon further inspection of the 2nd and 3rd entries for all datasets in Table \ref{tab:results21}, all differences seem marginal, indicating saturation in the benchmark. We further explore this in our findings of noisy labels with these properties: (1) Delayed mitotic events (a single label for 2 distinct cell boundaries) (2) Pre-mitotic events (two labels for one bigger cell body before a mitosis) (3) Cell death artefacts (4) Irregular shape of ground truth labels (label not matching visual inspection of cell). To adjust for delayed and pre-mitotic events we experimented with the mitotic/collisional event timing by hastening and delaying the re-segmentation but noticed a decrease in performance. These noisy labels are mostly due to the staining procedure and hence it is something all participating algorithms face. 

For high cell activity, such as in the PhC-C2DL-PSC dataset, we observe a bigger increase in performance due to our ability to correct the initial segmentation in difficult-to-observe cell collisions. Last, we emphasise that quantitatively speaking, we consider as much more important the fact that we produce state-of-the-art results on several different datasets and without any dataset-specific algorithms (at most, we only tune hyperparameters on the respective training sets for the initial segmentation alone); rather than the final numbers themselves.

% Modified R4 power of algorithm
We notice that we improve particularly well when boundaries are not clearly distinguishable. From Figure \ref{fig:mitosis_collission_example}, the mitotic event in the DIC-C2DH-HeLa dataset has a very visible effect. Hence a simple segmentation can detect the mitotic event. In contrast, the collision example in Figure \ref{fig:mitosis_collission_example} for the Fluo-N2DH-SIM+ dataset, does not always result in clearly defined boundaries. As Figure \ref{fig:collision} indicates, when single segmentation cannot split collided cells, our re-segmentation checks the next and previous frames for meshed bodies and is able to split them. The real power of the method, and where we get the biggest numerical improvements, is when there exist high cell activity in frames, such as collisions, for smaller cells such as is the case for the PhC-C2DL-PSC dataset due to the convoluted appearance of the cell bodies. We want to emphasise here that these advantages are inherent to the approach, and the core algorithm does not need to change (at most only hyperparameters tuned depending on the TAS variant).

% Additionally, we note that \textit{g-TAS} as specified in section \ref{sec:methodology}, does not use any manual set parameters tailored to specific cell types but only trains the U-Net architecture to detect the cells. Although a more general approach, it has shown to over-fit to the train set easily and with a large deviation from the results on the testing dataset. This is an important drawback for supervised approaches using very limited training images. This TAS variant still outperformed the method of \citet{zhou2019joint} using significantly less parameters indicating that despite the drop in performance, it has potential to perform in par with state-of-the-art methods.


\hspace{-2em}
\begin{figure}
\begin{minipage}{0.5\textwidth}
\footnotesize
    \centering
    \begin{tabular}{ c c | c c |}
     \multicolumn{2}{c|}{(Base) \textit{s-TAS}} &\multicolumn{2}{|c|}{{Fluo-N2DH-SIM}\plussmall}\\\hline
     Collision detection & Mitosis detection  & OP$_{CSB}$& OP$_{CTB}$\\\hline
    + & + & \textbf{0.902}  & \textbf{0.901} \\
     - & + & 0.899 & 0.899\\
     + & - & 0.875 & 0.874\\
     - & - & 0.859 & 0.854 \\
    \end{tabular}

    % \label{tab:results4}
\end{minipage}\hspace{3em}
\begin{minipage}{0.45\textwidth}
% \vspace{1em}
\captionsetup{width=0.9\linewidth}
    \captionof{table}{OP$_{CSB}$ and OP$_{CTB}$ ablation study results on the \textbf{training} set using the \textit{s-TAS} variant as a base method for applying modification on.}
    \label{tab:results4}
\end{minipage}
\end{figure}
% \vspace{-1em}

\section{Conclusions}
Medical images of biological cells, contain several noisy artefacts, convoluted cell boundaries and unpredictable cell movements, which often confuse cell segmentation methods. We propose a siamese tracking assisted re-segmentation approach which specifically models biological cell activities (mitosis, apoptosis and cell collisions) and enhances the overall cell segmentation. Our results indicate that biologically inspired tracking models of micrometer-scaled cells can better apprehend erratic behaviour. We demonstrate the applicability of our method on three cell tracking datasets. The specialised variant outperforms the previous state-of-the-art models, and ranks first on two and second on one of the three benchmark datasets. 

% Acknowledgments---Will not appear in anonymized version
\midlacknowledgments{We thank Martin Maška for his constant help, time and invaluable assistance.}
\bibliography{panteli20}
\pagebreak
\appendix
\input{appendix}

\end{document}
