% Introduce the background and difficulty of the segmentation task and your ideas.
% Introduction should include at least three paragraphs
% General background
% Main difficulties of the FLARE challenge and state-of-the-art methods
% Your idea and contributions
Subclinical examination plays an important role in all medical treatment processes. With the help of deep learning algorithms, human abdominal organs can be identified automatically with effectiveness and efficiency; thus enabling doctors for faster diagnoses. For deep learning agents to achieve high performance, it often comes with a vast amount of high-quality labeled data for the training stage. However, obtaining a sufficient amount of medical data is quite expensive and time-consuming, not to mention the need for medical labels to be evaluated by experts to ensure accuracy for usability. Because of the lack of useful data and scarce medical experts, it makes the problem becomes more challenging to tackle for today's machines.

Since last year, the FLARE22 challenge has introduced a problem in a specific scenario where a shortage of labeled medical data occurs. The included dataset contains only 50 labeled CT volumes whereas 2000 unlabeled others are given. With the provision of an enormous quantity of non-annotated data, participants are required to utilize them to boost the accuracy of their methods for the segmentation task, as well as optimize their solution for practical applicability.

Past solutions mostly approached the problem by inheriting 3D techniques, which usually demand great computing resources. In fact, the original CT volumes must be resampled to a smaller size to fit these 3D-based approaches, then the prediction of these models must also be post-processed back to its preceding sizes, which can damage the precision of the prediction. In terms of that, other teams proposed 2D-based solutions which can leverage the ability to split the CT volumes into batches of slices for efficient processing. But in reality, these techniques face serious performance issues due to the incapability of capturing the temporal information of CT slices. Therefore, to overcome these drawbacks,  we propose a novel pipeline, which works completely with only 2D image slices, that can comprehend information from all three planes of a volume.

Furthermore, to make use of a huge number of unlabeled data, two of the most common semi-supervised learning methods are consistency regularization and pseudo-labeling. Consistency-based methods train the model to produce the same pseudo-label for two different views (strong and weak augmentations) of an unlabeled sample, while pseudo-labeling converts model predictions on unlabeled samples into soft or hard labels as optimization targets. However, both of the methods suffer from the noise caused by the model trained on different data distribution (between labeled and unlabeled data). To address the above challenges, we propose a simple technique via modeling uncertainty that can be applied to filter out only potentially good pseudo labels for retraining.

Overall, our main contributions are as follows:
\begin{itemize}
    \item We propose a 2D-based segmentation pipeline that can fully exploit information of all three dimensions of a CT volume by integrating temporal positional encoding and mask propagation . 
    \item Simple enough, we come up with an uncertainty estimation technique to selectively choose which pseudo labels are useful for next cycle of training.
\end{itemize}