We present a novel two-stage method that applies 2D-based techniques to the 3D segmentation task. In most previous challenges, 2D CNNs have rarely been competitive with 3D CNNs because 2D models can hardly capture temporal (inter-slice) information. To address this, we adopt a recent state-of-the-art video object segmentation technique and combine it with semi-supervised training techniques to leverage the extensive unlabeled data. Moreover, we use uncertainty estimation to generate pseudo-labels that are both plausible and consistent, which are then used for further retraining. Our code is publicly available on \href{https://github.com/kaylode/ivos}{GitHub}.