[Re] Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation

Anonymous

[Re] Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation

Anonymous

05 Feb 2022 (modified: 22 Jun 2025)ML Reproducibility Challenge 2021 Fall Blind SubmissionReaders: Everyone

Keywords: panoptic-deeplab, deep learning, computer vision

TL;DR: Reproduction of a bottom up approach to panoptic segmentation

Abstract: Scope of Reproducibility: The original work by Cheng et al. introduces Panoptic-DeepLab - a novel architecture for panoptic segmentation, claiming to achieve comparable performance to two-stage, top down approaches while yielding fast inference speeds. At the time of publication, Panoptic-Deeplab claims to have ranked first in all three cityscapes benchmarks (specifically: mIoU, AP \& PQ). Methodology: As the original paper authors published their source code, our codebase integrates sections of their codebase, while re-implementing components intrinsic to the main claim we are attempting to evaluate. We also studied the source code, using information provided from it and the pipelines to augment our understanding from what the paper described. While we initially attempted a code-blind reproduction, it was soon determined to be unfeasible following which a hybrid approach was instantiated. Results: While we successfully reproduced the given architecture, we have been unable to train it. Therefore: our contributions currently remaining exclusive to architecture, and certain unit tests within the system itself. We also highlight potential low-level Tensorflow that were pitfalls to our development, that may be advantageous to investigate. What was easy: The authors of the paper structured their contributions on well-documented and tested frameworks such as ResNet and DeepLabV3+, while training on popular datasets such as Cityscapes and Mapillary Vistas. Consequently, setting up the dataset and the environment to reproduce the given research was straightforward. What was difficult: A significant hurdle we came across during our reading of the paper was vagueness within the expected implementation. This extended from the architecture to the training regime. The descriptions provided, although accurate, were presented as a high-level overview, with the expectation of a lot of prior domain knowledge. This resulted in a significant time-sink, following which we looked into the codebase for necessitated context. Despite the well-structured objected oriented implementation through which the code was written, we found certain sections hard to understand. We observed convoluted re-implementations of high level functions already part of Tensorflow as part of the codebase. However, this could have been a direct result of the implementation not using the now-popularised Functional API within Tensorflow, which may have resulted in the required use of custom layers. % better language required here Communication with original authors: We communicated with the authors over e-mail, resolving doubts that arose while reading the paper. It is also through author communication that we were directed to the codebase, as although public - the relevant repository wasn't mentioned within the paper.

Paper Url: https://openaccess.thecvf.com/content_CVPR_2020/papers/Cheng_Panoptic-DeepLab_A_Simple_Strong_and_Fast_Baseline_for_Bottom-Up_Panoptic_CVPR_2020_paper.pdf

Paper Venue: Not in list

Venue Name: CVPR 2020

Supplementary Material: zip

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/panoptic-deeplab-a-simple-strong-and-fast/code)

4 Replies

Loading