Abstract: Object counting aims to estimate the number of objects in images. The leading counting approaches focus on single-category counting tasks and achieve impressive performance. Nevertheless, there are multiple categories of objects in real scenes. Multi-class object counting expands the scope of application of object counting tasks. The multi-target detection task can achieve multi-class object counting in some scenarios. However, it requires the dataset annotated with bounding boxes. Compared with the point-level annotations used in mainstream object counting issues, the box-level annotations are more difficult to be obtained. In this paper, we propose a simple yet efficient counting network based on point-level annotations. Specifically, we first change the traditional estimated density map from one to the number of categories to achieve multi-class object counting. Since all categories of objects use the same feature extractor, their features will interfere mutually in the shared feature space. We further design a multi-mask structure to suppress the negative interaction among objects. Extensive experiments on the challenging benchmarks demonstrate that the proposed method achieves state-of-the-art counting performance. <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">The code is available at <uri>https://github.com/PRIS-CV/DSACA</uri>.</i>
0 Replies
Loading