Priming Deep Pedestrian Detection with Geometric Context

Ishani Chakraborty, Gang Hua

21 Feb 2020OpenReview Archive Direct UploadReaders: Everyone

Abstract: We investigate the role of geometric context in deep neural networks to establish better pedestrian detectors that are more robust to occlusions. Notwithstanding their demonstrated successes in the wild, deep object detectors underperform in crowded scenes with high intra-category occlusions. One brute-force solution is to collect a large number of labeled training samples under occlusion, but the combinatorial increase in the labeling effort makes it an unaffordable solution. We argue that a promising and complementary direction to solve this problem is to bring geometric context to modulate feature learning in a DNN. We identify that an effective way to leverage geometric context is to induce it in two steps - through early fusion, by guiding region proposal generation to focus on occluded regions, and through late fusion, by penalizing misalignments of bounding boxes in both 2D and 3D. Our experiments on multiple state-of-the-art DNN detectors and several detection benchmarks clearly demonstrates that our proposed method outperforms strong baselines by an average of 5%.

0 Replies