A Broad Dataset is All You Need for One-Shot Object Detection

Claudio Michaelis; Matthias Bethge; Alexander S Ecker

A Broad Dataset is All You Need for One-Shot Object Detection

Claudio Michaelis, Matthias Bethge, Alexander S Ecker

Published: 28 Jan 2022, Last Modified: 13 Feb 2023ICLR 2022 SubmittedReaders: Everyone

Keywords: One-Shot Learning, Object Detection, Generalization, Instance Segmentation

Abstract: Is it possible to detect arbitrary objects from a single example? A central problem of all existing attempts at one-shot object detection is the generalization gap: Object categories used during training are detected much more reliably than novel ones. We here show that this generalization gap can be nearly closed by increasing the number of object categories used during training. Doing so allows us to improve generalization from seen to unseen classes from 45% to 89% and improve the state-of-the-art on COCO by 5.4 AP50 (from 22.0 to 27.5). We verify that the effect is caused by the number of categories and not the number of training samples, and that it holds for different models, backbones and datasets. This result suggests that the key to strong few-shot detection models may not lie in sophisticated metric learning approaches, but instead simply in scaling the number of categories. We hope that our findings will help to better understand the challenges of few-shot learning and encourage future data annotation efforts to focus on wider datasets with a broader set of categories rather than gathering more samples per category.

One-sentence Summary: The key to successful one-shot detection of arbitrary objects is training on a dataset with a sufficiently broad set of categories.

6 Replies

Loading