Decoupling Representation and Classifier for Long-Tailed Recognition

Bingyi Kang; Saining Xie; Marcus Rohrbach; Zhicheng Yan; Albert Gordo; Jiashi Feng; Yannis Kalantidis

Decoupling Representation and Classifier for Long-Tailed Recognition

Bingyi Kang, Saining Xie, Marcus Rohrbach, Zhicheng Yan, Albert Gordo, Jiashi Feng, Yannis Kalantidis

Published: 20 Dec 2019, Last Modified: 03 Apr 2024ICLR 2020 Conference Blind SubmissionReaders: Everyone

Abstract: The long-tail distribution of the visual world poses great challenges for deep learning based classification models on how to handle the class imbalance problem. Existing solutions usually involve class-balancing strategies, e.g., by loss re-weighting, data re-sampling, or transfer learning from head- to tail-classes, but most of them adhere to the scheme of jointly learning representations and classifiers. In this work, we decouple the learning procedure into representation learning and classification, and systematically explore how different balancing strategies affect them for long-tailed recognition. The findings are surprising: (1) data imbalance might not be an issue in learning high-quality representations; (2) with representations learned with the simplest instance-balanced (natural) sampling, it is also possible to achieve strong long-tailed recognition ability by adjusting only the classifier. We conduct extensive experiments and set new state-of-the-art performance on common long-tailed benchmarks like ImageNet-LT, Places-LT and iNaturalist, showing that it is possible to outperform carefully designed losses, sampling strategies, even complex modules with memory, by using a straightforward approach that decouples representation and classification. Our code is available at https://github.com/facebookresearch/classifier-balancing.

Keywords: long-tailed recognition, classification

Code: [![github](/images/github_icon.svg) facebookresearch/classifier-balancing](https://github.com/facebookresearch/classifier-balancing) + [![Papers with Code](/images/pwc_icon.svg) 3 community implementations](https://paperswithcode.com/paper/?openreview=r1gRTCVFvB)

Data: [AwA](https://paperswithcode.com/dataset/awa-1), [CIFAR-10](https://paperswithcode.com/dataset/cifar-10), [CUB-200-2011](https://paperswithcode.com/dataset/cub-200-2011), [ImageNet](https://paperswithcode.com/dataset/imagenet), [ImageNet-LT](https://paperswithcode.com/dataset/imagenet-lt), [Places](https://paperswithcode.com/dataset/places), [Places-LT](https://paperswithcode.com/dataset/places-lt), [SUN](https://paperswithcode.com/dataset/sun), [iNaturalist](https://paperswithcode.com/dataset/inaturalist)

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 2 code implementations](https://www.catalyzex.com/paper/arxiv:1910.09217/code)

Original Pdf: pdf

12 Replies

Loading