Abstract: We propose a novel regularization method that effectively trains a neural network to avoid overfitting and thereby improves performance. The core idea is to bridge the gap between the predictive distributions derived from two popular image mixture techniques, Mixup and CutMix, through a class-wise ensemble distribution. Consistent optimization toward these three distributions is performed via mutual distillation, guiding the model to alleviate over-confident predictions and to robustly learn discriminative features as classification evidence. Experiments across various image classification tasks show that our method achieves significantly better performance than previous data augmentation (Mixup+CutMix) and self-knowledge distillation (Self-KD) methods.
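To make the core idea concrete, the following is a minimal sketch of what a class-wise ensemble plus mutual-distillation objective between Mixup and CutMix branches could look like. The function name, temperature, equal-weight averaging, and symmetric KL terms are illustrative assumptions based only on the abstract, not the paper's exact formulation.

```python
import torch
import torch.nn.functional as F


def mutual_distillation_loss(logits_mixup: torch.Tensor,
                             logits_cutmix: torch.Tensor,
                             temperature: float = 4.0) -> torch.Tensor:
    """Hypothetical sketch: bridge the Mixup and CutMix predictive
    distributions through their class-wise ensemble via mutual distillation."""
    p_mixup = F.softmax(logits_mixup / temperature, dim=1)
    p_cutmix = F.softmax(logits_cutmix / temperature, dim=1)

    # Class-wise ensemble of the two predictive distributions
    # (assumed here to be a simple average).
    p_ensemble = 0.5 * (p_mixup + p_cutmix)

    # Each branch is pulled toward the shared ensemble, so the two
    # augmentation views regularize one another.
    kl_mixup = F.kl_div(torch.log(p_mixup + 1e-8), p_ensemble, reduction="batchmean")
    kl_cutmix = F.kl_div(torch.log(p_cutmix + 1e-8), p_ensemble, reduction="batchmean")
    return (temperature ** 2) * (kl_mixup + kl_cutmix)
```

In practice this auxiliary term would be added to the standard (mixed-label) cross-entropy losses of the Mixup and CutMix branches with some weighting coefficient; the exact combination used by the authors is not specified in the abstract.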