Abstract: Highlights•We highlight the limitations of previous KD methods based on forward KL divergence.•Datasets are split to tackle mode-averaging and teacher errors in uncertain images.•Correct samples use RKLD loss, while incorrect samples encourage student self-learning.•Our method achieves superior performance in both classification and object detection.
External IDs:dblp:journals/eaai/KimHLK25
Loading