Maximizing discrimination capability of knowledge distillation with energy function

Published: 01 Jan 2024, Last Modified: 05 Mar 2025, Knowl. Based Syst. 2024, CC BY-SA 4.0
Abstract: Highlights
• We confirm that a sample’s energy is crucial for the model’s discrimination ability.
• An energy-dependent temperature is used to transfer the teacher’s optimal knowledge.
• Augmenting high-energy images yields high performance at lower computational cost.
• Energy-based methods boost performance on challenging datasets, showing practicality.
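The highlights do not spell out how energy maps to temperature, so the following is only a rough sketch of the idea, not the authors' implementation. It assumes the common free-energy score over logits, E(x) = -log Σ_k exp(f_k(x)), and a hypothetical thresholding rule (`pick_temperature`, with made-up cut-offs and temperature values) in which low-energy, confident samples are distilled with a higher temperature and high-energy samples with a lower one. The direction of that mapping, the thresholds, and all function names are assumptions for illustration.

```python
import math


def energy(logits):
    """Free-energy score of one sample: E = -log(sum_k exp(logit_k))."""
    m = max(logits)  # subtract the max for numerical stability
    return -(m + math.log(sum(math.exp(z - m) for z in logits)))


def softmax(logits, temperature):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def pick_temperature(e, low_cut, high_cut, base_t=4.0, t_low=5.0, t_high=3.0):
    """Hypothetical rule (an assumption, not taken from the paper):
    low-energy samples get t_low, high-energy samples get t_high,
    everything in between keeps the base temperature."""
    if e <= low_cut:
        return t_low
    if e >= high_cut:
        return t_high
    return base_t


def kd_loss(teacher_logits, student_logits, temperature):
    """Standard distillation loss: T^2 * KL(teacher || student)."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl


# Example: a confident teacher output has lower energy than an uncertain one.
confident = [10.0, 0.0, 0.0]
uncertain = [1.0, 1.0, 1.0]
t_conf = pick_temperature(energy(confident), low_cut=-5.0, high_cut=-3.0)
t_unc = pick_temperature(energy(uncertain), low_cut=-5.0, high_cut=-3.0)
loss = kd_loss(confident, [2.0, 1.0, 0.0], t_conf)
```

In this sketch the per-sample temperature replaces the single global temperature of ordinary distillation; in practice the cut-offs would have to be chosen from the energy distribution of the teacher's outputs on the training set.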