Abstract: Highlights
• A conditional generative data-free distillation framework is proposed.
• An auxiliary class constraint is introduced to realize semi-supervised learning.
• The trained student network imitates the teacher's attention well.
• We obtain state-of-the-art results on most experimental datasets.