Abstract: Highlights•Online elitist multiple students ensembling distillation framework with a supervisor.•Supervisor learns student expertise using: input, ground truth, student predictions.•Supervisor - discarded at test time. Only the best student is extracted and used.•Extensive experiments show consistent improvements over vanilla trained students.
External IDs:dblp:journals/cviu/BorzaIMD23
Loading