An empirical analysis of feature fusion task heads of ViT pre-trained models on OOD classification tasks

Published: 01 Jan 2025, Last Modified: 20 May 2025. J. Syst. Softw. 2025. CC BY-SA 4.0
Abstract: Highlights
• Complex task heads should be applied to larger datasets.
• The task head’s structure impacts model performance and generalization.
• The fusion middle-layer location affects the backbone network characteristics.
• Attention mechanism makes the task head more convex and stable.