Abstract: Highlights
• We design modal-specific prompts from the perspective of attention-based interaction.
• Combining instance-level and task-level information simultaneously.
• Achieving SOTA average results in both base-to-novel and few-shot settings.