You can run the code by running run.sh.

However, you should first set up LLaMA2-Accessory, which is used for image captioning.

data_selection/extract_feature.py extracts the image features and builds the image embedding pool. data_selection/sample_tools/VeCAF_ImageNet.py integrates ODS and CEA.
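
As a rough illustration of what an image embedding pool can look like, the sketch below stacks per-image feature vectors into one array and L2-normalizes each row. It is not taken from extract_feature.py: the function name `build_embedding_pool`, the random stand-in features, and the normalization step are all assumptions made for this example.

```python
import numpy as np


def build_embedding_pool(features):
    """Stack per-image feature vectors into an (N, D) array with unit-norm rows.

    Hypothetical helper: the repository's own pool format may differ.
    """
    pool = np.asarray(features, dtype=np.float32)
    # Assumption: rows are L2-normalized so dot products act as cosine similarity.
    norms = np.linalg.norm(pool, axis=1, keepdims=True)
    return pool / np.clip(norms, 1e-12, None)


# Random vectors stand in for features produced by a pretrained image encoder.
rng = np.random.default_rng(0)
fake_features = rng.normal(size=(8, 16))
pool = build_embedding_pool(fake_features)
```

With unit-norm rows, `pool @ pool.T` gives pairwise cosine similarities, which is a common starting point for distance-based sample selection.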

We follow the DeiT code (/deit) in the fine-tuning stage.