
# Granularity-Aware Adaptive Classifier Expansion via Zero-Shot Learning
This is the repository for "<i>Granularity-Aware Adaptive Classifier Expansion via Zero-Shot Learning</i>" submission.

<p align="center">
  <img src="figs/framework.png" />
</p>
## Preliminary

1. Download the fine-tuned CUB, AWA2, and SUN features, which can be obtained from [here](https://github.com/uqzhichen/SDGZSL) (originally from [this paper](https://github.com/akshitac8/tfvaegan)).
2. Download the pre-trained and fine-tuned ResNet101 features for CUB, AWA2, and SUN datasets. The original image datasets are available: [CUB](https://www.vision.caltech.edu/datasets/cub_200_2011/), [AWA2](https://cvml.ista.ac.at/AwA2/), and [SUN](https://groups.csail.mit.edu/vision/SUN/hierarchy.html). Images are needed to illustrate the method in qualitative insights and case studies. However, this paper does not utilize the original images for the method.
3. Refer to `requirements.py` to install the required Python environment.
4. Modify the default values for "--dataroot" and "--rootpath" in `main.py` to specify the path to data and the directory for saving outputs, respectively.
5. Prepare the API key and access URL for LLMs, and modify it in `LLM.py`. Download and prepare the open-source LLMs.


## Usage

### Step 1: Multi-Source Semantic Generation

We start by generating descriptive texts for multi-source semantics, and embedding vectors through different embedding models for three benchmark datasets and ImageNet based on LLMs. This is achieved by running the following program.

```bash
$ python LLM_desc_gen/f_inv_desc_gen.py
$ python LLM_desc_gen/gen_desc_imageNet.py
```

By running the above program, we can directly and fully automatically generate multi-source semantic descriptions, and form embeddings for subsequent modules.


### Step 2: Class-Level Structure Discovery and Adaptive Classifier Synthesis

Since both modules are involved in training, we will explain them together. We provide example scripts to run, all of which start with `main.py`. For example, to run the training and evaluation on AWA2, CUB, and SUN dataset, please run the following script:

```bash
$ CUDA_VISIBLE_DEVICES=0 python main.py --cuda --manualSeed 0 --dataset=AWA2 --image_embedding=res101_finetuned --class_embedding=clip --expert=att --factual_branch=attention --cos_sim_loss --seperate_loss --llm=gpt4o --cluster_num 6 --include_unseen --conclude_inv --concatenation --num_layers 2 --beta1 0.9 --lr 0.00001 --batch_size 10 --embed_dim 2048 --strict_eval --early_stopping_slope --calc_entropy --save_pred_matrix --nepoch=1000
```
