## Datasets
For miniImageNet and tieredImageNet, we downloaded the data from [DeepBDC](https://github.com/Fei-Long121/DeepBDC).

For CUB-200-2011, we downloaded the data from [DeepEMD](https://github.com/icoz69/DeepEMD).

## Pre-training
We followed [DeepBDC](https://github.com/Fei-Long121/DeepBDC) for pre-training.

## Training and Testing
1. Download RoBERTa base model from [huggingface](https://huggingface.co/roberta-base) and put it under roberta-base/, which contains config.json, merges.txt, pytorch_model.bin, tokenizer.json and vocab.json
2. Download datasets and put them under the corresponding folders in datasets/
3. Pre-train the models and put the weights under the corresponding folders in model_pt/
4. Modify the '-data_dir' in utils.py
5. Run the corresponding scripts in scripts/