# Code submission

We document the code and instructions for reproducing our NLP-LR results in Table 1. Upon publication, we will open-source the code for the baselines and our methods across all datasets on GitHub, ensuring reproducibility of all results in the paper.

## Environment

Create a Conda environment and install the packages listed in `requirements.txt`.
```
conda create --name opensource python=3.10
conda activate opensource
pip install -r requirements.txt
```

## Dataset

Download the data.
```
python download_data.py
```

## Commands

Zero-Shot:
```
accelerate launch --mixed_precision bf16 train.py \
    --experiment_name zeroshot \
    --zero_shot \
    --eval_split 87
```

ICL:
```
accelerate launch --mixed_precision bf16 train.py \
    --experiment_name icl \
    --eval_split 87
```

CT-KV:
```
accelerate launch --mixed_precision bf16 train.py \
    --experiment_name ctkv \
    --epochs 200 \
    --eval_split 87
```

Note: the Table 1 results in the paper are averaged over 5 eval splits (13, 21, 42, 87, and 100); the commands above run only split 87.
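
To cover all five eval splits from the note above, the CT-KV command can be wrapped in a loop. The following is a minimal sketch, not part of the released scripts: the function name `run_ctkv_splits`, the `RUN` dry-run variable, and the per-split experiment names (`ctkv_split13`, ...) are hypothetical and introduced here for illustration only. In particular, it assumes `--experiment_name` is a free-form label; if it is not, keep `ctkv` for every split.

```shell
# Eval splits listed in the note above.
SPLITS="13 21 42 87 100"

# Run the CT-KV command once per eval split.
# Set RUN=echo to print the commands instead of executing them (hypothetical helper variable).
run_ctkv_splits() {
    for split in $SPLITS; do
        # Assumption: a distinct experiment name per split keeps outputs from colliding.
        ${RUN:-} accelerate launch --mixed_precision bf16 train.py \
            --experiment_name "ctkv_split${split}" \
            --epochs 200 \
            --eval_split "$split"
    done
}
```

Call `run_ctkv_splits` to launch the five runs in sequence, or `RUN=echo run_ctkv_splits` to preview the commands first. The same pattern applies to the Zero-Shot and ICL commands by swapping in their flags.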
