# Transformer^2 Implementation

## Installation

```bash
pip install -r requirements.txt
cd evaluation/fishfarm
pip install -e .
```

## Usage

### Training

Training on the separate tasks described in our work can be replicated with the following [hydra](https://hydra.cc/) command.
Change the arguments to choose among models and tasks.

```bash
python svd_reinforce_hydra.py \
    base_model@global=llama3i8b \
    task@global=mbpp \
    mode@global=training 
```
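When switching between models and tasks often, it can help to assemble the overrides in one place. The snippet below is a minimal sketch, not part of the repository: `build_cmd` is a hypothetical helper name, and it only prints the command so it can be inspected before anything is run. The example values are the ones used in the training command above.

```bash
# Hypothetical helper (not provided by this repository): prints the command
# for a given base model, task, and mode without executing it.
build_cmd() {
    # $1 = base model, $2 = task, $3 = mode
    printf 'python svd_reinforce_hydra.py base_model@global=%s task@global=%s mode@global=%s\n' \
        "$1" "$2" "$3"
}

# Dry run: show the command for the example above.
build_cmd llama3i8b mbpp training
```

To execute the printed command instead of only displaying it, the output can be piped to a shell, for example `build_cmd llama3i8b mbpp training | sh`.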

### Evaluation
Change the arguments to choose among models and tasks.

Evaluation on a seen fine-tuning task:
```bash
python svd_reinforce_hydra.py \
    base_model@global=llama3i8b \
    task@global=mbpp_2 \
    mode@global=eval 
```

Evaluation on an unseen task with CEM:
```bash
python svd_reinforce_hydra.py \
    base_model@global=mistral03i7b \
    task@global=few_shot_math \
    optimization@global=cem \
    policy@global=wcomb
```

