# SATPool

This repository contains self-contained scripts for: RAG (run_rag.py), Multimodal RAG (run_multimodal_rag.py), Classification (run_classification.py), and Factuality evaluation (run_factuality.py).

## Install dependencies

```bash
pip install -r requirements.txt
```


## 1. RAG (run_rag.py)
Text-only retrieval-augmented generation.

### Preprocessing
1. Run `wiki_id.py` for Wikipedia-based RAG preprocessing.
2. Use `preprocessing_summarizer.py` to prepare generation caches:

```bash
python preprocessing_summarizer.py --dataset nq --model bert --mode cls
```
### Key Options
- `--dataset`: Dataset name - 'nq','hotpotqa'(default: 'nq')
- `--model`: Backbone model - 'bert', 'gtr' (default: 'bert')
- `--mode`: Pooling mode - 'sat', 'cls', 'mean','maxp','lc'
- `--batch_size`: Training batch size (default: 16)
- `--num_epochs`: Training epochs (default: 100)
- `--lr`: Learning rate (default: 1e-5)
- `--proj_dim`: Projection dimension (default: 512)
- `--k_for_gen`: Number of passages for generator context (default: 5)

**Example for retriver training**
```bash
python run_rag.py --dataset nq --model bert --mode sat --batch_size 16 --num_epochs 50
```
**Example for generator training**
```bash
python run_rag.py --dataset nq --model bert --mode sat --batch_size 16 --num_epochs 50 --gen_only --k_for_gen 5
```
## 2. Multimodal RAG (run_multimodal_rag.py)
Vision-language retrieval-augmented generation.

### Key Options
- `--dataset`: Dataset name - 'okvqa', 'evqa'(default: 'okvqa')
- `--model`: Backbone model - 'clip', 'ravqa' (default: 'clip')
- `--mode`: Pooling mode - 'sat', 'cls', 'mean','maxp','lc'
- `--image_root`: Path to image directory (COCO, EVQA)
- `--batch_size`: Training batch size (default: 16)
- `--num_epochs`: Training epochs (default: 100)
- `--lr`: Learning rate (default: 1e-4)
- `--proj_dim`: Projection dimension (default: 512)
- `--k_for_gen`: Number of passages for generator context (default: 5)

**Example for retriver training**
```bash
python run_multimodal_rag.py --dataset okvqa --model clip --mode sat --image_root /path/to/images
```
**Example for generator training**
```bash
python run_multimodal_rag.py --dataset okvqa --model clip --mode sat --image_root /path/to/images --gen_only
```
### Note
- Image Description (caption, object detection result) for RAVQA is extracted and included in the repository.

## 3. Classification (run_classification.py)
Text classification tasks.

### Key Options
- `--dataset`: Dataset name '20news', 'eurlex'
- `--model`: Backbone model - 'bert', 'roberta', 'longformer', 'longt5' (default: 'bert')
- `--mode`: Pooling mode - 'sat', 'cls', 'mean','maxp','lc'
- `--classifier`: Classifier type (default: 'mlp')
- `--batch_size`: Training batch size 
- `--num_epochs`: Training epochs (default: 50)
- `--lr`: Learning rate (default: 1e-4)
- `--proj_dim`: Projection dimension (default: 256)

**Example:**
```bash
python run_classification.py --dataset 20news --model bert --mode sat --batch_size 8
```

## 4. Factuality Evaluation (run_factuality.py)
Factuality consistency evaluation.
### Preparation for QAGS dataset
```bash
git clone https://github.com/W4ngatang/qags.git
``` 

### Key Options
- `--mode`: Interaction mode - 'mean', 'sat'
- `--K`: Number of frequency bins for STC (default: 4)
- `--proj_dim`: Projection dimension (default: 512)

**Example:**
```bash
python run_factuality.py --mode sat --K 4
```


