# GMR: General Multimodal Reranker

This folder contains code and example data for the General Multimodal Reranker (GMR) project. GMR is designed for training and evaluating reranking models on multi-modal retrieval tasks, supporting both text and image modalities.

## Folder Structure

- `data/` — Contains example datasets for training and testing.
	- `train_data/` — Training set with corpus, queries, and instance files.
	- `test_data/` — Test set with corpus, queries, and relevance labels.
- `eval/` — Evaluation scripts and reranker implementations.
	- `encoder_inference.py` — Inference for encoder models.
	- `evaluate_pipeline.py` — End-to-end evaluation pipeline.
	- `GMR_reranker.py` — Main reranker implementation.
	- `index_builder.py` — Index construction for retrieval.
	- `jina_reranker.py`, `Mono_Qwen2vl_reranker.py`, `Qwen3_reranker.py` — Alternative reranker models.
	- `ranker.py` — Ranking logic and utilities.
	- `MRB_task_example.json` — Example and task configuration files.
- `train/` — Training scripts and model definitions.
	- `arguments.py` — Argument parsing for training.
	- `data.py` — Data loading and preprocessing.
	- `finetune.py` — Fine-tuning script for reranker models.
	- `loss.py` — Loss functions for training.
	- `model.py` — Model architectures.
	- `trainer.py` — Training loop and utilities.
	- `data_config_gmr_example.yaml` — Example data configuration.

## Getting Started

1. **Prepare Data**: Place your training and test data in the `data/train_data/` and `data/test_data/` folders, following the provided format (see example files).
2. **Configure Training**: Edit `train/data_config_gmr_example.yaml` to specify your dataset and training parameters.
3. **Train Model**: Run the training script:
	 ```bash
	 python train/finetune.py --config train/data_config_gmr_example.yaml
	 ```
4. **Evaluate Model**: Use the evaluation pipeline:
	 ```bash
	 python eval/evaluate_pipeline.py \
        --tasklist_path eval/MRB_task_example.yaml \
        --model_name_or_path $model_name_or_path
	 ```

## Requirements

- Python 3.8+
- PyTorch

