
# MedSIGHT — Project Overview

This repository contains code for MedSIGHT. There are 2 main folders on this project: RegLLM (the code for MedSIGHT LVLM backbone) and source (the code for Region Perceiver training and preparation). Both of the folders include codes for training, evaluation, and inference. The README below summarizes the code layout, important scripts, configuration options, and quick run examples.

## Repository layout (high level)

- `RegLLM/` - Main project package and scripts for the MedSIGHT
	- `RegLLM/RegSeg.py` - The implementation of MedSIGHT structure.
	- `RegLLM/llava/` - Core LVLM code with LLaVA-like structure (model builders, evaluation, training, serving).
		- `eval/` - evaluation utilities and CLI
			- `inference_batch.py` - batch inference script (configurable `ROOT_PATH`)
			- `cli_v1.py`, `inference_batch.py`, `eval_batch.sh` - eval tooling
			- `eval_metrics/` - metric implementations and helper files
		- `model/` - model construction and weight/delta utilities
			- `language_model/` - integrations Qwen as a LVLM.
		- `train/` - Training codes
    - `RegLLM/scripts/` - Core training scripts with both alignment stages and final instruction tuning stage.
- `source/` - Region Perceiver codes (including Dataset and training sources)
	- `dataset/` - dataset loaders and builders (e.g., `biomed_seg.py`)
	- `tokenizer/` - Region Perceiver implementation and config, as well as the codebook quantizer implementation
    - `utils/` - tools and helpers for region perceiver training
    - `scripts/` - training scripts of both region perceiver and codebook

## Key scripts and where to find them

### Region Perceiver used in MedSIGHT

- Training the Region Perceiver and codebook: `source/scripts/`

- Training codes: `source/tokenizer/vq_train.py`

### MedSIGHT LVLM

- Training the MedSIGHT: `RegLLM/scripts/`

- Batch inference: `RegLLM/llava/eval/inference_batch.py`
	- Uses a `root_path` variable.
	- Calls `RegLLM.llava.eval.cli_v1.RegLLMChatbot` to run model inference on datasets (VQA-RAD, SLAKE, PathVQA, BiomedParse, OmniMed).

- Training codes: `RegLLM/llava/train/train_seg.py`.

- Evaluation CLI: `RegLLM/llava/eval/cli_v1.py` provide programmatic and CLI interfaces to evaluate models.





