# TOAM-YOLO: Tiny Object Detection Framework

A lightweight YOLO framework designed for detecting tiny objects (< 20×20 pixels) using a novel TOA-MoE (Tiny Object Attention with Mixture-of-Experts) module.

## About TOAM-YOLO

TOAM-YOLO addresses the challenge of detecting tiny objects across diverse domains like aerial surveillance, medical imaging, and maritime monitoring. The framework introduces:

- **TOA-MoE Module**: Two-stage attention mechanism combining multi-domain experts (Spatial, Frequency, Curvature) with multi-granularity analysis (Pixel, Patch, Region)
- **Enhanced Architecture**: DCNv3 deformable convolutions, CARAFE upsampling, and BiFPN-style feature fusion
- **Lightweight Design**: Only 1.48M additional parameters over YOLOv12n baseline

**Performance**: Achieves +14.9% mAP@0.5 on SeaPerson, +11% on VisDrone, and significant improvements on medical datasets.

## Quick Start

### 1. Setup Environment

1. Create a Conda environment
```bash
conda create -n toam-yolo python=3.12
```

2. Install the packages from requirements.txt

```bash
pip install -r requirements.txt
```

3. If you encounter an error, reinstall PyTorch as per your cuda version. Refer to [Pytorch](https://pytorch.org/get-started/locally/) for info on the command to run for your system.
```bash
pip install torch torchvision torchaudio
```
4. Setup DCNv3
```bash
cd ultralytics/ops_dcnv3/
bash make.sh
cd ../../
```
5. Install flash attention
```bash
pip install flash-attn --no-build-isolation
```

### 2. Prepare Dataset
Organize your dataset in YOLO format:
```
datasets/your_dataset/
├── data.yaml            # Dataset config (paths, classes, names)
├── images/
│   ├── train/           # Training images
│   ├── val/             # Validation images
│   └── test/            # Testing images
└── labels/
    ├── train/           # Training labels (.txt files)
    ├── val/             # Validation labels
    └── test/            # Testing labels
```

### 3. Training

**Single Training Run:**
```bash
python main.py \
    --model_yaml configs/toam-yolo.yaml \
    --data_yaml datasets/your_data.yaml \
    --run_name experiment1 \
    --epochs 100 \
    --batch_size 16
```

**Batch Experiments (Multiple Models/Datasets):**
1. Edit `main.bash` and add all your dataset YAML files to the DATASETS array:
```bash
MODELS=("configs/toam-yolo.yaml" "configs/toe-yolo.yaml")
DATASETS=(
    "datasets/dataset1/data.yaml" 
    "datasets/dataset2/data.yaml"
    "datasets/dataset3/data.yaml"
    # Add all your dataset configurations here
)
```

2. Run all combinations:
```bash
bash main.bash
```

## Model Variants & Ablations

- **toam-yolo.yaml** - Complete TOAM-YOLO framework with TOA-MoE module
- **toe-yolo.yaml** - Ablation study: architectural enhancements without TOA-MoE
- **yolov12.yaml** - Baseline YOLOv12n for comparison

## Training Parameters

Common parameters:
- `--epochs 100` - Training epochs
- `--batch_size 16` - Batch size (adjust based on GPU memory)
- `--project runs/train` - Output directory
- `--results_csv results.csv` - Results logging file

Training automatically evaluates on validation and test sets, saving results to CSV for easy comparison across experiments.
