# FedNano: Federated Learning for Vision-Language Models

## Overview

FedNano is a comprehensive framework for federated learning with vision-language models (VLMs). This project implements federated learning strategies for various VLM architectures, including LLaVA and MiniGPT-4, with support for multiple datasets and evaluation metrics.

## Table of Contents

- [Installation](#installation)
- [Datasets](#datasets)
- [Experiments](#experiments)
- [Computing Infrastructure](#computing-infrastructure)
- [Evaluation Metrics](#evaluation-metrics)


## Installation

### Prerequisites

- Python 3.10+
- CUDA 11.7+ 

### Environment Setup

```bash
# Clone the repository
git clone https://github.com/your-username/FedNano.git
cd FedNano

# Create conda environment
conda create -n fednano python=3.11
conda activate fednano

# Install dependencies
pip install -r requirements.txt
```

### Dependencies

Key dependencies include:
- PyTorch 2.0+
- Transformers 4.30+
- LLaVA
- MiniGPT-4


## Datasets

### Supported Datasets

1. **ScienceQA (SQA)**
   - Description: Multi-modal science question answering
   - Size: ~12K questions
   - Format: Question-Answer pairs with images

2. **OKVQA**
   - Description: Outside Knowledge VQA dataset
   - Size: ~14K questions
   - Format: Visual question answering

3. **IconQA**
   - Description: Icon-based question answering
   - Size: ~107K questions
   - Format: Icon understanding and reasoning

4. **GQA**
   - Description: Visual reasoning dataset
   - Size: ~113K questions
   - Format: Scene graph-based questions


## Experiments

### Experimental Setup

All experiments follow a standardized protocol:

1. **Data Partitioning**: Dirichlet distribution (α ∈ {0.1, 0.3, 0.5, 1.0})
2. **Client Selection**: 5 or 10 clients per experiment
3. **Communication Rounds**: 1 round minimum
4. **Local Epochs**: 1-5 epochs per round
5. **Random Seeds**: Fixed seeds for reproducibility



## Computing Infrastructure

- **GPU**: NVIDIA A100 (80GB)

## Evaluation Metrics
**Accuracy**: Standard accuracy for classification tasks

