# COBRA: Fair Dataset Condensation

This repository contains the implementation of **COBRA** and **FairDD**, frameworks for generating synthetic datasets that are both computationally efficient (small size) and fair (unbiased regarding sensitive attributes).

The code supports three main condensation backbones:
1.  **DC** (Dataset Condensation - Gradient Matching)
2.  **DM** (Distribution Matching - Feature Matching)
3.  **IDC** (Improved Dataset Condensation - Multi-formation/Patching)

## 🛠 Installation

1.  **Clone the repository**.
2.  **Install dependencies**:
    ```bash
    pip install -r requirements.txt
    ```

## 📂 Data Preparation

Ensure your datasets are located in a `data/` directory.
* **CIFAR10-S (Skewed)**: Handled by `data_handler/cifar10.py`.
* **BFFHQ**: Handled by `data_handler/bffhq.py`.
* **UTKFace**: Handled by `data_handler/utkface.py`.

## 🚀 How to Run Experiments

### 1. Distribution Matching (DM)
This is generally the fastest method. We provide scripts for different fairness strategies.

#### A. FairDD Method (Group-wise Matching)
Matches the synthetic data's feature distribution to *each* demographic group of the real data independently.

```bash
python main_DM_fairdd.py --dataset CIFAR10_S_90 --ipc 50 --FairDD