# Magnitude-Distance: Generative Adversarial Networks with Magnitude-Based Distance Metrics

This repository contains the implementation and experiments for **Magnitude-Based Generative Networks (MagGN)**, a novel approach to training GANs using magnitude-based distance metrics instead of traditional discriminator-based adversarial training.

## Table of Contents
- [Overview](#overview)
- [Environment Setup](#environment-setup)
- [Project Structure](#project-structure)
- [Main Experiments](#main-experiments)
  - [1. MagGN for MNIST, CIFAR-10, and CelebA](#1-maggn-for-mnist-cifar-10-and-celeba)
  - [2. Outlier Robustness (2D Example)](#2-outlier-robustness-2d-example)
  - [3. High Dimensional Performance](#3-high-dimensional-performance)
- [Core Implementation](#core-implementation)
- [Citation](#citation)

---

## Overview

This project introduces **Magnitude-Based Generative Networks (MagGN)**, using magnitude theory from enriched category theory to measure the distance between distributions, providing:

- **Improved stability** in GAN training
- **Robustness to outliers** in the data distribution
- **Better performance** in high-dimensional spaces
- **Direct distance optimization** without adversarial training

### Key Features

- **Magnitude Distance**: A novel distance metric based on similarity matrices and magnitude theory
- **Multi-scale Training**: Automatic Scaling scheduling for optimal learning
- **Comprehensive Experiments**: Validation on MNIST, CIFAR-10, and CelebA datasets

---

## Environment Setup

### Prerequisites
- Python 3.11.11
- CUDA-capable GPU (recommended)
- Conda or pip for package management

### Installation

1. **Using Conda (Recommended)**:
```bash
# Create environment from provided YAML file
conda env create -f env_mag.yaml
conda activate env_mag
```

2. **Manual Installation**:
```bash
# Create a new conda environment
conda create -n env_mag python=3.11.11
conda activate env_mag

# Install PyTorch with CUDA support
conda install pytorch=2.0.1 torchvision torchaudio pytorch-cuda=11.8 -c pytorch -c nvidia

# Install other dependencies
conda install numpy=1.24.3 h5py=3.10.0 matplotlib=3.7.1 scikit-learn=1.3.2 pandas jupyter python-lmdb -c conda-forge

# Install additional packages via pip
pip install tensorflow==2.12.0 rbo
```

### Dependencies
- **PyTorch** 2.0.1 (with CUDA)
- **TensorFlow** 2.12.0 (for Inception Score computation)
- **NumPy** 1.24.3
- **Matplotlib** 3.7.1
- **Scikit-learn** 1.3.2
- **h5py** 3.10.0
- **pandas**
- **jupyter**
- **rbo** (for ranking-based overlap metrics)

---

## Project Structure

```
Magnitude-Distance/
├── magnitude.py                    # Core magnitude distance implementations
├── model_loader.py                # Model loading and saving utilities
├── plot_gan_training.py           # Visualization utilities for training
├── env_mag.yaml                   # Conda environment specification
│
├── MNIST_Experiment/              # MNIST experiments
│   ├── MagGN_mnist.py            # MagGN training for MNIST
│   ├── GANs_mnist.py             # Baseline GAN implementations
│   ├── args_maggn_mnist.py       # MagGN hyperparameters
│   ├── args_mnist.py             # Standard GAN hyperparameters
│   ├── MagGN_multiscale_mnist_experiments.ipynb  # Multi-scale experiments
│   ├── GAN_mnist_experiments.ipynb               # Standard GAN experiments
│   ├── GAN-GP_mnist_experiments.ipynb            # WGAN-GP experiments
│   └── architectures/
│       ├── __init__.py
│       └── mlp.py                # MLP generator/discriminator architectures
│
├── Cifar10_Experiment/            # CIFAR-10 experiments
│   ├── MagGN_Cifar.py            # MagGN training for CIFAR-10
│   ├── GANs_Cifar.py             # Baseline GAN implementations
│   ├── args_GAN_Cifar.py         # Hyperparameters
│   ├── Inception_score.py        # Inception Score computation
│   ├── MagGN_multiscale_Cifar_experiments.ipynb
│   ├── GAN_Cifar_experiments.ipynb
│   ├── GAN-GP_Cifar_experiments.ipynb
│   ├── Cifar_Inception_Score_experiments.ipynb
│   └── architectures/
│       ├── __init__.py
│       └── conv.py               # CNN generator/discriminator architectures
│
├── CelebA_Experiment/             # CelebA experiments
│   ├── MagGN_CelebA.py           # MagGN training for CelebA
│   ├── GANs_CelebA.py            # Baseline GAN implementations
│   ├── Inception_score.py        # Inception Score computation
│   ├── MagGN_multiscale_CelebA_experiments.ipynb
│   ├── GAN_CelebA_experiments.ipynb
│   ├── GAN-GP_CelebA_experiments.ipynb
│   ├── CelebA_Inception_Score_experiments.ipynb
│   └── architectures/
│       ├── __init__.py
│       └── conv.py               # CNN architectures for CelebA
│
├── Outlier/                       # Outlier robustness experiments
│   ├── experiment.ipynb          # Main outlier experiments notebook
│   ├── robustness_plot.py       # Plotting utilities for robustness analysis
│   └── outlier_experiment_results.npy  # Pre-computed results
│
├── HighDimensionalityCurse/      # High-dimensional performance analysis
│   ├── high_dimensionality_curse_experiments.ipynb  # Main experiments
│   ├── high_dimensionality_distances.py             # Distance computations
│   └── experiments_meandiff_2.0/                    # Experiment results
│       ├── plots/                                    # Generated plots
│       │   ├── mean_dims2-1000_n500_trials100.png
│       │   ├── mean_log_dims2-1000_n500_trials100.png
│       │   ├── std_dims2-1000_n500_trials100.png
│       │   ├── cv_dims2-1000_n500_trials100.png
│       │   └── errorbar_dims2-1000_n500_trials100.png
│       └── *.json                                    # Results for different configurations
│
├── outlier_robustness.ipynb      # 2D outlier robustness visualization
└── analysis_distributions_magnitude_distances_over_t.ipynb  # t-parameter analysis
```

---

## Main Experiments

### 1. MagGN for MNIST, CIFAR-10, and CelebA

These experiments demonstrate the effectiveness of magnitude-based training on standard image generation benchmarks.

#### MNIST Experiments

**Location**: `MNIST_Experiment/`

**Key Files**:
- `MagGN_mnist.py` - Main MagGN implementation
- `MagGN_multiscale_mnist_experiments.ipynb` - Multi-scale training experiments
- `GAN_mnist_experiments.ipynb` - Baseline comparisons
- `GAN-GP_mnist_experiments.ipynb` - WGAN-GP comparisons

**Running the Experiments**:

```python
# In a Jupyter notebook or Python script
from MNIST_Experiment.MagGN_mnist import GAN
import torch

# Initialize MagGN
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
gan = GAN(
    batch_size=64,
    lr=0.00005,
    latent_dim=100,
    device=device,
    name='MagGN_mlp',
    dataset_name='MNIST'
)

# Configure Scaling scheduling
t_schedule_config = {
    'initial_t': 1.0,
    'schedule_type': 'exponential',  # or 'linear', 'constant'
    'decay_rate': 0.95,
    'min_t': 0.01
}

# Train the model
gan.train_MagGN(
    n_epochs=200,
    batch_size=64,
    normalize=False,
    t_schedule_config=t_schedule_config
)
```

**Expected Results**:
- Stable training without mode collapse
- High-quality digit generation
- Comparable or better Inception Scores than baseline GANs

#### CIFAR-10 Experiments

**Location**: `Cifar10_Experiment/`

**Key Files**:
- `MagGN_Cifar.py` - Main implementation with CNN architectures
- `MagGN_multiscale_Cifar_experiments.ipynb` - Multi-scale experiments
- `Inception_score.py` - Quantitative evaluation
- `Cifar_Inception_Score_experiments.ipynb` - IS comparisons

**Running the Experiments**:

```python
from Cifar10_Experiment.MagGN_Cifar import GAN
import torch

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
gan = GAN(
    batch_size=64,
    lr=0.0002,
    latent_dim=100,
    img_size=32,
    channels=3,
    device=device,
    name='MagGN_conv',
    dataset_name='CIFAR10'
)

# Train with auto-scheduled Scaling
t_schedule_config = {
    'initial_t': 1.0,
    'schedule_type': 'auto',
    'adaptation_rate': 0.1
}

gan.train_MagGN(
    n_epochs=500,
    batch_size=64,
    normalize=True,
    t_schedule_config=t_schedule_config
)
```

#### CelebA Experiments

**Location**: `CelebA_Experiment/`

**Key Files**:
- `MagGN_CelebA.py` - Implementation for face generation
- `MagGN_multiscale_CelebA_experiments.ipynb` - Multi-scale training
- `CelebA_Inception_Score_experiments.ipynb` - Quality assessment

**Running the Experiments**:

Similar to CIFAR-10 but with CelebA-specific parameters:
- Image size: 64×64 or 128×128
- Batch size: 32-64 (depending on GPU memory)
- Training epochs: 100-200

**Features Evaluated**:
- Visual quality of generated faces
- Diversity of generated samples
- Stability of training process
- Inception Score comparisons with WGAN-GP and standard GANs

---

### 2. Outlier Robustness (2D Example)

This experiment demonstrates the robustness of magnitude distance to outliers compared to traditional distance metrics.

**Location**: `outlier_robustness.ipynb` and `Outlier/`

**Key Concepts**:
- Compare magnitude distance with:
  - Wasserstein distance
  - Maximum Mean Discrepancy (MMD)
  - Euclidean distance
- Test on 2D Gaussian mixtures with varying outlier percentages
- Visualize how different metrics respond to outliers

**Running the Experiment**:

```bash
# Open the main notebook
jupyter notebook outlier_robustness.ipynb
```

**What to Expect**:

1. **Setup**: Two 2D Gaussian distributions are created
2. **Outlier Injection**: Progressively add outliers (0% to 50%)
3. **Distance Computation**: Calculate all distance metrics
4. **Visualization**: Plot showing:
   - Original distributions
   - Distributions with outliers
   - Distance metric behavior as outlier % increases

**Plots Generated**:
- Location: `Outlier/` folder
- Shows comparative robustness of different distance metrics
- Magnitude distance demonstrates superior stability

**Key Results**:
- Magnitude distance is **less sensitive** to outliers
- Traditional metrics (Wasserstein, MMD) show **higher variance**
- Provides theoretical justification for using magnitude in noisy real-world data

---

### 3. High Dimensional Performance

This experiment analyzes how magnitude distance performs as dimensionality increases, addressing the curse of dimensionality.

**Location**: `HighDimensionalityCurse/`

**Key Files**:
- `high_dimensionality_curse_experiments.ipynb` - Main experiments
- `high_dimensionality_distances.py` - Distance computation implementations
- `experiments_meandiff_2.0/` - Pre-computed results
- `experiments_meandiff_2.0/plots/` - Generated visualizations

**Experimental Setup**:

- **Dimensions tested**: 2 to 1000
- **Sample size**: n=500 per distribution
- **Trials**: 100 Monte Carlo trials
- **Distance metrics compared**:
  - Magnitude distance (various t values: 0.01, 0.1, auto)
  - Maximum Mean Discrepancy (MMD with different σ)
  - Wasserstein distance

**Running the Experiments**:

```bash
# Open the experiments notebook
jupyter notebook HighDimensionalityCurse/high_dimensionality_curse_experiments.ipynb
```

**Running Custom High-Dimensional Tests**:

```python
from HighDimensionalityCurse.high_dimensionality_distances import *
import torch
import numpy as np

# Set parameters
dims = [2, 10, 50, 100, 500, 1000]
n_samples = 500
n_trials = 100
mean_diff = 2.0
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

results = {}
for dim in dims:
    print(f"Testing dimension: {dim}")
    mag_distances = []
    wass_distances = []
    mmd_distances = []
    
    for trial in range(n_trials):
        # Generate two Gaussian distributions
        dist1 = np.random.randn(n_samples, dim)
        dist2 = np.random.randn(n_samples, dim) + mean_diff
        
        # Convert to torch tensors
        points1 = torch.from_numpy(dist1).float().to(device)
        points2 = torch.from_numpy(dist2).float().to(device)
        
        # Compute distances
        mag_dist = compute_magnitude_distance(points1, points2, device, t='auto')
        wass_dist = compute_wasserstein_distance(dist1, dist2)
        mmd_dist = compute_mmd(points1, points2)
        
        mag_distances.append(mag_dist)
        wass_distances.append(wass_dist)
        mmd_distances.append(mmd_dist)
    
    results[dim] = {
        'magnitude': {'mean': np.mean(mag_distances), 'std': np.std(mag_distances)},
        'wasserstein': {'mean': np.mean(wass_distances), 'std': np.std(wass_distances)},
        'mmd': {'mean': np.mean(mmd_distances), 'std': np.std(mmd_distances)}
    }

# Plot results
import matplotlib.pyplot as plt
dimensions = list(results.keys())
mag_means = [results[d]['magnitude']['mean'] for d in dimensions]
wass_means = [results[d]['wasserstein']['mean'] for d in dimensions]
mmd_means = [results[d]['mmd']['mean'] for d in dimensions]

plt.figure(figsize=(10, 6))
plt.plot(dimensions, mag_means, 'o-', label='Magnitude Distance')
plt.plot(dimensions, wass_means, 's-', label='Wasserstein Distance')
plt.plot(dimensions, mmd_means, '^-', label='MMD')
plt.xlabel('Dimension')
plt.ylabel('Distance')
plt.title('Distance Metrics vs. Dimensionality')
plt.legend()
plt.grid(True)
plt.savefig('high_dim_comparison.png')
plt.show()
```

**Pre-computed Results**:

The `experiments_meandiff_2.0/` folder contains results for:
- Different t values (0.01, 0.1, auto, sqrt_auto)
- Different dimensions (2-1000)
- Multiple trials (100 per configuration)

**Visualization Plots**:

Located in `experiments_meandiff_2.0/plots/`:

1. **`mean_dims2-1000_n500_trials100.png`**: 
   - Mean distance vs. dimension
   - Shows how each metric scales

2. **`mean_log_dims2-1000_n500_trials100.png`**:
   - Log-scale version for better visualization

3. **`std_dims2-1000_n500_trials100.png`**:
   - Standard deviation vs. dimension
   - Shows stability of each metric

4. **`cv_dims2-1000_n500_trials100.png`**:
   - Coefficient of variation (CV = std/mean)
   - Relative stability comparison

5. **`errorbar_dims2-1000_n500_trials100.png`**:
   - Mean ± standard error
   - Comprehensive comparison


---

## Core Implementation

### magnitude.py

The core file implementing magnitude-based distance computations.

**Key Functions**:

#### Distance Metrics

```python
# Basic magnitude distance between two distributions
def diff_magnitude_distance(points0, points1, device, points_all=None, t=1):
    """
    Compute magnitude distance between two point sets.
    
    Args:
        points0: First point set [N, D]
        points1: Second point set [M, D]
        device: torch device
        points_all: Optional pre-computed union of points
        t: Scaling parameter (controls scale)
    
    Returns:
        Distance value (scalar)
    """
```

```python
# Normalized magnitude distance with gradients
def norm_diff_magnitude_distance_grad(points0, points1, device, points_all=None, 
                                      t=1, normalize=False, eps=1e-5):
    """
    Differentiable normalized magnitude distance for training.
    
    Args:
        points0: First point set [N, D]
        points1: Second point set [M, D]
        device: torch device
        points_all: Optional union
        t: Scaling parameter
        normalize: Whether to normalize by total magnitude
        eps: Numerical stability constant
    
    Returns:
        torch.Tensor: Distance with gradient support
    """
```

```python
# Magnitude overlap (similarity measure)
def norm_magnitude_overlap_grad(points0, points1, device, points_all=None, 
                               t=1, normalize=False, eps=1e-5):
    """
    Compute magnitude overlap between distributions.
    Higher values indicate more similarity.
    """
```

#### Similarity Matrix

```python
def similarity_matrix(points, t=1):
    """
    Compute exp(-t * ||x_i - x_j||) similarity matrix.
    
    Args:
        points: Point set [N, D]
        t: Scaling/scale parameter
    
    Returns:
        Similarity matrix [N, N]
    """
```

#### Magnitude Computation

```python
def magnitude(points, device, t=1):
    """
    Compute magnitude of a point set.
    Magnitude = sum of inverse similarity matrix.
    
    Args:
        points: Point set [N, D]
        device: torch device
        t: Scaling parameter
    
    Returns:
        Magnitude value (scalar)
    """
```

#### Weight Functions

```python
def weightof_points(points, device, t=1):
    """
    Compute magnitude weights for each point.
    Weights sum similarity-adjusted contributions.
    
    Args:
        points: Point set [N, D]
        device: torch device
        t: Scaling parameter
    
    Returns:
        Weight vector [N]
    """
```

### Scaling Scheduling

Scaling parameter `t` controls the scale of similarity:
- **Low t** (e.g., 0.01): Local structure emphasis
- **High t** (e.g., 10): Global structure emphasis
- **Auto**: Automatically adapt based on data scale

**Implementation in MagGN**:

```python
def schedule_t(self, epoch, config):
    """
    Schedule Scaling parameter during training.
    
    Config types:
    - 'constant': Fixed t value
    - 'linear': Linear decay
    - 'exponential': Exponential decay
    - 'auto': Data-adaptive scheduling
    """
    if config['schedule_type'] == 'constant':
        return config['initial_t']
    
    elif config['schedule_type'] == 'exponential':
        t = config['initial_t'] * (config['decay_rate'] ** epoch)
        return max(t, config.get('min_t', 0.01))
    
    elif config['schedule_type'] == 'linear':
        t = config['initial_t'] - (epoch / self.n_epochs) * config['decay_amount']
        return max(t, config.get('min_t', 0.01))
    
    elif config['schedule_type'] == 'auto':
        # Compute data-adaptive t based on distribution scale
        return self._compute_adaptive_t(epoch)
```

### Model Architecture

**Generator** (from `architectures/mlp.py` or `architectures/conv.py`):
```python
class Generator(nn.Module):
    def __init__(self, latent_dim, img_shape):
        super(Generator, self).__init__()
        # Architecture depends on dataset
        # MLP for MNIST, CNN for CIFAR-10/CelebA
```

**No Discriminator**: MagGN replaces the discriminator with magnitude distance computation.

---

## Usage Examples

### Training a New Model

```python
from MNIST_Experiment.MagGN_mnist import GAN
import torch

# Setup
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
gan = GAN(
    batch_size=64,
    lr=0.00005,
    latent_dim=100,
    device=device,
    name='MagGN_mlp',
    dataset_name='MNIST'
)

# Configure training
t_schedule_config = {
    'initial_t': 1.0,
    'schedule_type': 'exponential',
    'decay_rate': 0.95,
    'min_t': 0.01
}

# Train
results = gan.train_MagGN(
    n_epochs=200,
    batch_size=64,
    normalize=False,
    t_schedule_config=t_schedule_config
)
```

### Generating Images

```python
# Generate samples
z = torch.randn(64, 100, device=device)
generated_images = gan.G(z)

# Visualize
import matplotlib.pyplot as plt
import torchvision.utils as vutils

plt.figure(figsize=(8, 8))
plt.imshow(vutils.make_grid(generated_images.cpu(), normalize=True, nrow=8).permute(1, 2, 0))
plt.axis('off')
plt.show()
```

### Computing Distance Between Distributions

```python
from magnitude import norm_diff_magnitude_distance_grad
import torch

# Create two distributions
real_data = torch.randn(1000, 784).to(device)  # Real images (flattened)
fake_data = torch.randn(1000, 784).to(device)  # Generated images

# Compute distance
distance = norm_diff_magnitude_distance_grad(
    real_data,
    fake_data,
    device=device,
    t=1.0,
    normalize=True
)

print(f"Distribution distance: {distance.item():.4f}")
```

### Evaluating Model Quality

```python
from Cifar10_Experiment.Inception_score import inception_score

# Generate samples
num_samples = 10000
generated_imgs = []
for i in range(num_samples // 64):
    z = torch.randn(64, 100, device=device)
    imgs = gan.G(z)
    generated_imgs.append(imgs.cpu())

generated_imgs = torch.cat(generated_imgs, dim=0)

# Compute Inception Score
is_mean, is_std = inception_score(generated_imgs, cuda=True, batch_size=32, resize=True)
print(f"Inception Score: {is_mean:.2f} ± {is_std:.2f}")
```

---

## Advanced Features

### Multi-Scale Training

Train with multiple Scaling values simultaneously:

```python
t_list = [0.1, 0.5, 1.0, 5.0]
for t in t_list:
    loss += norm_diff_magnitude_distance_grad(real, fake, device, t=t)
loss = loss / len(t_list)
```

### Adaptive Scaling

Automatically adjust Scaling based on gradient magnitude:

```python
def adaptive_t(self, epoch, gradient_norm):
    if gradient_norm < threshold:
        return self.t * 1.1  # Increase scale
    elif gradient_norm > upper_threshold:
        return self.t * 0.9  # Decrease scale
    return self.t
```

### Magnitude-Based Regularization

Add magnitude-based regularization to standard GANs:

```python
# Standard GAN loss
adv_loss = criterion(discriminator(fake), real_labels)

# Add magnitude regularization
mag_reg = norm_diff_magnitude_distance_grad(real, fake, device, t=1.0)
total_loss = adv_loss + lambda_mag * mag_reg
```

---

## Troubleshooting

### CUDA Out of Memory

```python
# Reduce batch size
batch_size = 32  # or 16

# Use gradient accumulation
accumulation_steps = 4
for i, (imgs, _) in enumerate(dataloader):
    loss = compute_loss(imgs) / accumulation_steps
    loss.backward()
    if (i + 1) % accumulation_steps == 0:
        optimizer.step()
        optimizer.zero_grad()
```

### Training Instability

```python
# Adjust learning rate
lr = 0.00001  # Lower learning rate

# Use gradient clipping
torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)

# Adjust Scaling scheduling
t_schedule_config = {
    'initial_t': 0.1,  # Start with smaller scale
    'schedule_type': 'exponential',
    'decay_rate': 0.99,  # Slower decay
    'min_t': 0.001
}
```

### Poor Generation Quality

```python
# Increase training epochs
n_epochs = 500

# Try different architectures
# For MNIST: try 'mlp'
# For CIFAR-10/CelebA: try different CNN architectures

# Adjust normalization
normalize = True  # Enable normalization in magnitude distance

# Try different t values
t_schedule_config = {
    'initial_t': 5.0,  # Start with larger scale
    'schedule_type': 'linear',
    'decay_amount': 4.9,
    'min_t': 0.1
}
```

