# Robustness Tree Experiments

This directory contains an alternative implementation of the robustness experiments with additional features and optimizations for tree-based models. This code is used for some experiments in **Section 6.1 and Appendix D.2**.

## Overview

This implementation provides:
- Optimized tree-based adversarial robustness evaluation
- Support for various tree ensemble methods
- Flexible configuration system via YAML files
- Efficient experiment management and result tracking

## Directory Structure

```
robustness_tree/
├── config/                      # Configuration files
│   ├── metrics_0_05.yaml       # 5% attack strength config
│   ├── metrics_0_1.yaml        # 10% attack strength config
│   └── metrics_0_3.yaml        # 30% attack strength config
├── module/                      # Core modules
│   ├── __init__.py
│   ├── datasets.py             # Dataset handling
│   ├── experiment.py           # Experiment management
│   └── model/                  # Model implementations
├── run_exp.py                  # Main experiment runner
└── run_result.py               # Results evaluation and plotting
```

## Running the Experiments

### Basic Usage

Run a single robustness experiment:

```bash
python run_exp.py \
  --dataset compas \
  --encoder threshold_guess \
  --metrics_config config/metrics_0_1.yaml \
  --output_dir out/ \
  --skip_confirm \
  --random_state 10 \
  --retrain \
  --reset_results \
  --fold 0 \
  RSET \
  --rset_eps 0.03
```

### Command Line Arguments

| Argument | Description | Default |
|----------|-------------|---------|
| `--dataset` | Dataset name (compas, fico, adult, etc.) | Required |
| `--encoder` | Tree encoding method | `threshold_guess` |
| `--metrics_config` | Path to metrics YAML config | Required |
| `--output_dir` | Directory for saving results | `out/` |
| `--random_state` | Random seed | `10` |
| `--fold` | Cross-validation fold number | `0` |
| `--retrain` | Force retraining of models | Flag |
| `--reset_results` | Clear previous results | Flag |
| `--skip_confirm` | Skip confirmation prompts | Flag |

### Model Types

Specify the model type as a positional argument:

- `RSET` - Rashomon set with epsilon parameter
- `SINGLE` - Single decision tree model
- `RF` - Random forest ensemble

For Rashomon sets, add:
```bash
RSET --rset_eps 0.03
```

### Configuration Files

YAML configuration files specify attack parameters:

**Example: `config/metrics_0_1.yaml`**
```yaml
attack_strength: 0.1
attack_type: linf
max_iterations: 1000
learning_rate: 0.01
```

Available configurations:
- `metrics_0_05.yaml` - 5% perturbation budget
- `metrics_0_1.yaml` - 10% perturbation budget
- `metrics_0_3.yaml` - 30% perturbation budget

## Evaluating Results

After running experiments, evaluate and plot results:

```bash
python run_result.py \
  --output_dir out/ \
  --dataset_list compas,fico,adult
```

This generates:
- Aggregated statistics across experiments
- Comparison plots (accuracy, robustness, trade-offs)
- CSV files with numerical results
- Publication-ready figures

## Output Structure

Results are organized as:

```
out/
├── compas/
│   ├── fold_0/
│   │   ├── rset_eps_0.03/
│   │   │   ├── model.pkl
│   │   │   ├── results.json
│   │   │   └── metrics.csv
│   │   └── single/
│   └── fold_1/
└── plots/
    ├── accuracy_comparison.pdf
    └── robustness_tradeoff.pdf
```

## Running Multiple Experiments

### Sequential Execution

```bash
# Test different epsilon values
for eps in 0.01 0.03 0.05; do
    python run_exp.py --dataset compas --metrics_config config/metrics_0_1.yaml \
        --output_dir out/ --skip_confirm RSET --rset_eps $eps
done
```

### Parallel Execution

```bash
# Run multiple datasets in parallel
parallel -j 4 python run_exp.py --dataset {} --metrics_config config/metrics_0_1.yaml \
    --output_dir out/ --skip_confirm RSET --rset_eps 0.03 \
    ::: compas fico adult default_credit
```

## Advanced Features

### Cross-Validation

Run experiments across multiple folds:

```bash
for fold in {0..4}; do
    python run_exp.py --dataset compas --metrics_config config/metrics_0_1.yaml \
        --output_dir out/ --fold $fold RSET --rset_eps 0.03
done

# Aggregate results across folds
python run_result.py --output_dir out/ --aggregate_folds
```

### Custom Encoders

Available encoder types:
- `threshold_guess` - GOSDT-based threshold guessing
- `cart` - CART decision tree
- `optimal` - Optimal decision tree

### Experiment Management

Flags for managing experiments:

- `--retrain` - Force retraining even if model exists
- `--reset_results` - Clear previous results before running
- `--skip_confirm` - Skip confirmation prompts for automation
- `--verbose` - Enable detailed logging

## Performance Optimization

For faster experiments:

1. **Use cached models**: Omit `--retrain` to reuse trained models
2. **Reduce iterations**: Edit attack iterations in config files
3. **Parallelize**: Run multiple experiments simultaneously
4. **Use smaller folds**: Test with single fold before full cross-validation

## Differences from `robustness/` Directory

This implementation offers:
- ✓ YAML-based configuration system
- ✓ Better experiment tracking and organization
- ✓ Optimized tree attack implementations
- ✓ Integrated result evaluation
- ✓ Cross-validation support

The `robustness/` directory provides:
- ✓ More attack variants
- ✓ Sequential attack analysis
- ✓ Ensemble-on-ensemble experiments

Both implementations produce comparable results for the main paper experiments.

## Troubleshooting

**Issue**: Configuration file not found
- **Solution**: Ensure you're running from the `robustness_tree/` directory or provide absolute path

**Issue**: Out of memory during Rashomon set generation
- **Solution**: Reduce `--rset_eps` or use a smaller dataset subset

**Issue**: Results directory conflicts
- **Solution**: Use `--reset_results` to clear previous runs or choose a different `--output_dir`

## Related Directories

- `robustness/` - Alternative implementation with additional features
- `stability/` - Stability analysis experiments

## Notes

- This implementation is optimized for tree-based models
- For neural network experiments, see the `rob_priv_nn/` directory
- Results should be consistent with those from the `robustness/` directory
