# Scripts Directory

This directory contains all executable scripts for the Agent Cognitive Attack framework.

## 📁 Directory Structure

```
scripts/
├── experiment/      # Experiment execution scripts
├── analysis/        # Analysis and fitting scripts
└── config/          # Configuration management
```

---

## 🚀 Experiment Scripts (`scripts/experiment/`)

### 1. `run_experiments.py` ⭐ RECOMMENDED
**Purpose**: Batch experiment execution across multiple models

**Features**:
- Load configuration from `config/experiments.json`
- Parallel execution with ThreadPoolExecutor
- Per-model concurrency control
- Mock mode for testing
- Automatic result saving

**Usage**:
```bash
# Using JSON config
python scripts/experiment/run_experiments.py

# With command line overrides
python scripts/experiment/run_experiments.py --mock --max_instructions 2 --trials 3

# Custom config file
python scripts/experiment/run_experiments.py --config my_config.json
```

**Output**: `logs/jailbreak/{model_name}/{timestamp}.csv`

---

### 2. `run_pipeline.py`
**Purpose**: Complete workflow - experiment + analysis in one command

**Features**:
- Runs experiments
- Automatically analyzes results
- Generates visualizations
- Fits cognitive parameters

**Usage**:
```bash
python scripts/experiment/run_pipeline.py \
    --source siliconflow \
    --model deepseek-ai/DeepSeek-R1 \
    --trials 10 \
    --mock
```

---

### 3. `run_mab.py`
**Purpose**: Single model MAB experiment

**Features**:
- Direct experiment execution
- Single instruction testing
- Flexible parameters

**Usage**:
```bash
python scripts/experiment/run_mab.py \
    --source siliconflow \
    --model deepseek-ai/DeepSeek-R1 \
    --instruction "Your prompt" \
    --trials 50
```

---

### 4. `MAB_safety.py` (Legacy)
**Purpose**: Original experiment script (backward compatible)

**Note**: Still works but `run_experiments.py` is recommended for new projects.

---

## 📊 Analysis Scripts (`scripts/analysis/`)

### 1. `analyze.py` ⭐ RECOMMENDED
**Purpose**: Unified analysis entry point

**Features**:
- Metrics calculation (ASR, NTF, JRS, etc.)
- Drift plots (temporal dynamics)
- Radar charts (cognitive fingerprints)
- Batch processing support

**Usage**:
```bash
# Single folder - all analysis types
python scripts/analysis/analyze.py --folder logs/jailbreak/DeepSeek-R1/ --type all

# Specific analysis
python scripts/analysis/analyze.py --folder logs/jailbreak/DeepSeek-R1/ --type metrics
python scripts/analysis/analyze.py --folder logs/jailbreak/DeepSeek-R1/ --type drift
python scripts/analysis/analyze.py --folder logs/jailbreak/DeepSeek-R1/ --type radar

# Batch analysis
python scripts/analysis/analyze.py --folder logs/jailbreak/ --type all
```

**Output**:
- `logs/analysis/{model}/metrics.csv`
- `logs/analysis/{model}/drift.png`
- `logs/analysis/{model}/radar.png`

---

### 2. `fit_params.py`
**Purpose**: Cognitive parameter fitting

**Features**:
- NLL optimization
- Multiple fitting strategies
- Extended parameter mode
- Batch processing

**Usage**:
```bash
# Single model
python scripts/analysis/fit_params.py --folder logs/jailbreak/DeepSeek-R1/

# Batch all models
python scripts/analysis/fit_params.py --folder logs/jailbreak/

# Extended parameters
python scripts/analysis/fit_params.py --folder logs/jailbreak/ --extended
```

**Output**: `logs/analysis/{model}/cognitive_report.csv`

---

### 3. `bias_parameter_fitter.py` (Legacy)
**Purpose**: Original parameter fitting script (backward compatible)

**Note**: Use `fit_params.py` for new projects.

---

## ⚙️ Configuration Scripts (`scripts/config/`)

### `config_manager.py`
**Purpose**: CLI tool for managing environment configuration

**Commands**:
```bash
# View current configuration
python scripts/config/config_manager.py show

# Check configuration completeness
python scripts/config/config_manager.py check

# List supported platforms
python scripts/config/config_manager.py platforms

# Initialize .env file
python scripts/config/config_manager.py init
```

---

## 📋 Quick Reference

| Script | Purpose | When to Use |
|--------|---------|-------------|
| `run_experiments.py` | Batch experiments | Production experiments |
| `run_pipeline.py` | Full workflow | Quick start |
| `run_mab.py` | Single experiment | Debugging |
| `analyze.py` | Analysis | After experiments |
| `fit_params.py` | Parameter fitting | Cognitive modeling |
| `config_manager.py` | Configuration | Setup |

---

## 🎯 Typical Workflow

```bash
# 1. Configure
python scripts/config/config_manager.py check

# 2. Run experiments
python scripts/experiment/run_experiments.py --mock --max_instructions 2 --trials 3

# 3. Analyze results
python scripts/analysis/analyze.py --folder logs/jailbreak/DeepSeek-R1/ --type all

# 4. Fit parameters
python scripts/analysis/fit_params.py --folder logs/jailbreak/DeepSeek-R1/
```

---

## 🔧 Command Line Arguments

### Common Arguments
- `--mock`: Use mock responses (no API calls)
- `--trials N`: Number of trials per instruction
- `--max_instructions N`: Limit number of instructions
- `--source`: API platform (siliconflow, ollama, etc.)
- `--model`: Model name
- `--folder`: Path to results folder
- `--output_dir`: Output directory

### Experiment-Specific
- `--config`: Custom JSON config file
- `--max_workers`: Parallel workers
- `--instruction`: Single instruction string

### Analysis-Specific
- `--type`: Analysis type (metrics, drift, radar, all)
- `--extended`: Use extended parameter set

---

## 📊 Output Locations

**Experiments**:
- `logs/jailbreak/{model_name}/results_{timestamp}_{uuid}.csv`

**Analysis**:
- `logs/analysis/{model_name}/metrics.csv`
- `logs/analysis/{model_name}/drift.png`
- `logs/analysis/{model_name}/radar.png`
- `logs/analysis/{model_name}/cognitive_report.csv`

---

## 🔄 Backward Compatibility

All legacy scripts are preserved:
- `MAB_safety.py` → `scripts/experiment/MAB_safety.py`
- `bias_parameter_fitter.py` → `scripts/analysis/bias_parameter_fitter.py`

They can still be run from the root directory for backward compatibility.

---

## 💡 Tips

1. **Always use mock mode first**: `--mock` to test without API costs
2. **Check configuration**: Run `config_manager.py check` before experiments
3. **Use JSON config**: For complex experiments, use `config/experiments.json`
4. **Batch analysis**: Process multiple models at once with `--folder`
5. **Type all**: Use `--type all` for complete analysis in one command

---

**Last Updated**: 2025-12-29
