# Induction Heads Task with NeuroMamba

This repository provides a comprehensive and automated framework for testing the **NeuroMamba** model on the **Induction Heads** task. It includes a configurable data generator and two distinct workflows: running single, manually configured experiments, and launching a full, automated suite of tests across various model parameters and difficulty levels.

## Workflows

This framework supports two primary modes of operation:

1.  **Single Experiment Mode**: Ideal for quick tests or debugging. You manually set all parameters in a single configuration file and run one experiment at a time.
2.  **Automated Suite Mode**: Designed for comprehensive evaluation. This mode programmatically iterates through a predefined matrix of model parameters (e.g., `expand_gc`) and task difficulties, running each combination and saving the results automatically.

## Files

-   `config.py`: The central command center. Contains the **default** configurations for the training loop, dataset, and the NeuroMamba model architecture. It is used directly by `induction_heads_neuromamba.py`.
-   `data_generator.py`: A model-agnostic script that generates the complex, multi-level datasets for the Induction Heads task. It is used by both workflows.
-   `induction_heads_neuromamba.py`: The script for running a **single, standalone experiment**. It directly reads its settings from `config.py`.
-   `induction_heads_neuromamba_auto.py`: A refactored, modular version of the training script, designed to be imported and called by the automation script. **It is not intended to be run directly.**
-   `run_all_neuromamba_experiments.py`: The main **automation script**. It runs a full suite of experiments by systematically looping through different model parameters and difficulty levels defined within it.

## How to Run Experiments

### Option 1: Running a Single Experiment (Manual)

Use this method for a single, targeted test.

1.  **Configure**: Open `config.py` and manually set the desired `difficulty_level`, `level_4_noise_type`, `vocab_size`, and NeuroMamba parameters (`n_layer`, `d_model`, `expand_gc`, etc.).
2.  **Execute**: Run the standalone script from your terminal:
    ```bash
    python induction_heads_neuromamba.py
    ```
    This will execute one training run based on the exact settings you provided in `config.py`.

### Option 2: Running the Full Experiment Suite (Automated)

Use this method to perform a comprehensive, hands-off evaluation.

1.  **Configure**: Open `run_all_neuromamba_experiments.py`. Inside this file, you will find two lists at the top:
    -   `EXPAND_GC_VALUES`: Define the `expand_gc` values you want to test.
    -   `DIFFICULTY_LEVELS_TO_TEST`: Define the difficulty levels to test against.
2.  **Execute**: Run the automation script from your terminal:
    ```bash
    python run_all_neuromamba_experiments.py
    ```
    This will launch the entire suite, automatically running an experiment for every combination of `expand_gc` and difficulty. For the default setting (5 `expand_gc` values x 7 difficulties), it will run **35 experiments**.

## Results

All experiment results (both single and automated) are saved into a directory named `logs_neuromamba/`. The script generates detailed, timestamped CSV log files for both training and validation.

The intelligent filename convention allows for easy identification of each experiment's settings:

`ih_{level}_{params}_{arch_gc}_{timestamp}_{type}.csv`

A sample filename might be `ih_lv3_60K_2_64_gc2p5_20250717-103000_validation.csv`, which tells you:
-   **Task**: Induction Heads (`ih`)
-   **Difficulty**: Level 3 (`lv3`)
-   **Model Size**: ~60K parameters (`60K`)
-   **Architecture**: 2 layers, 64 dimension, **`expand_gc` of 2.5** (`2_64_gc2p5`)
-   **Timestamp**: The exact date and time of the run.
-   **Type**: `validation` log

You can monitor progress in the terminal, which will look similar to this:
```
2025-07-17 10:30:00,100 - INFO - Using device: cuda
2025-07-17 10:30:00,101 - INFO - Logging training metrics to: logs_neuromamba/ih_training_lv3_60K_2_64_gc2p5_20250717-103000.csv
2025-07-17 10:30:00,102 - INFO - Logging validation results to: logs_neuromamba/ih_validation_lv3_60K_2_64_gc2p5_20250717-103000.csv
...
Training:  4%|▍         | 8192/204800 [05:10<2:03:00, 26.31it/s, loss=0.0123, train_acc=98.50%]
```

## Acknowledgments

This work is built upon the concepts presented in the Mamba paper by Dao and Gu. We thank them for their foundational work and insightful implementation.