# NeuroMamba: A Comprehensive Framework for Development and Evaluation

This repository provides the complete source code and a comprehensive experimental framework for the **NeuroMamba** model. It serves as a one-stop-shop for researchers and developers, encompassing everything from the core model implementation and CUDA kernels to a full suite of diagnostic tasks, ablation studies, large-scale pretraining, and performance benchmarking.

## Key Features

-   **Modular Architecture**: The core logic of NeuroMamba is cleanly separated into a `neuromamba` module, with custom CUDA kernels located in `csrc`, and a Hugging Face compatible wrapper in `TF` for seamless integration.
-   **Comprehensive Evaluation Suite**: An extensive set of experiments located in the `4_experiment/` directory, designed to rigorously test the model's capabilities on synthetic tasks like Induction Heads, Selective Copying, and 2ACDC.
-   **Systematic Ablation Studies**: Dedicated frameworks for systematically removing key architectural components (`gc`, `y2`) to analyze their specific contributions to model performance.
-   **Robust Pretraining Pipeline**: A scalable pretraining script designed for large, multi-source streaming datasets, equipped with custom callbacks for fault tolerance and numerical stability.
-   **Performance Benchmarking**: A standardized script to measure key performance metrics such as training/inference throughput and memory usage, allowing for direct comparison against other architectures.

## Installation

The project requires two main components to be built locally: the `causal-conv1d` dependency and the `neuromamba` package itself.

### Step 1: Install causal-conv1d

Navigate to the `causal_conv1d` directory and run the following command to build and install the package.

```bash
cd causal_conv1d
pip install --no-build-isolation --no-binary causal-conv1d -v .
```

### Step 2: Install NeuroMamba

Return to the project's root directory (`NEUMA`) and run the following command to install the NeuroMamba package in editable mode.

```bash
cd ..
pip install --no-build-isolation -v -e .
```
You can use the `rebuild.sh` script to automate this process.

## Project Structure

```
NEUMA/
├── 4_experiment/
│   ├── ablation_studies/
│   │   ├── 2ACDC/
│   │   ├── induction_heads/
│   │   └── selective_copying/
│   ├── foundational_capabilities/
│   │   ├── 2ACDC/
│   │   ├── induction_heads/
│   │   └── selective_copying/
│   └── real_world_validation/
│       ├── Pretrain/
│       └── benchmark_neuma.py
├── causal_conv1d/
├── csrc/
├── neuromamba/
├── TF/
├── pyproject.toml
├── README.md
├── rebuild.sh
└── setup.py
```

-   **`4_experiment/`**: The main directory containing all experiments. Each sub-directory is a self-contained study with its own detailed `README.md`.
-   **`causal_conv1d/`**: A required dependency providing an efficient implementation of causal convolutions.
-   **`csrc/`**: Contains the raw CUDA source code for the custom kernels used in the NeuroMamba architecture.
-   **`neuromamba/`**: The core implementation of the NeuroMamba modules, layers, and building blocks.
-   **`TF/`**: A wrapper that makes the NeuroMamba model compatible with the Hugging Face `transformers` library, enabling use of the `Trainer` API.
-   **`setup.py`**: The setup script for installing the `neuromamba` package.
-   **`rebuild.sh`**: A utility script to automate the rebuilding and installation of the local packages.

## The Experiment Suite (`4_experiment/`)

This directory is the heart of the evaluation framework, organized into three main categories:

### `foundational_capabilities/`
Contains baseline experiments to evaluate the core performance of the full NeuroMamba model on a range of synthetic tasks.

### `ablation_studies/`
Contains experiments designed to systematically remove components from the NeuroMamba architecture (e.g., the `gc` or `y2` pathways). This allows for a precise analysis of each component's contribution.

### `real_world_validation/`
Contains large-scale experiments that validate the model in more practical scenarios.
-   **`Pretrain/`**: A robust pipeline for pretraining the model on massive, multi-source streaming datasets.
-   **`benchmark_neuma.py`**: A script for measuring the training and inference performance (throughput, latency, memory).

**Each of these subdirectories contains its own detailed `README.md` file with specific instructions on how to configure and run the experiments.**

## How to Get Started

1.  **Clone the Repository**:
    ```bash
    git clone <repository-url>
    cd NEUMA
    ```

2.  **Install Dependencies**: Follow the steps in the **Installation** section to build and install the local packages.

3.  **Navigate to an Experiment**: Choose an experiment you wish to run. For example, to run the ablation study on the Induction Heads task:
    ```bash
    cd 4_experiment/ablation_studies/induction_heads/
    ```

4.  **Read the Local README**: Open and read the `README.md` file inside that directory. It will provide detailed instructions on how to configure and execute that specific experiment.

5.  **Run the Experiment**: Follow the instructions to launch the training script, for example:
    ```bash
    python train_ablation.py
    ```

## Benchmarking Performance

To measure the performance of your configured NeuroMamba model, you can use the provided benchmarking script.

1.  **Navigate** to the script's location.
2.  **Execute** the script:
    ```bash
    python 4_experiment/real_world_validation/benchmark_neuma.py
    ```
    This will run a standardized set of tests for both training and inference performance. The results will be saved to a `benchmark_results_neuma.json` file in the project's root directory, providing clear metrics on throughput and memory usage.