# L-CUBE: Code for Paper Review

> **CONFIDENTIAL - FOR REVIEW ONLY**
>
> This code is submitted as supplementary material for paper review.
> It is the subject of a pending patent application and should not be shared or distributed.

## Code Structure

```
code/
├── generator.py              # Core L-CUBE data generation
├── train.py                  # Example training script
├── adapters/                 # Model adapters for continuous I/O
│   ├── base.py               # Common components
│   ├── gpt.py                # GPT-2
│   ├── gpt_neo_x.py          # GPT-NeoX/Pythia
│   ├── mamba.py              # Mamba (HuggingFace)
│   ├── mamba2.py             # Mamba2 (mamba_ssm)
│   ├── lla.py                # Logarithmic Linear Attention
│   ├── mistral.py            # Mistral (sliding window)
│   ├── rwkv.py               # RWKV
│   └── qwen.py               # Qwen3
└── architectures/            # Custom flex_attention implementations
    ├── gpt2_flex_attn.py
    ├── gpt_neo_x_flex_attn.py
    ├── mistral_flex_attn.py
    └── qwen_flex_attn.py
```

## Basic Usage

### Generating L-CUBE Data

```python
import torch
from generator import multi_tree_layers, gen_streach_matrix2

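# The original example targets CUDA; the CPU fallback assumes generator.py also runs on CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")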
tree_op = gen_streach_matrix2(1., 1., 1, 0.2).to(device) * (4 / 3.2) ** 0.5

y, y_mu, y_sigma, _, _ = multi_tree_layers(
    tree_op, m=3, n=4, num_layers=5, device=device, return_more=False
)
# y: generated sequences; y_mu, y_sigma: parameters of the ground-truth conditionals
```
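
One possible use of the ground-truth conditionals is as a reference loss: if the conditionals are Gaussian with mean `y_mu` and standard deviation `y_sigma` (an assumption; check `generator.py` to confirm that `y_sigma` is not a variance), then the negative log-likelihood of `y` under them is the value a perfect model would reach in expectation. The sketch below is not part of the repository. The helper name `gaussian_nll` and the toy values are hypothetical, and it uses plain Python floats so it runs without a GPU or the repository code.

```python
import math


def gaussian_nll(y, mu, sigma):
    """Mean negative log-likelihood of y under N(mu, sigma^2), elementwise.

    Assumption: sigma holds standard deviations, not variances.
    """
    if not (len(y) == len(mu) == len(sigma)):
        raise ValueError("y, mu, and sigma must have the same length")
    total = 0.0
    for y_t, mu_t, sigma_t in zip(y, mu, sigma):
        z = (y_t - mu_t) / sigma_t  # standardized residual
        total += 0.5 * z * z + math.log(sigma_t) + 0.5 * math.log(2.0 * math.pi)
    return total / len(y)


# Toy values standing in for one flattened sequence (hypothetical, not generator output).
y_toy = [0.1, -0.4, 1.2]
mu_toy = [0.0, -0.5, 1.0]
sigma_toy = [1.0, 0.5, 2.0]

reference_loss = gaussian_nll(y_toy, mu_toy, sigma_toy)
print(f"reference NLL: {reference_loss:.4f}")
```

With tensors from `multi_tree_layers`, the same formula can be applied elementwise and averaged. Comparing a trained model's loss against this reference shows how far the model is from the ground-truth predictor.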

### Training

```bash
python train.py --model_name openai-community/gpt2 --seq_len 1024 --num_steps 10000
```
