# SparseCache: Experiments in KV Cache Compression

This project provides a framework for experimenting with Retrieval-Augmented Generation (RAG) and includes different strategies for Key-Value (KV) cache management, including k-SVD compression.

## 1. Setup

First, set up your Python environment and install the required dependencies.

```bash
# It is recommended to use a virtual environment
python -m venv venv
source venv/bin/activate

# Install dependencies
pip install -r requirements.txt
```

## 2. Configuration

All experiment settings are managed through YAML files located in the `configs/` directory.

- `config_nocache.yaml`: Runs the baseline RAG pipeline without any KV caching.
- `config_kvcache.yaml`: Runs the RAG pipeline with a standard (uncompressed) KV cache.
- `config.yaml`: Runs the RAG pipeline with k-SVD compressed KV cache (non-batched reconstruction).
- `config_ksvd_batched.yaml`: Runs the RAG pipeline with k-SVD compressed KV cache (batched reconstruction).

Before running the k-SVD experiments, you must update `configs/config.yaml` and `configs/config_ksvd_batched.yaml` to provide the correct `DICT_FILE_PATH` for your pre-trained k-SVD dictionary.

## 3. Running Experiments

Convenience scripts are provided in the `scripts/` directory to run the full pipeline, which includes creating the necessary vector index and then running the experiment.

The scripts will intelligently skip index/cache creation if they already exist.

```bash
# To run the baseline (no KV cache) experiment:
./scripts/run_nocache.sh

# To run the standard KV cache experiment:
./scripts/run_kvcache.sh

# To run the k-SVD experiment (non-batched):
./scripts/run_ksvd.sh

# To run the batched k-SVD experiment:
./scripts/run_ksvd_batched.sh
```

### Forcing Cache Recreation

If you need to rebuild the vector index or KV cache from scratch, you can add the `--force` flag to the underlying Python script. The shell scripts do not pass this automatically.

Example:
```bash
python -m src.create_cache --config_file configs/config.yaml --force
```

## 4. Results

Experiment results, including performance metrics like Time-To-First-Token (TTFT), are saved as `.csv` files in the `results/` directory. A subdirectory is created for each experiment type.

## 5. Dictionary Training

In order to train dictionary, refer to the folder named `dictionary_training`
