# BACO: Base-Aligned Model Collaboration

This repository contains the implementation for **Base-Aligned Model Collaboration (BACO)**, an inference-time framework that dynamically combines base and aligned language models to optimize both diversity and quality in text generation.

## Overview

BACO addresses the diversity-quality trade-off in aligned LLMs through token-level routing between a base model (high diversity) and its aligned counterpart (high quality). The framework uses uncertainty-based routing strategies to determine which model to decode from at each generation step.

## Repository Structure

### Core Implementation
- `run_api.py` - Main experiment runner with BACO routing strategies
- `utils.py` - Core BACO implementation with routing mechanisms and model collaboration
- `interact.py` - Interactive demo interface for BACO
- `server.py` - API server for model hosting

### Routing Strategies
- **Uncertainty-based**: `top_prob`, `entropy_threshold`, `entropy_ratio`
- **Model-specific**: `top_prob_al`, `prob_ratio` 
- **Heuristic**: `textual_judge`

### Evaluation & Metrics
- `diversity_metrics.py` - Comprehensive diversity evaluation (lexical + semantic)
- `general_diversity_metrics.py` - Additional diversity measurements
- `perplexity.py` - Quality assessment via perplexity
- `uncertainty_quantification.py` - Routing decision metrics

### Datasets & Benchmarks
- `novelty-bench/` - Creative writing evaluation benchmark
- `novelty-bench-curated/` - Curated subset for standardized evaluation
- `wildchat/` - Dialogue evaluation data
- `dataset_utils.py` - Data loading and preprocessing utilities

### Utilities
- `preprocessing_utils.py` - Input formatting and prompt templates
- `postprocessing_utils.py` - Output cleaning and repetition handling
- `tokenization_utils.py` - Cross-model tokenization handling
- `text_splitter.py` - Text segmentation for analysis

### Experiment Management
- `ckpts/` - Checkpoint processing and result aggregation
- `scripts/` - Batch evaluation and analysis scripts
- `plots/` - Visualization and result plotting
- `human_eval/` - Human evaluation interface and data

## Quick Start

1. **Setup Environment**
   ```bash
   conda create --name baco python=3.11
   pip install vllm==0.6.2
   ```

2. **Host Models**
   ```bash
   # Base model (port 8123)
   python -m vllm.entrypoints.openai.api_server --model base_model_path --port 8123
   
   # Aligned model (port 8000) 
   python -m vllm.entrypoints.openai.api_server --model aligned_model_path --port 8000
   ```

3. **Run BACO**
   ```bash
   python run_api.py --dataset novelty-bench-curated --routing-method entropy_threshold --threshold 0.3
   ```

## Key Features

- **Token-level Collaboration**: Dynamic routing between base and aligned models at each generation step
- **Multiple Routing Strategies**: Family of uncertainty and heuristic-based routing methods
- **Controllable Trade-offs**: Fine-grained control over diversity-quality balance via threshold tuning
- **Comprehensive Evaluation**: 13 metrics across diversity and quality dimensions
- **Human Validation**: Interface for human evaluation of generated outputs
