# Prompts to Proxies (P2P): Emulating Human Preferences via a Compact LLM Ensemble

## Overview

Code for ICML 2026 submission: "Prompts to Proxies: Emulating Human Preferences via a Compact LLM Ensemble"


**Prompts to Proxies (P2P)** is a two-stage system for aligning LLM agent ensembles to human population preferences using survey response data.

The system constructs diverse agent profiles ("endowments")—each representing a distinct persona—and elicits survey responses from LLMs conditioned on those profiles. Through entropy-based active sampling, P2P iteratively expands the agent pool to span the latent preference space. It uses L1-regularized regression to select proxy agents based on observed ground truth data. No finetuning or demographic conditioning is required.

## Quick Start

### 1. Installation

```bash
pip install -r requirements.txt
```

Make sure you have:
- Python 3.10+
- API key in your environment

### 2. Run the Full Pipeline

P2P provides a unified script to run both stages of the experiment:

- **Stage 1**: Active generation of synthetic agent endowments and survey responses.
- **Stage 2**: Lasso/elastic net regression to select a compact agent ensemble based on observed data.

To run the full pipeline using a sample configuration:

```bash
python scripts/run_full_pipeline.py --config config/full_pipelines/example.yaml
```
This script will:

- Launch `run_endowment_generator.py` to generate agent profiles and LLM responses
- Create a `lasso_config.yaml` on the fly, linking outputs from the previous step
- Run `run_lasso_experiment.py` using the generated config to perform Lasso selection
- Save all results, logs, and configuration copies into a timestamped subdirectory under `outputs/`

### 3. Example Output

Upon completion, you will find:

- `outputs/ATP_w36_<timestamp>/responses.csv`: Matrix of survey responses  
- `outputs/ATP_w36_<timestamp>/endowments.csv`: Generated endowment metadata  
- `outputs/ATP_w36_<timestamp>/plots/*.png`: Diagnostic plots (e.g., entropy, diversity)  
- `outputs/ATP_w36_<timestamp>/reports/*.html`: Optional summary reports  
- `outputs/ATP_w36_<timestamp>/pipeline.log`: Execution log for debugging

To customize experiments, modify:

- The full pipeline config: `config/full_pipelines/example.yaml`  
- Component configs under:  
  - `config/surveys/`  
  - `config/attribute_banks/`  
  - `config/experiments/`

To run modular stages separately (e.g., for debugging or custom workflows), you may use:

- `scripts/run_endowment_generator.py`: For generating agent endowments and synthetic responses  
- `scripts/run_lasso_experiment.py`: For running the Lasso regression on generated data

### Full Pipeline Config Example

Below is a minimal example excerpt from a full pipeline configuration file (`config/full_pipelines/example.yaml`):

```yaml
metadata:
  name: "ATP_w36"
  description: "Generate 300 active endowments for ATP Wave 36 using GPT-4o-mini agents."
  seed: 101

paths:
  survey_csv: "data/w36/info.csv"
  survey_yaml: "config/surveys/american_trend_panel.yaml"
  attribute_bank: "config/attribute_banks/attribute_bank.yaml"
  aggregate_json: "data/w36/W36_aggregate_results.json"

generation:
  target_n: 300
  initial_n: 10
  num_update_steps: 10
  parallel: true
  max_workers: 30
```
See [`example.yaml`](config/full_pipelines/example.yaml) for the full configuration and documentation.

## Data

Processed data files are included in `data/` for review purposes.

Original sources:
- American Trends Panel (ATP): [OpinionsQA](https://github.com/tatsu-lab/opinionsqa)
- World Values Survey (WVS) Wave 7: Cao et al. (2025)

## Outputs

The `outputs/` folder is empty by default. Running the pipeline will populate it with results.

## Notes

- Full documentation and preprocessing scripts will be released upon acceptance.
- Figure generation code will be included in the camera-ready version.

## Repository Structure
```
├── agents/         # Agent classes
├── backends/       # LLM API backends (OpenAI, Gemini, etc.)
├── config/         # Configuration files (surveys, attribute banks, experiments)
├── data/           # Processed survey data (ATP, WVS)
├── experiments/    # Experiment module
├── generators/     # Endowment generation module
├── models/         # Regression models
├── modules/        # Other core modules
├── outputs/        # Generated results
├── scripts/        # Runner scripts (run_full_pipeline.py, etc.)
├── styles/         # Report styles
├── LICENSE
├── README.md
└── requirements.txt
```

## License

MIT License (author information redacted for anonymous review)