# ExpLearn: Dual-Loop Cognitive Architecture for LLM Agents

ExpLearn is a framework that implements a "Dual-Loop Cognitive Architecture" for LLM agents, designed to improve problem-solving capabilities (especially in mathematics) through experience learning. It features an **Inference Loop (System 1)** for rapid problem solving and an **Optimization Loop (System 2 / PTGD)** for experience extraction and refinement.

## 🚀 Quick Start

### 1. Setup Environment
```bash
# Create and activate virtual environment
python -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

# Configure environment variables
cp src/.env.example src/.env
# Edit src/.env with your API keys (OPENAI_API_KEY, etc.)
```

### 2. Run Initialization (Mining & Pruning)
Generate initial experiences from a training dataset and prune them for quality and diversity.
```bash
bash run_initialization.sh
```

### 3. Run Training (Optimization Loop)
Iteratively refine the experience pool using the Self-Taught Gradient Descent (STGD) approach.
```bash
bash run_train.sh
```

### 4. Run Evaluation (Inference Loop)
Evaluate the performance of the LLM with the learned experience library.
```bash
bash run_eval.sh
```

## 🏗 Project Structure

```text
.
├── src/
│   ├── agents/          # Autonomous agents (Solver, Retriever, Optimizer, etc.)
│   ├── core/            # Core logic (Inference Engine, Trainer, Experience Pool)
│   ├── tools/           # Utilities (Data loaders, Graders, Prompts)
│   └── .env.example     # Environment template
├── data/                # Dataset and experience storage
├── experiments/         # Experiment results and analysis scripts
├── logs/                # Execution logs
├── run_initialization.sh # Workflow: Initialization -> Regularization
├── run_train.sh          # Workflow: Iterative STGD Training
└── run_eval.sh           # Workflow: Inference & Evaluation
```

## 🧠 Key Components

### 1. Agents (`src/agents/`)
- **[SolverAgent](file:///Users/superlj666/Library/CloudStorage/OneDrive-个人/Writing/Writing_202512_ExpSearch/ExpLearn/src/agents/solver.py)**: System 1 reasoning agent that uses retrieved experiences.
- **[RetrievalAgent](file:///Users/superlj666/Library/CloudStorage/OneDrive-个人/Writing/Writing_202512_ExpSearch/ExpLearn/src/agents/retriever.py)**: Manages vector-based and subject-based experience retrieval.
- **[OptimizerAgent](file:///Users/superlj666/Library/CloudStorage/OneDrive-个人/Writing/Writing_202512_ExpSearch/ExpLearn/src/agents/optimizer.py)**: System 2 agent that extracts experiences from successful trajectories.
- **[RegularizerAgent](file:///Users/superlj666/Library/CloudStorage/OneDrive-个人/Writing/Writing_202512_ExpSearch/ExpLearn/src/agents/regularizer.py)**: Prunes and organizes the experience pool.
- **[InitializerAgent](file:///Users/superlj666/Library/CloudStorage/OneDrive-个人/Writing/Writing_202512_ExpSearch/ExpLearn/src/agents/initializer.py)**: Generates the initial seed experience pool.

### 2. Core Logic (`src/core/`)
- **[InferenceEngine](file:///Users/superlj666/Library/CloudStorage/OneDrive-个人/Writing/Writing_202512_ExpSearch/ExpLearn/src/core/inference.py)**: Orchestrates the inference loop.
- **[Trainer](file:///Users/superlj666/Library/CloudStorage/OneDrive-个人/Writing/Writing_202512_ExpSearch/ExpLearn/src/core/train.py)**: Manages the optimization loop and experience updates.
- **[ExperiencePool](file:///Users/superlj666/Library/CloudStorage/OneDrive-个人/Writing/Writing_202512_ExpSearch/ExpLearn/src/core/experience_pool.py)**: Data structure for storing and indexing experiences.

## 📊 Tracing & Monitoring
ExpLearn supports **Arize Phoenix** for real-time tracing of agentic workflows.
1. Enable `PHOENIX_ENABLE=1` in `src/.env`.
2. Start the Phoenix server:
   ```bash
   python -m phoenix.server.main serve
   ```
3. View the UI at `http://localhost:6006`.
