# AgentSteerTTS

Agentsteertts: Multi-Agent Closed-Loop Framework for Intent-Faithful Expressive TTS Control


## Project Structure

```
code/
├── agentsteertts/
│   ├── __init__.py           # 包入口
│   ├── model.py              # 核心模型 (ADM + DAC + Fast Agent)
│   ├── retrieval/            # DAC Retrieval Agent
│   │   ├── __init__.py
│   │   └── emotion_retrieval.py  # 情感原型检索系统
│   ├── agents/               # Fast-Slow Feedback Agents
│   │   ├── __init__.py
│   │   ├── supervisor.py     # Slow Agent (感知批评)
│   │   └── voice_clone.py    # 语音克隆流程控制
│   └── utils/                # 工具函数
│       ├── __init__.py
│       ├── audio.py          # 音频处理
│       └── emotion.py        # 情感检测
├── configs/
│   └── default.yaml          # 默认配置
├── data/                     # 测试数据
├── scripts/
│   └── test.py               # 测试脚本
├── requirements.txt
└── README.md
```

## Quick Start

```bash
# 安装依赖
pip install torch torchaudio numpy

# 运行所有测试
python scripts/test.py

# 测试特定模块
python scripts/test.py --module model
python scripts/test.py --module retrieval
python scripts/test.py --module agents

# 使用真实数据测试
python scripts/test.py --data_dir data
```

## Core Components

### 1. ADM (Adversarial Disentanglement Module)
```python
from agentsteertts import AgentSteerTTS

model = AgentSteerTTS()
# 计算 ADM 解耦损失
losses = model.compute_adm_losses(semantic_features, speaker_labels, emotion_labels)
# L_ADM = λ_adv·L_adv + λ_orth·L_orth
```

### 2. DAC (Dual-stream Anchoring Controller)
```python
from agentsteertts.retrieval import EmotionRetrievalSystem

# 检索情感原型
retriever = EmotionRetrievalSystem()
results = retriever.search_by_instruction("带着快乐的语气", top_k=5)

# 融合控制向量
# ẑ_emo = q_txt + (1-Σw_k)·q_mix
```

### 3. Fast-Slow Feedback
```python
from agentsteertts.agents import SupervisorAgent

# Supervisor 分析合成音频
supervisor = SupervisorAgent()
critique = supervisor.analyze_audio("output.wav", target_emotion="happy")

if supervisor.should_adjust_alpha(critique):
    new_alpha = supervisor.get_alpha_adjustment(critique, current_alpha)
```

