# AutoRPA User Guide

This document contains configuration instructions, advanced features, and troubleshooting for AutoRPA.

---

## 🛠️ Complete Parameter Reference

### Experiment Mode

```bash
--agent_name=autorpa              # 'autorpa' (full pipeline) or 'gui-agent' (exploration only)
--test_rpa_mode=False             # Test mode (Employment Phase)
```

### GUI Agent Configuration

```bash
--gui_agent_type=react_star       # 'react_star' / 'droidrun' / 'askui'
--reflection_rounds=2             # [React*] Reflection rounds (1-3 recommended)

# React* specific
--react_star_action_space=index   # 'index' / 'coordinate'
--react_star_ui_info=screenshot_with_tree  # UI info mode
--enable_shell_action=False     # Enable shell commands

# DroidRun specific
--droidrun_config_path=config.yaml
--droidrun_timeout=600
```

### LLM Model Configuration

```bash
--default_llm=gpt-5-low           # Default model (all components)

# Dedicated models (optional overrides)
--builder_llm=gpt-5-low           # RPA Builder
--grounder_llm=claude-sonnet-4-5  # UI element grounding (recommend non-thinking model, e.g. gemini-2.5-pro-nothinking)
--actiontranslator_llm=gpt-5-low  # Action translation

# React* specific (only when gui_agent_type=react_star)
--planner_llm=claude-sonnet-4-5   # Planner
--summarizer_llm=gpt-5-low        # Summarizer
```

**Available models**: `gpt-4o`, `gpt-4.1`, `gpt-5-low`, `gemini-2.5-pro-thinking`, `claude-sonnet-4-5`

**Model selection recommendations**:
- **Builder/Planner**: Recommend thinking models (`claude-sonnet-4-5`, `gpt-5-low`) — complex reasoning needed
- **Grounder**: Recommend non-thinking models (`gemini-2.5-pro-nothinking`, `gpt-4o`) — fast matching
- **Summarizer/Translator**: Balanced model (`gpt-5-low`) — cost-effective

### Task Configuration

```bash
--num_tasks_to_explore=3          # Number of task instances to explore (1-10 recommended)
--max_attempts_per_task=3         # Max attempts per task
--tasks=None                      # Task list:
                                  #   - None: all tasks
                                  #   - subset: 40 predefined tasks (TASKS_SUBSET)
                                  #   - chosen: 38 stable tasks (excludes OsmAndTrack, SystemBrightnessMax)
                                  #   - rag_test: 22 RAG test tasks
                                  #   - TaskName1,TaskName2: custom task list
--task_random_seed=30             # Random seed
--fixed_task_seed=False           # Use identical task parameters
```

**Task subset description** (defined in `configs/task_subsets.py`):
- **subset** (40 tasks): Main evaluation set
- **chosen** (38 tasks): Excludes 2 unstable tasks, recommended
- **rag_test** (22 tasks): Tasks where React succeeded but RPA generation failed, for testing improvements

### Action Generation Configuration

```bash
--use_action_translator=True      # Enable action translation (strongly recommended)
--enable_shell_action=False  # Enable shell commands (exploration + building)
--use_fetch_info=True             # Builder can fetch detailed UI info
```

### Knowledge Bank Configuration

```bash
--update_rpa_bank=True            # Save newly generated RPAs
--use_react_trajs_bank=False      # Use existing trajectories
--update_react_trajs_bank=True    # Save successful trajectories
```

**Note**: The `load_rpa_bank` parameter has been removed. In test mode (`--test_rpa_mode=True`), the RPA bank is loaded automatically.

#### ReAct Trajs Bank — Trajectory Reuse Details

**Parameters**:

| Parameter | Default | Description |
|-----------|---------|-------------|
| `--use_react_trajs_bank` | `False` | Whether to load and use existing exploration trajectories |
| `--update_react_trajs_bank` | `True` | Whether to save new trajectories to the bank |

**Behavior**:

1. **Check existing trajectories**: On startup, check `data/react_trajs_bank.json`
2. **Smart reuse**:
   - Exact match: Prefer trajectory for current instance_id
   - Fallback: If current instance has no trajectory, use a successful trajectory from another instance with the same task_goal (shortest step count)
3. **Auto exploration**: If no usable trajectory exists, run exploration and save

**Usage scenarios**:

```bash
# Scenario 1: Full reuse (recommended)
python main.py \
  --use_react_trajs_bank=True \
  --update_react_trajs_bank=True \
  --tasks=CameraTakePhoto

# Scenario 2: Read-only (test existing RPA)
python main.py \
  --use_react_trajs_bank=True \
  --update_react_trajs_bank=False \
  --tasks=CameraTakePhoto

# Scenario 3: Force re-exploration (better trajectories)
python main.py \
  --use_react_trajs_bank=False \
  --update_react_trajs_bank=True \
  --tasks=CameraTakePhoto
```

**Example output**:

```bash
# When using existing trajectory
✅ ReAct Traj Exists, skip running ReAct Agent.
🔧 Batch Action Translation (Before RPA Builder)...

# When using fallback
🔄 No trajectory found for instance 2, using instance 1 trajectory (5 steps) as fallback

# When no trajectory exists
⚠️  ReAct Traj Doesn't Exist, running ReAct Agent.
[Starting exploration...]
```

**Quick start script**:

```bash
# Use existing trajectories
cd examples
./run_autorpa.sh
```

### Logging Configuration

```bash
--log_level=INFO                  # DEBUG/INFO/WARNING/ERROR/CRITICAL
--enable_llm_logging=False        # Save LLM call details
```

**Logging**:
- Console: Filtered by `--log_level`
- File: Always records all levels (including DEBUG)

**Usage**:
```bash
# Development / debugging
python main.py --log_level=DEBUG --tasks=ContactsAddContact

# Production (avoid console spam)
python main.py --log_level=INFO --tasks=subset --num_tasks_to_explore=10

# Troubleshooting: run with INFO first, then inspect DEBUG in log file
grep -A 10 -B 10 "ERROR" log/autorpa_xxx/output.log
```

---

## 🔧 Shell Action Support

### Overview

Shell actions let the agent run ADB shell commands and access system-level features.

### Configuration

#### GUI Agent Shell Action (Exploration Phase)

```bash
python main.py \
    --agent_name=gui-agent \
    --gui_agent_type=react_star \
    --react_star_ui_info=screenshot_with_tree \
    --enable_shell_action \
    --tasks=ContactsAddContact
```

**Effect**: Exposes `shell()` in the React* Agent action space.

#### AutoRPA Shell Action (Full Pipeline)

```bash
python main.py \
    --agent_name=autorpa \
    --gui_agent_type=react_star \
    --react_star_ui_info=screenshot_with_tree \
    --enable_shell_action \
    --tasks=ContactsAddContact
```

**Effect**: Enables shell execution in exploration and shell translation in building.

### Configuration Summary

| enable_shell_action | Result | Use case |
|---------------------|--------|----------|
| ✅ Enabled | ✅ Full shell support (exploration + building) | System-level operations |
| ❌ Disabled | ✅ Default: no shell | Most tasks |

**Note**: `enable_shell_action` controls shell support for the whole pipeline; no separate flags needed.

### Usage Rules

**Priority**: Use shell only as a last resort; prefer dedicated actions.

✅ Correct:
```python
open_app("Contacts")
click(x, y)
input_text("text")
```

❌ Avoid:
```python
shell("am start -n ...")
shell("input tap x y")
shell("input text ...")
```

**Suitable cases**:
- System info: `shell("dumpsys battery")`, `shell("wm size")`
- Package management: `shell("pm list packages | grep contacts")`
- Settings: `shell("settings get system screen_brightness")`

### Smart Translation

**Needs translation** (hardcoded parameters):
```python
# Exploration phase
shell("settings put system screen_brightness 50")

# After building
brightness = 50  # Extracted from task params
env_op.shell(f"settings put system screen_brightness {brightness}")
```

**No translation** (pure queries):
```python
shell("dumpsys battery")   # Leave as is
shell("pm list packages")  # Leave as is
```

### Shell Return Value Support

AutoRPA’s `env_op.shell()` **returns command output**. You can use it in RPA code, and it is passed to the Planner.

#### Basic usage

```python
# Query — use return value
result = env_op.shell("ls -d /sdcard/Notifications 2>&1")

# Execution — return value can be ignored
env_op.shell("mkdir -p /sdcard/Notifications")
```

#### Examples

```python
# 1. Conditional check
result = env_op.shell("ls -d /sdcard/Notifications 2>&1")
if "No such file" in result:
    env_op.shell("mkdir -p /sdcard/Notifications")
    print("Folder created")
else:
    print("Folder already exists")

# 2. Get system setting
brightness = env_op.shell("settings get system screen_brightness")
current_value = int(brightness.strip())
if current_value < 100:
    env_op.shell("settings put system screen_brightness 200")

# 3. Dynamic file handling
files = env_op.shell("ls /sdcard/Download")
if "report.pdf" in files:
    # Open target file
    env_op.open_app("Files")
    # ...

# 4. File count
count_str = env_op.shell("ls /sdcard/Download | wc -l")
file_count = int(count_str.strip())
print(f"Total files: {file_count}")
```

#### Passed to Planner

Shell output is **recorded in `action_history`** for the Planner:

```
Step-1:
Executed action: env_op.shell("ls /sdcard/Download")
Action has been performed.
Shell output: file1.txt
file2.pdf
report.doc
```

**Benefit**: Planner can see command output and adjust the next plan.

#### Best practices

✅ **Recommended**:
- File/dir existence: `[ -f /path ] && echo 'yes' || echo 'no'`
- System settings: `settings get system ...`
- Dynamic content: `ls`, `find`, `grep`
- Use `.strip()` on return value
- Use `2>&1` to capture errors: `ls /path 2>&1`

⚠️ **Notes**:
- Return value is a string
- Some commands may return None
- Avoid frequent shell calls in loops

📚 **Full example**: See `examples/test_shell_return_value.py`

---

## 🤖 Core Components

### Step Budget

AutoRPA assigns step budget by task complexity:

**Formula**: `max_steps = min(task.complexity * 10 + 5, 50)`

| Complexity | Steps | Example |
|------------|-------|---------|
| 1.0 | 15 | Simple settings |
| 2.0 | 25 | Add contact |
| 3.0 | 35 | Create calendar event |
| 4.0 | 45 | Complex file ops |
| 5.0 | 50 (cap) | Advanced multi-step |

**Usage**:
- **Exploration**: Full budget
- **RPA verification**: Full budget
- **Fix React**: Remaining budget (total − steps already used by RPA)

**Log example**:
```
[Exploration] 📊 Max steps calculation for task 'ContactsAddContact': 
complexity=2.0 → steps=20+5=25 → final=25 (capped at 50)

🔧 Fix React: Total budget=25, RPA executed=10, Remaining=15
```

This keeps simple tasks from wasting steps and gives complex tasks enough budget.

---

### Agent_RPA

Location: `autorpa/core/agent_rpa.py`

**Responsibilities**:
- RPA execution (`Execute_rpa_code`, `rpa_testing`)
- Action translation (`ActionTranslator_Agent`, `batch_translate_actions`)
- Task summarization (`Concluder_Agent`)
- Breakpoint analysis (`Breakpoint_Analyzer_Agent`)

### RPA_Builder_Agent

Location: `autorpa/core/rpa_builder.py`

**Features**:
- Generate RPA code from trajectories
- `fetch_info` for detailed UI info
- Syntax check and retry (up to 2 times)

### GUI Agents

#### React* Agent
- **Style**: Planner + Summarizer loop, multi-round reflection
- **Best for**: Medium-complexity tasks
- **Location**: `gui_agents/react_star/adapter.py`

#### DroidRun Adapter
- **Style**: Strong reasoning, external tools
- **Best for**: Complex multi-step tasks
- **Requires**: DroidRun + Portal setup
- **Location**: `gui_agents/droidrun/adapter.py`

---

## 📖 Typical Workflows

### Workflow 1: Generate New RPA (Standard)

**Purpose**: Full AutoRPA — explore → build → verify → save.

```bash
python main.py \
    --agent_name=autorpa \
    --gui_agent_type=react_star \
    --react_star_ui_info=screenshot_with_tree \
    --num_tasks_to_explore=3 \
    --tasks=ContactsAddContact \
    --update_rpa_bank=True \
    --use_react_trajs_bank=True \
    --update_react_trajs_bank=True \
    --enable_shell_action=True
```

**Notes**:
- **Pipeline**: Explore 3 instances → generate RPA → verify → save to `data/rpa_bank.json`
- **Traj reuse** (`use_react_trajs_bank=True`): Skip exploration if trajectories exist; else explore
- **Traj save** (`update_react_trajs_bank=True`): Save successful trajectories under `data/.../react_trajs_bank.json`
- **Shell** (`enable_shell_action=True`): For tasks that need system-level ops

---

### Workflow 2: Test Existing RPA

**Purpose**: Test existing RPA on different task instances.

```bash
python main.py \
    --agent_name=autorpa \
    --test_rpa_mode=True \
    --to_test_tasks=[0]
```

**Notes**:
- Loads existing RPAs from `data/rpa_bank.json`
- Tests specified instances (`to_test_tasks=[0]` = instance index 0)
- Produces test report (success rate, runtime, etc.)
- No new RPA generation (`update_rpa_bank` has no effect here)

---

### Workflow 3: ReAct* Only (Exploration)

**Purpose**: Run only React* Agent to collect trajectories for later analysis or RPA generation.

```bash
python main.py \
    --agent_name=gui-agent \
    --gui_agent_type=react_star \
    --react_star_ui_info=screenshot_with_tree \
    --reflection_rounds=0 \
    --use_react_trajs_bank=True \
    --update_react_trajs_bank=True \
    --tasks=ContactsAddContact
```

**Notes**:
- **Exploration only**: No RPA code generation
- **Reflection**: `reflection_rounds=0` = single pass; 1–3 for multi-round reflection
- **Traj reuse/save**: Same as above
- With `agent_name=gui-agent`, `run_react_test_tasks=True` is set and `to_test_tasks` is used

---

### Workflow 4: React* vs RPA Comparison

**Purpose**: Compare React* and RPA on the same instances.

```bash
# Step 1: Run React* Agent
python main.py \
    --agent_name=gui-agent \
    --gui_agent_type=react_star \
    --react_star_ui_info=screenshot_with_tree \
    --run_react_test_tasks=True \
    --to_test_tasks=[0,1,2] \
    --tasks=ContactsAddContact

# Step 2: Run RPA test (same instances)
python main.py \
    --agent_name=autorpa \
    --test_rpa_mode=True \
    --to_test_tasks=[0,1,2] \
    --tasks=ContactsAddContact
```

**Notes**:
- Same instances (`to_test_tasks=[0,1,2]`) for both modes
- Compare success rate, runtime, step count
- Use to validate that RPA matches or exceeds exploration performance

---

### Workflow 5: Quick Test (Single Instance)

**Purpose**: Quickly check that setup works; useful for development.

```bash
python main.py \
    --agent_name=autorpa \
    --num_tasks_to_explore=1 \
    --tasks=ContactsAddContact
```

**Notes**:
- One instance per task type
- Good for debugging and config checks
- ⚠️ RPA may not generalize well (only 1 instance); prefer `num_tasks_to_explore=3` or more for production

---

### Workflow 6: Verbose Logging (Debug)

**Purpose**: Log all LLM calls for analysis and debugging.

```bash
python main.py \
    --agent_name=autorpa \
    --enable_llm_logging=True \
    --log_level=DEBUG \
    --tasks=ContactsAddContact
```

**Notes**:
- **LLM logs** (`enable_llm_logging=True`): Saves prompts, responses, images, metadata under `log_folder_exp/llm_calls/`
- **Console** (`log_level=DEBUG`): Shows DEBUG logs
- ⚠️ Can create many log files; turn off in production

---

## 📖 Other Run Examples

### GUI Agent Exploration (DroidRun)

```bash
python main.py \
    --agent_name=gui-agent \
    --gui_agent_type=droidrun \
    --droidrun_config_path=configs/droidrun_config.yaml \
    --tasks=ContactsAddContact
```

### Generate RPA from Existing Trajectories (Manual Params)

```bash
python main.py \
    --agent_name=autorpa \
    --gui_agent_type=react_star \
    --react_star_ui_info=screenshot_with_tree \
    --tasks=CameraTakePhoto \
    --use_react_trajs_bank=True \
    --update_react_trajs_bank=True
```

**Effect**:
- ✅ If trajectories exist → go straight to ActionTranslator + RPA Builder
- 🔄 If current instance has no trajectory → fallback to other instance’s trajectory
- ⚠️ If no trajectory at all → run exploration and save

---

## ⚙️ Environment Operations (EnvOperation)

Location: `autorpa/env_operation.py`

### Basic Operations

```python
# Apps and navigation
env_op.open_app(app_name)              # Open app
env_op.go_back()                       # Back
env_op.go_home()                       # Home
env_op.wait()                          # Wait for screen to settle

# UI interaction
env_op.click(index)                    # Click by index
env_op.click(x, y)                     # Click by coordinates
env_op.long_press(index)               # Long press
env_op.input_text(text, index, clear_text=True)  # Input text
env_op.swipe(direction, index)         # Swipe (direction: 'up'/'down'/'left'/'right')
env_op.keyboard_enter()                # Enter key

# Shell (when enabled)
env_op.shell(command)                  # Run ADB shell command

# Task end
env_op.answer(text)                    # Answer task (e.g. MiniWoB)
env_op.stop(goal_status)               # Mark task done/failed
```

### Element Lookup

`find_element()` uses **two stages**: attribute filter → LLM Grounder for precise match.

```python
# Example 1: Attribute match
kwargs = {
    'text': 'Settings',                      # Text (exact)
    'is_clickable': True,
    'class_name': 'android.widget.TextView',
    'target_description': 'Settings button'   # Semantic hint for LLM (required)
}
index = env_op.find_element(**kwargs)
assert index != -1, "Element not found!"
env_op.click(index)

# Example 2: Disambiguation when multiple elements match
index = env_op.find_element(
    is_clickable=True,
    target_description='Add-contact plus button'  # LLM uses this to pick one
)
env_op.click(index)
```

**Filter parameters**:
- `text` — element text (exact)
- `content_description` — accessibility description
- `resource_id` — resource id
- `class_name` — view class
- `is_clickable`, `is_scrollable`, `is_checkable`, `is_checked`, `is_enabled`
- `target_description` — semantic description (required for LLM disambiguation)

**Flow**:
1. Filter candidates by attributes
2. If exactly 1 → return it
3. If > 1 → LLM Grounder picks by `target_description`
4. If 0 → LLM Grounder searches over all elements

### RPA-Specific Tools

```python
# MLLM question (info from current screen)
answer = env_op.ask_mllm("What is the email address shown on this page?")
# Returns: "john@example.com"

# UI elements as list of dicts
ui_elements = env_op.get_ui_content()
# Returns: [{'index': 0, 'text': 'Settings', 'is_clickable': True, ...}, ...]
```

---

## 📊 Output Layout

```
logs/autorpa_YYYY-MM-DD_HH-MM-SS_model/
├── output.log                    # Main log (all levels including DEBUG)
├── exploration_result.csv        # Exploration results
├── step_tokens.csv               # Token stats
├── temp_rpa.json                 # Temporary RPA bank
└── TaskType/
    └── Exploration/
        └── task_1/
            └── attempt_1/
                ├── ReAct/                 # GUI Agent exploration
                │   ├── round_0/            # No reflection
                │   ├── round_1/            # With reflection
                │   └── standard_trajectory.json
                ├── RPABuilder/             # RPA generation
                │   ├── rpa_builder_prompt_0.txt
                │   ├── rpa_builder_output_0.txt
                │   └── screenshot_0.png    # fetch_info
                ├── RPAFitExec/             # Verification
                │   └── task_1/
                └── BreakpointAnalyzer/    # Breakpoint analysis
```

---

## 📦 Data Models

### StandardTrajectory

```python
{
  "task_type": "ContactsAddContact",
  "task_goal": "Add a contact...",
  "success": true,
  "raw_steps": [...],          # Layer 1: full raw trajectory
  "action_history": {...},     # Layer 2: simplified action sequence
  "reflection_data": {...},    # Layer 3: high-level reflection
  "agent_name": "react_star",
  "duration": 45.2
}
```

### RPAInfo

```python
{
  "task_type": "ContactsAddContact",
  "parameters": "- name (str): full name\n- phone (str): phone number",
  "rpa_description": "Add a new contact in the Contacts app",
  "rpa_code": "def add_contact(name: str = None, phone: str = None):\n  ...",
  "example_usage": "add_contact(name='John', phone='1234567890')",
  "conclusion": "Successfully generated RPA code"
}
```

---

## 🔌 Adding a New GUI Agent

### Step 1: Implement the interface

```python
from gui_agents.interfaces import BaseGUIAgent, StandardTrajectory

class MyAgent(BaseGUIAgent):
    @property
    def agent_name(self):
        return "my_agent"
    
    @property
    def environment_type(self):
        return "android"
    
    async def explore_task(self, task, env_op, max_steps, **kwargs):
        # Your exploration logic
        ...
        return StandardTrajectory(...)
```

### Step 2: Create factory and register

```python
# gui_agents/my_agent/factory.py
from gui_agents.registry import register_agent
from .adapter import MyAgent

@register_agent('my_agent')
def create_my_agent(**kwargs):
    return MyAgent(**kwargs)
```

### Step 3: Import and run

```python
# gui_agents/my_agent/__init__.py
from . import factory  # Triggers registration

# Usage
python main.py --gui_agent_type=my_agent
```

---

## 🔍 Troubleshooting

### DroidRun Import Error

```bash
pip install 'autorpa[droidrun]'
```

### Portal Not Enabled

Settings → Accessibility → DroidRun Portal → Enable

### Element Not Found

Check that `target_description` is set and clearly describes the element.

### Configuration Warnings

On startup, the program validates config and may show:
- ❌ **Error**: Must fix before running
- ⚠️ **Warning**: Can continue, but consider changing

### Log Inspection

```bash
# Find ERROR logs
grep "ERROR" log/autorpa_*/output.log

# Per-component logs
grep "Planner" log/autorpa_*/output.log
grep "RPA Builder" log/autorpa_*/output.log

# Errors with context
grep -A 10 -B 10 "ERROR" log/autorpa_*/output.log

# Token usage
cat log/autorpa_*/step_tokens.csv | column -t -s,
```

### Result Statistics

```bash
cd log/autorpa_YYYY-MM-DD_HH-MM-SS_model/

# Success rate
python -c "
import pandas as pd
df = pd.read_csv('exploration_result.csv')
success_cols = [c for c in df.columns if c.startswith('task_')]
success_rate = df[success_cols].apply(lambda x: (x=='1').sum() / len(x)).mean()
print(f'Average success rate: {success_rate:.2%}')
"

# Token cost
python -c "
import pandas as pd
df = pd.read_csv('step_tokens.csv')
total = df['prompt_tokens'].sum() + df['completion_tokens'].sum()
print(f'Total tokens: {total:,}')
print('\nTokens per component:')
print(df.groupby('agent')[['prompt_tokens', 'completion_tokens']].sum())
"
```

---

## 💡 Best Practices

### Exploration
- Use React* for simpler tasks, DroidRun for complex ones
- Enable `use_action_translator` for better generalization
- Use 2–3 reflection rounds to improve success rate

### Recommended LLM Setup

**Quality-first**:
```bash
python main.py --agent_name=autorpa \
    --planner_llm=claude-sonnet-4-5 \
    --builder_llm=claude-sonnet-4-5 \
    --grounder_llm=gpt-4o \
    --summarizer_llm=gpt-5-low
```

**Balanced** (recommended):
```bash
python main.py --agent_name=autorpa \
    --planner_llm=claude-sonnet-4-5 \
    --builder_llm=gpt-5-low \
    --grounder_llm=gemini-2.5-pro-nothinking \
    --summarizer_llm=gpt-5-low
```

**Cost-first**:
```bash
python main.py --agent_name=autorpa --default_llm=gpt-5-low
```

### Performance Tuning

**Reduce token usage**:
```bash
--summarizer_llm=gpt-5-low
--reflection_rounds=1
--enable_llm_logging=False
--use_react_trajs_bank=True
```

**Increase success rate**:
```bash
--use_action_translator=True
--planner_llm=claude-sonnet-4-5
--reflection_rounds=2
--max_attempts_per_task=5
```

### Other Tips
- N-to-1 reverse verification improves robustness
- Periodically clean log directories; keep important RPA banks
- Use `--tasks=chosen` for the 38 stable tasks

---

## 🌍 Android World Environment

### Setup

```bash
# 1. SQLite with FTS (required for Joplin tasks)
conda install -c conda-forge sqlite

# 2. Environment variables (or set in main.py)
export OPENAI_API_KEY="your-api-key"
export OPENAI_URL="https://api.openai.com/v1"  # Optional: proxy API

# 3. ADB path (auto-detected or set manually)
export ADB_PATH="~/Library/Android/sdk/platform-tools/adb"
```

### Emulator

```bash
EMULATOR_NAME=AndroidWorldAvd
~/Library/Android/sdk/emulator/emulator -avd $EMULATOR_NAME -no-snapshot -grpc 8554

# Check connection
adb devices
# Expect: emulator-5554  device

# List agents
python list_agents.py
```

### Common Tasks

```python
# Files
FilesDeleteFile, FilesMoveFile

# Calendar (Simple Calendar)
SimpleCalendarAddOneEventRelativeDay, SimpleCalendarDeleteEvents

# Notes (Markor)
MarkorCreateNote, MarkorEditNote, MarkorTranscribeReceipt

# Contacts
ContactsNewContactDraft

# System
SystemWifiTurnOn, SystemBluetoothTurnOn, SystemBrightnessMax

# Tasks.org
TasksHighPriorityTasks, TasksIncompleteTasksOnDate

# Media
AudioRecorderRecordAudio, CameraTakePhoto

# Full list
python main.py --tasks=subset --help
```

**Task sets**:
- **40 tasks** (`--tasks=subset`): Full evaluation set
- **38 tasks** (`--tasks=chosen`): Stable set (recommended)
- **22 tasks** (`--tasks=rag_test`): RAG test set

---

## 💾 Data Management

### RPA Bank (data/rpa_bank.json)

```json
{
  "TaskType": {
    "task_type": "TaskType",
    "rpa_code": "def task_function(...):\n  ...",
    "parameters": "...",
    "example_usage": "task_function(...)",
    "based_on_task": 2,
    "verified_tasks": ["task_1", "task_2", "task_3"]
  }
}
```

### ReAct Trajs Bank (data/react_trajs_bank.json)

Stores exploration trajectories (both successful and failed) for reuse during RPA generation.

**Structure**:
```json
{
  "task_type": {
    "task_goal": [
      {
        "task": "task_goal",
        "traj": [...],
        "success": true,
        "instance_id": 1,
        "timestamp": "2025-12-11_20-00-00",
        "num_steps": 5
      }
    ]
  }
}
```

**Features**:

1. **Quality-based trajectory management**
   - Prefer successful trajectories
   - If all success, keep the one with fewer steps
   - If all fail, keep existing

2. **Centralized screenshots**
   - Path: `data/react_trajs_banks/{agent_type}/{config_id}/screenshots/{task_type}/task{instance_id}/`
   - Naming: `step{step_n}_{before|after|before_som|after_som}.png`
   - Relative paths supported; resolved from project root

3. **Overwrite backups**
   - **JSON**: `data/backups/react_trajs_bank_{timestamp}.json`
   - **Screenshots**: `data/backups/screenshots_{timestamp}/`
   - Timestamp is from run start; multiple saves in one run overwrite the same backup
   - Backups are kept until manually removed

4. **Incremental updates**
   - Add trajectories for new instances across runs
   - Update existing instance if new trajectory is better

5. **Trajectory lookup fallback**
   - **Priority 1**: Exact match for current instance_id
   - **Priority 2**: If current instance has no trajectory, use another instance’s successful trajectory for the same task_goal (shortest steps)
   - Useful for tasks like CameraTakePhoto where all instances share one task_goal

**task_goal vs instance_id**:

Most task types have different task_goal per instance (e.g. "Add contact Alice", "Add contact Bob"). Some (e.g. CameraTakePhoto) share one task_goal for all instances ("Take one photo."). Both are handled via the `instance_id` metadata.

**Usage**:

See the "ReAct Trajs Bank — Trajectory Reuse Details" section under Knowledge Bank Configuration.

Quick example:
```bash
# Reuse trajectories (skip exploration)
cd examples
./run_autorpa.sh

# Or with explicit flags
python main.py --use_react_trajs_bank=True --tasks=CameraTakePhoto
```

---

**Last updated**: 2025-12-11
