# AutoRPA

**From Exploration to Automation: AI Agents Automatically Generate Reusable RPA Code**

This project implements a general-purpose automated RPA code generation framework that can automatically generate, verify, and execute RPA code across multiple benchmark environments.

## Core Concept

1. **Exploration Phase**: Use GUI Agents (e.g., ReAct*) to explore task completion paths
2. **Building Phase**: Automatically convert exploration trajectories into reusable RPA code
3. **Verification Phase**: Validate RPA code generalization on multiple task instances
4. **Employment Phase**: Directly execute new tasks using verified RPA code

## Supported Environments

### 1. AndroidWorld
Android app automation, supporting 40+ real-world application tasks

### 2. WebArena  
Web page automation, supporting complex multi-step web operations

### 3. MiniWoB++
Lightweight web task automation, 100+ standardized test tasks

## Project Structure

```
AutoRPA/
├── androidworld/          # Android environment implementation
│   ├── autorpa/          # AutoRPA core code
│   ├── gui_agents/       # GUI Agent implementations (ReAct*, DroidRun, etc.)
│   ├── examples/         # Example run scripts
│   └── README.md
│
├── webarena/             # Web environment implementation
│   ├── UIAgents/         
│   │   └── Agent_RPA/    # RPA Agent core
│   └── main_run.py
│
└── miniwob/              # MiniWoB environment implementation
    ├── UIAgents/
    │   └── Agent_RPA/    # RPA Agent core
    └── main_run.py
```

## Quick Start

### AndroidWorld

```bash
cd androidworld

# Install dependencies
pip install -r requirements.txt

# Configure API Key
export OPENAI_API_KEY="your-api-key"

# Start Android emulator
~/Library/Android/sdk/emulator/emulator -avd AndroidWorldAvd -no-snapshot -grpc 8554

# Run AutoRPA (complete pipeline: exploration + generation + verification)
python main.py --agent_name=autorpa \
    --gui_agent_type=react_star \
    --num_tasks_to_explore=3 \
    --tasks=ContactsAddContact
```

### WebArena

```bash
cd webarena

# Run Building Phase (generate RPA code)
python main_run.py \
    --agent_name=agent_rpa \
    --num_tasks_to_explore=3 \
    --examine_sites=reddit

# Run Employment Phase (test RPA code)
python main_run.py \
    --test_rpa_mode=True \
    --to_test_tasks=0,1,2
```

### MiniWoB

```bash
cd miniwob

# Install dependencies
pip install -r requirements_autorpa.txt

# Run ReAct Agent
python main_run.py --agent_name agent_react
```

## Key Features

### 1. Intelligent Action Translation
Automatically converts hard-coded actions (e.g., `click(5)`) to soft-coded actions (e.g., `find_element(text="OK") + click`), improving code generalization

### 2. N-to-1 Verification
Tests RPA code on N task instances to ensure robustness and generalization

### 3. Automatic Repair Mechanism
When verification fails, the system automatically analyzes breakpoints and attempts repairs without manual intervention

### 4. Pluggable Architecture
Supports multiple GUI Agents (ReAct*, DroidRun, AskUI) with unified interfaces for flexible switching

### 5. Knowledge Base Management
- **RPA Bank**: Stores verified RPA code for reuse
- **Trajectory Bank**: Stores successful exploration trajectories, supporting cross-experiment reuse

## Technical Architecture

```
┌─────────────────────────────────────────────────────────┐
│  1. Exploration Phase                                   │
│     GUI Agent → StandardTrajectory                      │
└─────────────────────────────────────────────────────────┘
                         ↓
┌─────────────────────────────────────────────────────────┐
│  2. Translation Phase                                   │
│     ActionTranslator batch translates hard-coded actions│
└─────────────────────────────────────────────────────────┘
                         ↓
┌─────────────────────────────────────────────────────────┐
│  3. Generation Phase                                    │
│     RPA Builder → Reusable Python functions             │
└─────────────────────────────────────────────────────────┘
                         ↓
┌─────────────────────────────────────────────────────────┐
│  4. Verification Phase                                  │
│     N-to-1 Verification → RPA Bank                      │
└─────────────────────────────────────────────────────────┘
```
