<!-- # GUI-Actor -->
1. Anonymous Huggingface checkpoint: anonymous-ckpt/GUI-AIMA-3B

2. Environment Installation:
```bash
cd GUI-AIMA
conda create -n gui_aima python=3.10
conda activate gui_aima
pip install torch==2.6.0 torchvision==0.21.0 torchaudio==2.6.0 --index-url https://download.pytorch.org/whl/cu124
pip install -e .
```

3. Evaluation on GUI Grounding Benchmarks
We provide evaluation scripts on ScreenSpot-Pro, ScreenSpot-v2 and OSWorld-G under the `eval/` folder: `eval_ss_pro.sh`, `eval_ss_v2.sh`, `eval_osworld_g.sh`. 

For ScreenSpot-Pro and OSWorld-G, we provide 2-step inference in `eval_ss_pro.sh` and `eval_osworld_g.sh`, which determines the focusing area at the 1st step and zoom-in the focusing area for grounding at the 2nd step without extra model training.

For ScreenSpot-Pro and OSWorld-G, you need to download the data from their repo, then adjust the data path in `eval_ss_pro.sh` and `eval_osworld_g.sh`.
