# Gaze on the Prize - Code Implementation

## Overview
Implementation of "Gaze on the Prize: Shaping Visual Attention with Return-Guided Contrastive Learning" for ICLR 2026 submission #23632.

## Requirements

### Tested Environment
This code was developed and tested on:
- Python 3.10.18
- PyTorch 2.4.1 with CUDA 12.1
- Ubuntu 20.04
- NVIDIA RTX 3090 (24GB VRAM)
- FAISS 1.8.0

While the code may work on other configurations, we have only validated it on the above setup.

### Installation

0. Create a conda environment:

```bash
conda create -n gaze_on_the_prize python=3.10 -y
conda activate gaze_on_the_prize
```

1. Install ManiSkill3 (robotics simulation environment)

Our fork of the original [ManiSkill3]('https://github.com/haosulab/ManiSkill') environment (with cluttered environments) couldn't be uploaded due to file size. Please clone the original repository.

```
git clone https://github.com/haosulab/ManiSkill.git
```

and install

```bash
cd ManiSkill && pip install -e . && cd ..
```

1. Install FAISS for efficient similarity search

Refer to the [original repository](https://github.com/facebookresearch/faiss) for detailed instructions specific to your platform and requirements.

3. Install remaining dependencies:

```bash
conda install pytorch==2.4.1 torchvision==0.19.1 torchaudio==2.4.1 pytorch-cuda=12.1 -c pytorch -c nvidia
pip install imageio==2.19.0 matplotlib tyro tensorboard opencv-python gymnasium
```

## Quick Start

### Training with our method
```bash
# PPO with foveal attention and contrastive learning
CUDA_VISIBLE_DEVICES=0 python gaze_on_the_prize_ppo.py --env_id PushT-v1 --gamma 0.99 --total_timesteps 50_000_000 --attention_type foveal --use_contrastive

# SAC with foveal attention and contrastive learning  
CUDA_VISIBLE_DEVICES=0 python gaze_on_the_prize_sac.py --env_id PickCube-v1 --attention_type foveal --use_contrastive
```

### Training scripts

These scripts demonstrate the commands used for key experiments.
We provide examples for PlaceSphere, PokeCube, and PushT tasks in `training_scripts.sh`
Complete codebase with all experiment commands will be released upon acceptance.