## Datasets download (These are all public datasets.)

### RVOS
Please refer to [Refer-Youtube-VOS Challenge](https://competitions.codalab.org/competitions/29139) to download.

### A2D Sentences
Follow the instructions and download the dataset from the website [here](https://kgavrilyuk.github.io/publication/actor_action/). 

### JHMDB-Sentences

Follow the instructions and download the dataset from the website [here](https://kgavrilyuk.github.io/publication/actor_action/). 
Then, extract the files. Additionally, we use the same json annotation files generated by [MTTR](https://github.com/mttr2021/MTTR). Please download these files from [google drive](https://drive.google.com/drive/u/0/folders/1sXmjpWmc0GxYIz-EFLw5S9dJvmGJAPqx).


## Usage
```
python main.py
```

## Requirements

Python 3.8
CUDA 11.2
NVIDIA GPU: Telsa V100 * 4