
# 🌀 MR-CATR: Motion-Residual Conflict-Aware Time-Reversal Sampler for Generative Inbetweening

This repository provides the official PyTorch implementation of **MR-CATR**, a plug-and-play **inference-time sampler** for **generative video inbetweening**.
MR-CATR builds upon **Stable Video Diffusion (SVD-XT)** and improves time-reversal sampling by explicitly resolving **motion-prior conflicts** between start- and end-conditioned paths.

<p align="center" width="100%">
  <img src="./output/bear/000000.gif" width="32%">
  <img src="./output/boat/000000.gif" width="32%">
  <img src="./output/car-turn/000000.gif" width="32%">
</p>



## 🚀 Getting Started

### 🔧 1. Environment Setup
Our implementation is based on the official  
[**generative-models**](https://github.com/Stability-AI/generative-models) repository by Stability AI.

Please first clone `generative-models`, and then place `mrcatr.py` into:

```text
generative-models/scripts/sampling/
````

Follow the environment setup instructions provided in the `generative-models` repository.



### 📦 2. Pre-trained Model

Download the **Stable Video Diffusion (SVD-XT)** checkpoint from:

* [https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt](https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt)

Then specify the local path to the downloaded checkpoint in:

```text
scripts/sampling/configs/svd_xt.yaml
```

For example:

```yaml
ckpt_path: /path/to/svd_xt.safetensors
```



### 🎞️ 3. Video Inbetweening

To run inference with MR-CATR, execute:

```bash
python scripts/sampling/mrcatr.py
```




## 🧩 Code Release Note

This repository currently releases the MR-CATR sampling code built on top of the ViBiD sampling pipeline, enabling reproducible inference and qualitative evaluation.

The complete training and end-to-end codebase, including additional utilities and scripts, will be released upon paper acceptance.


