# Real-Time Facial Animation of Gaussian Head Avatars via Mocap-to-Parametric Expression Mapping

The code implementation of the paper "Real-Time Facial Animation of Gaussian Head Avatars via Mocap-to-Parametric Expression Mapping"

## Scope of Release

This repository contains the core implementation required to reproduce the main results of our paper. In particular, we release:

* **Core training and inference code** for the proposed expression mapping and animation pipeline.
* **Pretrained checkpoints** (avatar point clouds and expression mappers) used in our experiments.
* **A pretrained expression regularization (ER) matrix**.

The following components are **not included** in this release:

* **Data collection and preprocessing scripts** for training the expression mapper.
* **ER matrix construction script** is not included due to space and complexity, while the construction procedure is described in Section 3.2.4 of the paper, and the provided model suffices to visualize the qualitative results.


## Environment Setup

This repository is built on top of [GaussianAvatars](https://github.com/ShenhanQian/GaussianAvatars.git). Run the script below to fetch a specific upstream commit and apply a small compatibility patch.

```
bash scripts/fetch.sh
```

Please follow its [Installation](https://github.com/ShenhanQian/GaussianAvatars/blob/main/doc/installation.md) guide and download all the required files at [Download](https://github.com/ShenhanQian/GaussianAvatars/blob/main/doc/download.md).


**Note:** The current motion capture (mocap) input is supported **only via ARKit**, which requires an iOS device. Support for additional PC-compatible alternatives (e.g., MediaPipe) is planned and will be developed in future updates.


## Inference

### Data Layout

`checkpoints/` contains pretrained checkpoints used in our experiments, including avatar point clouds and expression mappers. Each subject folder (e.g., `306`) follows the structure below. While experiments were conducted on nine subjects, only one subject is included here due to the supplementary file size limit.

```
SUBJECT_FOLDER/
├── point_cloud.ply    # Gaussian point cloud checkpoint
├── flame_param.npz    # FLAME sequence used to train the avatar
├── mat.npy            # pre-trained expression mapper from ARKit blendshapes to FLAME parameters
├── transform.json     # camera view for offline rendering in recording mode
```
- `examples/` provides three sample ARKit blendshape sequences, corresponding to those described in `Section 4.3.2 Cross-identity Reenactment` of the paper. Each subfolder (e.g., `sequence_a`) includes an `arkit.json` that can be directly processed in [recording mode](#recording-mode). Video results are demonstrated in `demo.mp4`.
- `data/` contains the training datasets for two subjects, used to train the expression mapper.

### Live Mode (GUI)

1. Open an GUI and stream ARKit blendshape data from FaceCap IOS App. 
2. In this mode, you have to run `mocap/facecap/bs_receiver.py` in one terminal to receive incoming blendshape data from your IOS device. The detailed setup process is described in [mocap/facecap/README.md](mocap/facecap/README.md).
3. In another terminal, launch the real-time renderer which opens a GUI (modified from the viewer of [GaussianAvatars](https://github.com/ShenhanQian/GaussianAvatars.git)) and listens on `127.0.0.1:9000`:

   ```bash
   bash scripts/stream.sh
   ```
   Adjust the subject ID inside the script to switch avatars.

### Recording Mode

1. Offline processing of a pre-recorded ARKit sequence:

   ```bash
   bash scripts/record.sh
   ```

   The script first converts `examples/<SEQUENCE>/arkit.json` to FLAME parameters using `exp_mapper.py`, then renders frames with `offline_renderer.py`. Outputs are saved under `examples/<SEQUENCE>/<SUBJECT>/`.
2. To record your own ARKit sequence, follow the recording steps in [mocap/facecap/README.md](mocap/facecap/README.md) to capture and align the data. Place the resulting `arkit.json` in `examples/<YOUR_SEQUENCE>/` and update the script variables before running.

## Train

The data preparation code is not currently available. Nevertheless, aligned training dataset for subjects 306 is provided in the `data/` directory and can be directly used to train the expression mapper.

You can test the training code by running: 
```
python train.py --subject 306
```

## Citation

If you find this code useful for your research, please cite:

```bibtex
@inproceedings{qian2024gaussianavatars,
  title={Gaussianavatars: Photorealistic head avatars with rigged 3d gaussians},
  author={Qian, Shenhan and Kirschstein, Tobias and Schoneveld, Liam and Davoli, Davide and Giebenhain, Simon and Nie{\ss}ner, Matthias},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={20299--20309},
  year={2024}
}

@article{kirschstein2023nersemble,
    author = {Kirschstein, Tobias and Qian, Shenhan and Giebenhain, Simon and Walter, Tim and Nie\ss{}ner, Matthias},
    title = {NeRSemble: Multi-View Radiance Field Reconstruction of Human Heads},
    year = {2023},
    issue_date = {August 2023},
    publisher = {Association for Computing Machinery},
    address = {New York, NY, USA},
    volume = {42},
    number = {4},
    issn = {0730-0301},
    url = {https://doi.org/10.1145/3592455},
    doi = {10.1145/3592455},
    journal = {ACM Trans. Graph.},
    month = {jul},
    articleno = {161},
    numpages = {14},
}
```