# Code used for the experiments with VARAN: Variational Inference for Speech Self-Supervised Models Fine-tuning on Downstream Tasks

## Setup

```bash
docker build -t varan .
docker run -it --rm --net=host --ipc=host --gpus=all -v "$PWD":/app -v <path_to_data_directory>:/app/data varan
```

The repository has four main directories:
- `ser` – code to fine-tune WavLM or data2vec models for the emotion recognition task.
- `sv` – code to fine-tune the WavLM model for the speaker verification task.
- `asr` – code to fine-tune the WavLM model for the automatic speech recognition task.
- `mhfa` – the official implementation of the [MHFA method](https://arxiv.org/abs/2210.01273) for speaker verification, together with its adaptation for the emotion recognition task.

## Usage

TODO: Write instructions for training and inference
