Hello!
This is the code that we, the authors of PHALAR, are distributing to the ICML 2026 reviewers.

PHALAR was run mainly on HPC systems, so if something breaks during testing, differences
    from that setup could be the culprit. Nonetheless, we provide a pyproject.toml that SHOULD
    recreate our training environment.

If you have 'uv' installed, you should be able to create the environment by running:
    'uv sync'
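
If the sync step appears to succeed but imports fail later, a quick check such as the sketch
    below may help narrow things down. It is NOT part of the repository: the module names
    'torch' and 'pytorch_lightning' are assumptions (adjust them to match pyproject.toml), and
    the function name 'check_environment' is purely illustrative.

```python
import importlib.util
import shutil

# ASSUMPTION: top-level module names to probe; edit to match pyproject.toml.
ASSUMED_MODULES = ("torch", "pytorch_lightning")


def check_environment(modules=ASSUMED_MODULES):
    """Return a dict mapping each check description to True or False."""
    # shutil.which returns None when the executable is not on PATH.
    report = {"uv on PATH": shutil.which("uv") is not None}
    for name in modules:
        # find_spec returns None when a top-level module cannot be located;
        # it does not import the module itself.
        report[f"module {name}"] = importlib.util.find_spec(name) is not None
    return report


if __name__ == "__main__":
    for check, ok in check_environment().items():
        print(f"{'ok     ' if ok else 'MISSING'}  {check}")
```

    Saved under a name of your choosing (for example 'check_env.py', a hypothetical file), it can
    be run inside the synced environment with 'uv run python check_env.py'.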

Not all the code used to create the submission is present, as some of it requires pieces from other
    codebases (like VERSA, used to compute the FAD_MERT, ViSQOL and Audiobox scores over the
    listening-test audio, which we DID NOT include here to avoid dramatically increasing the upload size).

Mainly, we suggest looking at 'main.py', which starts the pytorch-lightning training,
    and at 'model_usage_example.py', which gives a minimal working example of running
    both PHALAR and our retrained COCOLA version.
    You can try executing:
        'uv run python model_usage_example.py'
    To start training (which we don't suggest, as it requires 50 GPU hours):
        'uv run python main.py fit --config configs/train_phalar.yaml'

Furthermore, we provide the anonymized results of our human listening tests, from which
    it should be possible to reproduce the results we report. To enable that, we also provide
    the scores that all the models used for Table 2 obtained on the listening samples.
    All of that is available under 'listening_tests'. The calculations we ran can be found in 'stat_calcs.py'.
    Thus, you can recreate those results by running:
        'uv run python stat_calcs.py'