# NORESQA: A Framework for Speech Quality Assessment using Non-Matching References

This zip contains: (0) Supplementary Section for the paper that describes further experiments and ablations; and (1) listening examples for the Speech Enhancement Task.

## (0) Supplementary section for the paper
Refer to the *reading_material* folder after unzipping. It contains additional experiments we did to verify the metric. It includes:(i) Framework description including details on the architecture; (ii) Experimental setup, including details about the dataset and augmentations for training; (iii) Objective evaluations including empirically showing invariance to language and gender, showing properties like indiscernibility of identical and commutativity, and finally showing results on the frame wise detection capabilities; (iv) Subjective evaluation datasets including a short description of all 10 datasets that we used for correlating to MOS ratings; (v) Ablations, including relative VS absolute scores, multi-objective learning and influence of number of NMRs for evaluation; and finally (vi) Speech Enhancement, including description of the model and a few additional results.

## (1) Listening examples 
Refer to *listening_examples* folder for more details. It contains a few examples from the VCTK test set; including the noisy (original noisy), clean (target clean), baseline (L2 only), and pre_fin (our pretraining-finetuning approach).