Reproducing "Fair Selective Classification via Sufficiency"

Published: 11 Apr 2022, Last Modified: 05 May 2023 · RC2021
Abstract: Reproducibility Summary

Scope of Reproducibility
In this reproducibility study we focus on the paper "Fair Selective Classification via Sufficiency". Our experiments address the following claims:
1. Sufficiency mitigates disparities in precision across the entire coverage scale and in margin distributions, and in no case increases these disparities relative to a baseline selective classification model.
2. Using sufficiency may decrease overall accuracy in some cases, but it still mitigates the disparity between groups when looking at individual classification scores.
3. The sufficiency-regularised classifier exhibits better fairness performance on traditional fairness datasets.

Methodology
As the authors have not made their code publicly available, all code was written from scratch, based on the instructions and pseudocode given in the original paper. Our reconstruction contains code for training both the sufficiency model and a baseline model performing standard selective classification.

Results
We were not able to fully reproduce the results of the original paper in this setting. The numbers (accuracies, precisions and margin distributions) obtained in our experiments differ significantly from those reported in the original paper. Although the differences between the baseline model and the sufficiency model are less pronounced than in the original paper, our results do support the main claim that sufficiency increases worst-group precision and thereby decreases the disparities between groups.

What was easy
The authors made the importance of implementing fair selective classification with sufficiency very clear. They also provided an in-depth mathematical background on sufficiency and selective classification, making their reasoning explicit. Finally, they presented their results in a manner that allowed for straightforward comparison once we had trained the model.
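For context, "sufficiency" here refers to the standard fairness criterion requiring that the sensitive attribute carry no additional information about the label once the model's score is known (our notation; the original paper's exact formulation may differ):

```latex
% Sufficiency: the label Y is conditionally independent of the
% group attribute A given the model score R = f(X)
P(Y = y \mid R = r, A = a) = P(Y = y \mid R = r)
\quad \text{for all } y,\, r,\, a.
```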
What was difficult
Many technical details and model parameters were not specified in the original paper, and since no code was provided by the authors, these initially had to be determined by experimentation. Furthermore, some of the figures in the paper caused confusion about the exact implementation of the model.

Communication with original authors
As soon as we noticed we needed clarification on the hyperparameters, datasets and models, we contacted the authors via email. We initially received no reply, and the authors were ultimately only able to answer some of our questions on the Tuesday before the deadline. While we re-implemented our model based on the newly supplied information, there was too little time to fix the new issues that became apparent with the new model.

The code is available at: https://github.com/MLRC2022FSCS/FSCS
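The evaluation behind claim 1 (per-group precision across the coverage scale) can be sketched as follows. The function name and the use of a scalar confidence score for selection are our illustrative assumptions, not details taken from the paper:

```python
import numpy as np

def group_precision_at_coverage(scores, preds, labels, groups, coverage):
    """Hypothetical helper illustrating the selective-classification
    evaluation: accept only the `coverage` fraction of samples with the
    highest confidence scores (the model abstains on the rest), then
    compute precision (accuracy on the accepted set) per group."""
    n = len(scores)
    k = int(np.ceil(coverage * n))
    # Indices of the k most confident predictions, in descending score order.
    accepted = np.argsort(-scores)[:k]
    precisions = {}
    for g in np.unique(groups[accepted]):
        mask = groups[accepted] == g
        correct = preds[accepted][mask] == labels[accepted][mask]
        precisions[g] = float(np.mean(correct))
    return precisions
```

Sweeping `coverage` from low to high and plotting the gap between the per-group precisions yields the kind of disparity-vs-coverage curves the claims refer to.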
Paper Url: http://proceedings.mlr.press/v139/lee21b.html
Paper Venue: ICML 2021
Supplementary Material: zip