DECO-Bench: Unified Benchmark for Decoupled Task-Agnostic Synthetic Data Release

Published: 26 Sept 2024, Last Modified: 13 Nov 2024NeurIPS 2024 Track Datasets and Benchmarks PosterEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Synthetic Data Generation, Decoupling Methods, Privacy-Utility Trade-offs, Privacy-Preserving Machine Learning, Stable Diffusion
TL;DR: Introducing a unified benchmarking framework for task-agnostic decoupling in privacy-preserving ML, filling a critical gap in the field.
Abstract: In this work, we tackle the question of how to systematically benchmark task-agnostic decoupling methods for privacy-preserving machine learning (ML). Sharing datasets that include sensitive information often triggers privacy concerns, necessitating robust decoupling methods to separate sensitive and non-sensitive attributes. Despite the development of numerous decoupling techniques, a standard benchmark for systematically comparing these methods remains absent. Our framework integrates various decoupling techniques along with synthetic data generation and evaluation protocols within a unified system. Using our framework, we benchmark various decoupling techniques and evaluate their privacy-utility trade-offs. Finally, we release our source code, pre-trained models, datasets of decoupled representations to foster research in this area.
Supplementary Material: pdf
Submission Number: 1809
Loading