Differentially Private Heavy Hitters using Federated Analytics

Karan Chadha; Junye Chen; John Duchi; Vitaly Feldman; Hanieh Hashemi; Omid Javidbakht; Audra McMillan; Kunal Talwar

Differentially Private Heavy Hitters using Federated Analytics

Karan Chadha, Junye Chen, John Duchi, Vitaly Feldman, Hanieh Hashemi, Omid Javidbakht, Audra McMillan, Kunal Talwar

Published: 19 Jun 2023, Last Modified: 21 Jul 2023FL-ICML 2023EveryoneRevisionsBibTeX

Keywords: Heavy Hitters Detection, Differential Privacy, Federated Analytics

TL;DR: We propose and optimize an algorithm to find as many of the most frequent data points as possible across all users' data with aggregate and local differential privacy.

Abstract: We study practical heuristics to improve the performance of prefix-tree based algorithms for differentially private heavy hitter detection. Our model assumes each user has multiple data points and the goal is to learn as many of the most frequent data points as possible across all users' data with aggregate and local differential privacy. We propose an adaptive hyperparameter tuning algorithm that improves the performance of the algorithm while satisfying computational, communication and aggregate privacy constraints. We explore the impact of different data-selection schemes as well as the impact of introducing deny lists during multiple runs of the algorithm. We test these improvements using extensive experimentation on the Reddit dataset on the task of learning most frequent words.

Submission Number: 40

Loading