From Isolation to Identification

Published: 01 Jan 2024, Last Modified: 09 May 2025PSD 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: We present a mathematical framework for understanding when successfully distinguishing a person from all other persons in a data set—a phenomenon which we call isolation—may enable identification, a notion which is central to deciding whether a release based on the data set is subject to data protection regulation. We show that a baseline degree of isolation is unavoidable in the sense that isolation can typically happen with high probability even before a release was made about the data set and hence identification is not enabled. We then describe settings where isolation resulting from a data release may enable identification.
Loading