SOMnibus: Recovering Underlying Sensitive Attributes with Self-Organizing Maps

Joseph Charles Bingham; Netanel Arussy; Dvir Aran

SOMnibus: Recovering Underlying Sensitive Attributes with Self-Organizing Maps

Joseph Charles Bingham, Netanel Arussy, Dvir Aran

Published: 02 Mar 2026, Last Modified: 14 Apr 2026AFAA 2026 PosterEveryoneRevisionsBibTeXCC BY 4.0

Track: Tiny/Short Papers Track (up to 3 pages)

Keywords: fairness, unsupervised learning, self-organizing maps, sensitive attribute leakage, dimensionality reduction

TL;DR: In this work, we explore utilizing how fairness can be violated utilizing an unsupervised learning pipeline

Abstract: Unsupervised representation learning is often assumed to be benign with respect to sensitive attributes when those attributes are withheld from training. We challenge this assumption by demonstrating systematic \emph{representation-level leakage} of ordinal sensitive attributes in purely unsupervised embeddings. Using \textbf{SOMnibus}, a topology-preserving method based on high-capacity Self-Organizing Maps, we show that attributes such as age and income emerge as dominant latent axes despite being explicitly excluded from the input. Across two large-scale real-world benchmarks, the World Values Survey and the Census-Income (KDD) dataset, SOMnibusrecovers monotonic orderings aligned with withheld sensitive attributes, achieving Spearman correlations of up to $0.85$, while PCA and UMAP typically remain below $0.23$ . Moreover, unsupervised segmentation of SOMnibus embeddings yields demographically skewed clusters, revealing downstream fairness risks in the absence of any supervised task. These results demonstrate that \emph{fairness through unawareness} can fail at the representation level.

Submission Number: 24

Loading