Incentivizing Truthful Submissions in a Data Marketplace for Mean Estimation

Published: 03 Feb 2026, Last Modified: 03 Feb 2026AISTATS 2026 PosterEveryoneRevisionsBibTeXCC BY 4.0
Abstract: We study a data marketplace where a broker intermediates between buyers, who seek to estimate the mean $\mu$ of an unknown normal distribution $N(\mu, \sigma^2)$, and contributors, who can collect data from this distribution at a cost. The broker delegates data collection work to contributors, aggregates reported datasets, sells it to buyers, and redistributes revenue as payments to contributors. We aim to maximize welfare or profit under key constraints: individual rationality for buyers and contributors, incentive compatibility (contributors are incentivized to comply with data collection instructions and truthfully report the collected data), and budget balance (total contributor payments equals total revenue). We first compute welfare/profit-optimal prices under truthful reporting; however, to incentivize data collection and truthful data reporting, we adjust them based on discrepancies in contributors' reported data. This yields a Nash equilibrium (NE) where the two lowest-cost contributors collect all data. We complement this with two hardness results: $\mathcal{(i)}$ no nontrivial dominant-strategy incentive-compatible mechanism exists in this problem, and $\mathcal{(ii)}$ no mechanism outperforms ours in a NE.
Submission Number: 944
Loading