Abstract: Probabilistic predictions are probability distributions over the set of possible outcomes. Such predictions quantify the uncertainty in the outcome, making them essential for effective decision making. Combining multiple predictions pools the information sources used to generate them, often resulting in a more informative forecast. Probabilistic predictions are typically combined by linearly pooling the individual predictive distributions, a strategy that encompasses several ensemble learning techniques. The weights assigned to each prediction can be estimated based on their past performance, allowing more accurate predictions to receive a higher weight. This can be achieved by finding the weights that optimise a proper scoring rule over some training data. By embedding predictions into a Reproducing Kernel Hilbert Space (RKHS), we show that estimating the linear pool weights that optimise kernel-based scoring rules is a convex quadratic optimisation problem. This permits an efficient implementation of the linear pool when optimally combining predictions on arbitrary outcome domains. This result also holds for other combination strategies, and we additionally study a flexible generalisation of the linear pool that overcomes some of its theoretical limitations, whilst allowing an efficient implementation within the RKHS framework. These approaches are compared in an application to operational wind speed forecasts, where this generalisation is found to offer substantial improvements upon the traditional linear pool.
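To make the abstract's central claim concrete, the following is a minimal sketch, assuming the component forecasts are available as ensembles of samples, of how the linear pool weights could be estimated by minimising the average kernel score (here the energy kernel, whose kernel score is the CRPS). The array names (`ensembles`, `obs`), their shapes, and the use of SLSQP are illustrative assumptions, not the authors' implementation.

```python
import numpy as np
from scipy.optimize import minimize

def energy_kernel(x, xp):
    # k(x, x') = |x| + |x'| - |x - x'|; the kernel score of this kernel is the CRPS
    return np.abs(x) + np.abs(xp) - np.abs(x - xp)

def qp_terms(ensembles, obs, kernel=energy_kernel):
    """Average the quadratic (K) and linear (c) terms of the kernel score over
    training cases, so that the mean score of the linear pool with weights w
    is 0.5 * w @ K @ w - w @ c + const.
    ensembles: (n_cases, n_models, n_members); obs: (n_cases,)."""
    n_cases, n_models, _ = ensembles.shape
    K = np.zeros((n_models, n_models))
    c = np.zeros(n_models)
    for x_n, y_n in zip(ensembles, obs):
        # K_ij averages k over pairs of ensemble members from models i and j
        K += kernel(x_n[:, None, :, None], x_n[None, :, None, :]).mean(axis=(2, 3))
        c += kernel(x_n, y_n).mean(axis=1)
    return K / n_cases, c / n_cases

def fit_weights(ensembles, obs):
    K, c = qp_terms(ensembles, obs)
    m = len(c)
    # Convex QP over the probability simplex; SLSQP is used here for simplicity
    res = minimize(
        fun=lambda w: 0.5 * w @ K @ w - w @ c,
        jac=lambda w: K @ w - c,
        x0=np.full(m, 1.0 / m),
        bounds=[(0.0, 1.0)] * m,
        constraints={"type": "eq", "fun": lambda w: w.sum() - 1.0},
        method="SLSQP",
    )
    return res.x
```

Because the Gram term `K` is positive semi-definite for a positive definite kernel, the objective is convex and the simplex constraints are linear, so any off-the-shelf QP solver would also apply.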
Submission Length: Long submission (more than 12 pages of main content)
Changes Since Last Submission: The major changes that have been made since the initial submission are listed below. Further details are provided in the responses to the reviewers' comments.
### Changes
- We provide further information regarding the use of functional weights in Section 4.2. We demonstrate how these can be handled within the framework of our Proposition 2 if the weight functions are defined as linear combinations of basis functions, in which case the optimisation problem again becomes quadratic in the basis function coefficients; a minimal sketch of this construction is given after this list. This is explored in detail in a new appendix, Appendix B.
- We present additional results when other kernels are used to define the loss function for estimating the combination weights. In addition to the energy kernel (corresponding to the CRPS or energy score), we consider popular kernels including the Gaussian (squared exponential), Laplacian (exponential), Matérn, and inverse multiquadric kernels; a sketch of these kernels also follows this list. We confirm empirically that, when the CRPS (energy score) is used to evaluate forecast performance, the best forecasts are obtained when the linear pool weights are estimated by minimising the CRPS (energy score) on the training data. These results are shown in Section 5.4. We additionally study the behaviour of the weights estimated using different kernel scores, and find that they are relatively insensitive to the choice of kernel in the application presented here. These additional results are shown in Appendix C.
- We make the connection between the proposed framework and kernel ridge regression more explicit. In particular, we discuss how the resulting forecast distribution or the estimated weight vector can be regularised by adding a penalty term to the average score, which again results in a convex quadratic optimisation problem, as in kernel ridge regression; a sketch of this regularised problem closes the list below. This discussion can be found in a new remark, Remark 3, after Proposition 2.
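The functional-weights construction in the first bullet admits a short sketch: when each weight is a linear combination of basis functions of a covariate s, so that w_j(s) = Σ_l β_{jl} φ_l(s), the mean kernel score is quadratic in the stacked coefficients, and the simplex constraints at the training covariates remain linear. Here the per-case terms `K_all` and `c_all` are assumed to be computed as in the sketch after the abstract; all names and shapes are illustrative, not the construction of Appendix B itself.

```python
import numpy as np
from scipy.optimize import minimize

def fit_basis_weights(K_all, c_all, basis):
    """K_all: (n_cases, n_models, n_models) per-case Gram terms;
    c_all: (n_cases, n_models) per-case linear terms;
    basis: (n_cases, n_basis) basis functions evaluated at each case's covariate.
    Returns the coefficient matrix beta with shape (n_models, n_basis)."""
    n_cases, n_models = c_all.shape
    n_basis = basis.shape[1]
    # A[n] maps the stacked coefficients beta to the weights at case n: w_n = A[n] @ beta
    A = np.stack([np.kron(np.eye(n_models), basis[n]) for n in range(n_cases)])
    # The mean score is again quadratic: 0.5 * beta @ Q @ beta - beta @ q + const
    Q = np.mean([A[n].T @ K_all[n] @ A[n] for n in range(n_cases)], axis=0)
    q = np.mean([A[n].T @ c_all[n] for n in range(n_cases)], axis=0)
    # Simplex constraints at every training covariate are linear in beta
    A_flat = A.reshape(n_cases * n_models, n_models * n_basis)
    cons = [
        {"type": "ineq", "fun": lambda b: A_flat @ b},             # w_j(s_n) >= 0
        {"type": "eq", "fun": lambda b: A.sum(axis=1) @ b - 1.0},  # sum_j w_j(s_n) = 1
    ]
    # Start from coefficients reproducing equal weights as closely as possible
    b0, *_ = np.linalg.lstsq(A_flat, np.full(n_cases * n_models, 1.0 / n_models), rcond=None)
    res = minimize(
        fun=lambda b: 0.5 * b @ Q @ b - b @ q,
        jac=lambda b: Q @ b - q,
        x0=b0,
        constraints=cons,
        method="SLSQP",
    )
    return res.x.reshape(n_models, n_basis)
```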
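For the kernels named in the second bullet, the following sketch gives plausible forms (Matérn shown with smoothness 3/2) together with a sample-based estimator of the corresponding kernel score; the lengthscale values are illustrative hyperparameters, not those used in the paper.

```python
import numpy as np

def gaussian(x, xp, ls=1.0):
    return np.exp(-((x - xp) ** 2) / (2.0 * ls**2))

def laplacian(x, xp, ls=1.0):
    return np.exp(-np.abs(x - xp) / ls)

def matern32(x, xp, ls=1.0):
    d = np.sqrt(3.0) * np.abs(x - xp) / ls
    return (1.0 + d) * np.exp(-d)

def inverse_multiquadric(x, xp, ls=1.0):
    return 1.0 / np.sqrt(1.0 + ((x - xp) / ls) ** 2)

def kernel_score(sample, y, kernel):
    """Sample-based estimate of S_k(P, y) = 0.5 E k(X, X') - E k(X, y) + 0.5 k(y, y),
    where `sample` is a one-dimensional ensemble drawn from P."""
    xx = kernel(sample[:, None], sample[None, :]).mean()
    xy = kernel(sample, y).mean()
    return 0.5 * xx - xy + 0.5 * kernel(y, y)
```

Swapping any of these kernels into the QP construction sketched after the abstract changes only the Gram and linear terms; the weight-estimation problem remains a convex quadratic programme.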
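The regularisation described in the last bullet can be sketched as a one-line change to the earlier QP: a ridge penalty on the weight vector is added to the mean score, in analogy with kernel ridge regression. The penalty strength `lam` and the penalty form (a squared Euclidean norm on w) are our assumptions, not necessarily the form discussed in Remark 3.

```python
import numpy as np
from scipy.optimize import minimize

def fit_weights_ridge(K, c, lam=0.1):
    """K, c: averaged score terms as in the earlier sketch; lam: penalty strength."""
    m = len(c)
    # Adding lam/2 * ||w||^2 shifts the Gram term to K + lam * I, which is
    # positive definite, so the problem remains a (now strictly) convex QP
    K_reg = K + lam * np.eye(m)
    res = minimize(
        fun=lambda w: 0.5 * w @ K_reg @ w - w @ c,
        jac=lambda w: K_reg @ w - c,
        x0=np.full(m, 1.0 / m),
        bounds=[(0.0, 1.0)] * m,
        constraints={"type": "eq", "fun": lambda w: w.sum() - 1.0},
        method="SLSQP",
    )
    return res.x
```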
Assigned Action Editor: ~Krikamol_Muandet1
Submission Number: 4145