Integrating Human Expertise in Continuous Spaces: A Novel Interactive Bayesian Optimization Framework with Preference Expected Improvement

Nikolaus Feith; Elmar Rueckert

Integrating Human Expertise in Continuous Spaces: A Novel Interactive Bayesian Optimization Framework with Preference Expected Improvement

Nikolaus Feith, Elmar Rueckert

Published: 01 Jan 2024, Last Modified: 13 Nov 2024UR 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Interactive Machine Learning (IML) seeks to integrate human expertise into machine learning processes. However, most existing algorithms cannot be applied to real world scenarios because their state spaces and/or action spaces are limited to discrete values. Furthermore, the interaction is limited to either a binary, good or bad, decision or the choice of which of the proposed solutions is the best. We therefore propose a novel framework based on Bayesian Optimization (BO). Interactive Bayesian Optimization (IBO) captures user preferences and provides an interface for users to shape the strategy by hand. Additionally, we've incorporated a new acquisition function, Preference Expected Improvement (PEI), to refine the system's efficiency using a probabilistic model of the user preferences. Our approach is geared towards ensuring that machines can benefit from human expertise, aiming for a more aligned and effective learning process. In the course of this work, we applied our method to simulations and in a real world task using a Franka Panda robot to show human-robot collaboration.

Loading