Published: 01 Jan 2023, Last Modified: 22 Feb 2024ICML 2023Readers: Everyone
Abstract:This paper introduces a new principled approach for off-policy learning in contextual bandits. Unlike previous work, our approach does not derive learning principles from intractable or loose bound...