Published: 2022, Last Modified: 20 Feb 2024CLeaR 2022Readers: Everyone
Abstract:The Causal Bandit is a variant of the classic Bandit problem where an agent must identify the best action in a sequential decision-making process, where the reward distribution of the actions displ...