Tsetlin Machine for Solving Contextual Bandit Problems

Raihan Seraj; Jivitesh Sharma; Ole-Christoffer Granmo

Tsetlin Machine for Solving Contextual Bandit Problems

Raihan Seraj, Jivitesh Sharma, Ole-Christoffer Granmo

Published: 31 Oct 2022, Last Modified: 04 Aug 2025NeurIPS 2022 AcceptReaders: Everyone

Keywords: Contextual bandits, Tsetlin Machine

TL;DR: The paper presents an interpretable contextual bandit algorithm using Tsetlin Machine which learns pattern recognition task using propositional (boolean) logic.

Abstract: This paper introduces an interpretable contextual bandit algorithm using Tsetlin Machines, which solves complex pattern recognition tasks using propositional (Boolean) logic. The proposed bandit learning algorithm relies on straightforward bit manipulation, thus simplifying computation and interpretation. We then present a mechanism for performing Thompson sampling with Tsetlin Machine, given its non-parametric nature. Our empirical analysis shows that Tsetlin Machine as a base contextual bandit learner outperforms other popular base learners on eight out of nine datasets. We further analyze the interpretability of our learner, investigating how arms are selected based on propositional expressions that model the context.

Supplementary Material: pdf

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 1 code implementation](https://www.catalyzex.com/paper/tsetlin-machine-for-solving-contextual-bandit/code)

10 Replies

Loading