Continual Depth-limited Responses for Computing Counter-strategies in Sequential Games

David Milec; Ondrej Kubicek; Viliam Lisý

Continual Depth-limited Responses for Computing Counter-strategies in Sequential Games

David Milec, Ondrej Kubicek, Viliam Lisý

Published: 13 Mar 2024, Last Modified: 22 Apr 2024ALA 2024EveryoneRevisionsBibTeXCC BY 4.0

Keywords: large games, approximating best response, robust response, opponent exploitation, imperfect information, depth limited solving, gadgets

TL;DR: Novel approaches fusing depth-limited solving and opponent modeling outperforming SotA.

Abstract: In zero-sum games, the optimal strategy is well-defined by the Nash equilibrium. However, it is overly conservative when playing against suboptimal opponents and it can not exploit their weaknesses. Limited look-ahead game solving in imperfect-information games allows defeating human experts in massive real-world games such as Poker, Liar's Dice, and Scotland Yard. However, since they approximate Nash equilibrium, they tend to only win slightly against weak opponents. We propose methods combining limited look-ahead solving with an opponent model, in order to 1) approximate a best response in large games or 2) compute a robust response with control over the robustness of the response. Both methods can compute the response in real time to previously unseen strategies. We present theoretical guarantees of our methods. We show that existing robust response methods do not work combined with limited look-ahead solving of the shelf, and we propose a novel solution for the issue. Our algorithm performs significantly better than multiple baselines in smaller games and outperforms state-of-the-art methods against SlumBot.

Supplementary Material: zip

Type Of Paper: Full paper (max page 8)

Anonymous Submission: Anonymized submission.

Submission Number: 13

Loading