Optimal Best-Arm Identification Methods for Tail-Risk Measures

Shubhada Agrawal; Wouter M Koolen; Sandeep Kumar Juneja

Optimal Best-Arm Identification Methods for Tail-Risk Measures

Shubhada Agrawal, Wouter M Koolen, Sandeep Kumar Juneja

Published: 09 Nov 2021, Last Modified: 05 May 2023NeurIPS 2021 PosterReaders: Everyone

Keywords: Multi-armed bandits, pure exploration, CVaR, VaR, tail-risk, heavy-tailed distributions, best-arm identification

Abstract: Conditional value-at-risk (CVaR) and value-at-risk (VaR) are popular tail-risk measures in finance and insurance industries as well as in highly reliable, safety-critical uncertain environments where often the underlying probability distributions are heavy-tailed. We use the multi-armed bandit best-arm identification framework and consider the problem of identifying the arm from amongst finitely many that has the smallest CVaR, VaR, or weighted sum of CVaR and mean. The latter captures the risk-return trade-off common in finance. Our main contribution is an optimal $\delta$-correct algorithm that acts on general arms, including heavy-tailed distributions, and matches the lower bound on the expected number of samples needed, asymptotically (as $ \delta$ approaches $0$). The algorithm requires solving a non-convex optimization problem in the space of probability measures, that requires delicate analysis. En-route, we develop new non-asymptotic, anytime-valid, empirical-likelihood-based concentration inequalities for tail-risk measures.

Code Of Conduct: I certify that all co-authors of this work have read and commit to adhering to the NeurIPS Statement on Ethics, Fairness, Inclusivity, and Code of Conduct.

TL;DR: We consider the best-arm identification problem in the multi-armed bandit framework where an arm with the smallest tail-risk measure is identified.

Supplementary Material: pdf

Code: zip

15 Replies

Loading