Toggle navigation
OpenReview
.net
Login
×
Go to
AUTOMATICA 2022
homepage
Whittle index based Q-learning for restless bandits with average reward
Konstantin E. Avrachenkov
,
Vivek S. Borkar
Published: 01 Jan 2022, Last Modified: 10 May 2023
Autom. 2022
Readers:
Everyone
0 Replies
Loading