
## Reward-Mixing MDPs with Few Latent Contexts are Learnable
Jeongyeol Kwon, Yonathan Efroni, Constantine Caramanis, Shie Mannor
Keywords: 
ICML/2023/Proceedings/24833 - Reward-Mixing MDPs with Few Latent Contexts are Learnable.pdf
Project URL: N/A

[X] Theoretical

### Implementation
_Given the documentation provided by the authors on the method, how much time would it take to re-implement the method from scratch?_

[1]

N/A

### Data
_Given the data description in the documentation, how much effort would it take to either find the same dataset the authors used, find similar datasets and defend their comparability, or acquire one from scratch?_

[1]

(0/0)

N/A

### Configuration
_Given the (hyper)parameters, including semantic parameters, of the method: How much effort would it take to acquire the algorithm configurations used for their results, and compare against their budgetary constraints?_

[1]

N/A

### Experimental Procedure
_Given the experimental set-up of the work, how difficult is it to set up a new experiment, similar to those presented in the original work, with the same procedure?_

[1]

N/A

### Expertise
_How much effort would it take to acquire the expertise required to reproduce the work independently relying on the available documentation?_

[9]

Requires experience with Markov decision processes (MDPs).
