# In-Context Preference-based Reinforcement Learning (ICPRL)

This repo contains implementations for *Learning in Context, Guided by Choice: A Reward-Free Paradigm for Reinforcement Learning with Transformers*

We propose and investigate two ICPRL settings: **I-PRL** and **T-PRL**. We conduct experiments mainly with two MDP environments: **DarkRoom** and **Meta-World**. As we propose different frameworks for different settings, this repo is structured according to the experiments' ICPRL setting and MDP environment. Specifically,

- DarkRoom in I-PRL: see directory ***I-PRL DarkRoom***.
- Meta-World in I-PRL: see directory ***I-PRL Meta-World***. 
- DarkRoom in T-PRL: see directory ***T-PRL DarkRoom***.
- Meta-World in T-PRL: see directory ***T-PRL Meta-World***. 