We provide several visualizations of the IRIS agent after 100k environment steps, i.e. two hours of real-time experience:
- `episodes_atari.gif`: agent playing in the actual environments.
- `episodes_atari_imagined.mp4`: agent playing in the imagination of its world model. The imagination procedure starts from a true frame, and the agent receives as input reconstructed frames from the latent state tokens of the world model.
- `episode_krull.mp4`: agent playing in the game Krull, where we observe an unexpected behaviour. Instead of defending the princess from monsters, the agent will run away and hide to safely maximize its return in the next level. A nice illustration of the gap between human priors and RL objectives.