Abstract: Highlights•The proposed STAR method introduces state–action representation to enhance visual RL.•The Q-function can still converge after incorporating state–action embedding.•STAR outperforms prior methods on challenging tasks from DeepMind Control Suite.
Loading