Off-Policy Actor-Critic for Adversarial Observation Robustness: Virtual Alternative Training via Symmetric Policy Evaluation

Kosuke Nakanishi, Akihiro Kubo, Yuji Yasui, Shin Ishii

Published: 2025, Last Modified: 08 May 2026, CoRR 2025, CC BY-SA 4.0