GDC: From Brittle Optimality to Robust Satisfiability  via Riemannian Risk Geometry

GDC: From Brittle Optimality to Robust Satisfiability via Riemannian Risk Geometry

ICLR 2026 Conference Submission22595 Authors

20 Sept 2025 (modified: 08 Oct 2025)ICLR 2026 Conference SubmissionEveryoneRevisionsBibTeXCC BY 4.0

Keywords: Differential Geometry, Risk-Sensitive Control, Robustness

TL;DR: We propose Geodesic Duality Control (GDC), a reinforcement learning method that adjusts the Bellman target based on geometric curvature to improve policy safety margins while maintaining performance.

Abstract: Standard reinforcement learning (RL) often yields brittle policies that fail under hard safety constraints. We propose **Geodesic Duality Control (GDC)**, which adapts an agent’s risk posture endogenously by re-weighting the Bellman target using local geometric cues of the value function—specifically, the gradient magnitude $||\nabla V||$ and a curvature surrogate $\kappa$. To accommodate piecewise-smooth neural critics, we formulate a Sub-Riemannian / generalized-gradient treatment and provide practical, numerically stable curvature surrogates. Our main theoretical result shows that, under explicit regularity and stochastic-model assumptions, GDC induces a curvature-decreasing learning dynamic: $$ \frac{d\kappa}{dt} < 0, $$ which in turn increases a quantifiable safety margin $\Delta_s$. We validate the mechanism with proof-of-concept experiments—including a hard-boundary safety environment (*Optimal-Trap*), targeted ablations, and a computational-cost study on Humanoid-Safety—to confirm the intended geometric risk posture. We do not claim broad empirical superiority on all benchmarks; rather, the paper’s primary contribution is theoretical, with key components validated empirically.

Supplementary Material: zip

Primary Area: applications to robotics, autonomy, planning

Submission Number: 22595

Loading