gradSim: Differentiable simulation for system identification and visuomotor control

J. Krishna Murthy; Miles Macklin; Florian Golemo; Vikram Voleti; Linda Petrini; Martin Weiss; Breandan Considine; Jérôme Parent-Lévesque; Kevin Xie; Kenny Erleben; Liam Paull; Florian Shkurti; Derek Nowrouzezahrai; Sanja Fidler

gradSim: Differentiable simulation for system identification and visuomotor control

J. Krishna Murthy, Miles Macklin, Florian Golemo, Vikram Voleti, Linda Petrini, Martin Weiss, Breandan Considine, Jérôme Parent-Lévesque, Kevin Xie, Kenny Erleben, Liam Paull, Florian Shkurti, Derek Nowrouzezahrai, Sanja Fidler

Published: 12 Jan 2021, Last Modified: 22 Jun 2025ICLR 2021 PosterReaders: Everyone

Keywords: Differentiable simulation, System identification, Physical parameter estimation, 3D scene understanding, 3D vision, Differentiable rendering, Differentiable physics

Abstract: In this paper, we tackle the problem of estimating object physical properties such as mass, friction, and elasticity directly from video sequences. Such a system identification problem is fundamentally ill-posed due to the loss of information during image formation. Current best solutions to the problem require precise 3D labels which are labor intensive to gather, and infeasible to create for many systems such as deformable solids or cloth. In this work we present gradSim, a framework that overcomes the dependence on 3D supervision by combining differentiable multiphysics simulation and differentiable rendering to jointly model the evolution of scene dynamics and image formation. This unique combination enables backpropagation from pixels in a video sequence through to the underlying physical attributes that generated them. Furthermore, our unified computation graph across dynamics and rendering engines enables the learning of challenging visuomotor control tasks, without relying on state-based (3D) supervision, while obtaining performance competitive to/better than techniques that require precise 3D labels.

One-sentence Summary: Differentiable models of time-varying dynamics and image formation pipelines result in highly accurate physical parameter estimation from video

Code Of Ethics: I acknowledge that I and all co-authors of this work have read and commit to adhering to the ICLR Code of Ethics

Supplementary Material: zip

Data: [ShapeNet](https://paperswithcode.com/dataset/shapenet)

Community Implementations: [![CatalyzeX](/images/catalyzex_icon.svg) 6 code implementations](https://www.catalyzex.com/paper/gradsim-differentiable-simulation-for-system/code)

9 Replies

Loading