Markov Chain Monte Carlo for Koopman-Based Optimal Control

João P. Hespanha; Kerem Yunus Çamsari

Markov Chain Monte Carlo for Koopman-Based Optimal Control

João P. Hespanha, Kerem Yunus Çamsari

Published: 01 Jan 2024, Last Modified: 12 May 2025IEEE Control. Syst. Lett. 2024EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: We propose a Markov Chain Monte Carlo (MCMC) algorithm based on Gibbs sampling with parallel tempering to solve nonlinear optimal control problems. The algorithm is applicable to nonlinear systems with dynamics that can be approximately represented by a finite dimensional Koopman model, potentially with high dimension. This algorithm exploits linearity of the Koopman representation to achieve significant computational saving for large lifted states. We use a video-game to illustrate the use of the method.

Loading