Abstract: Highlights
• A sample-efficient RMPC algorithm is proposed for pixel-based long-horizon tasks.
• A regularization is incorporated into value function estimation and model learning.
• RMPC achieves a 20.88% improvement and a 56.39% decrease in standard deviation.
External IDs: doi:10.1016/j.asoc.2025.113377