Abstract: Highlights•Introduced the Dynamic Programming Expected Free Energy (DPEFE) for efficient active inference planning.•Developed a new algorithm for learning time-constrained agent behaviour preferences.•Demonstrated, theoretically and via simulations, reduced computational cost by orders of magnitude.
Loading