Control with adaptive Q-learning: A comparison for two classical control problems

João Pedro Araújo, Mário A. T. Figueiredo, Miguel Ayala Botto

Published: 2022, Last Modified: 29 Apr 2025Eng. Appl. Artif. Intell. 2022EveryoneRevisionsBibTeXCC BY-SA 4.0

Abstract: Highlights•Two recent Q-learning algorithms, AQL and SPAQL, are evaluated on two classical control benchmarks.•Based on insights from control theory, a new algorithm, SPAQL-TS, is introduced.•It is shown that both SPAQL and SPAQL-TS outperform TRPO in the Cartpole problem.