Position Estimation of Obstacles Using Maximum Expectation Stochastic Gradient Variational Inference in Intelligent Control Game Systems

Published: 09 Mar 2025, Last Modified: 09 Mar 2025MathAI 2025 OralEveryoneRevisionsBibTeXCC BY 4.0
Keywords: Incomplete Information; Intelligent Control Game; Q-learning; Maximum Expected Random Gradient Variational Inference; Reinforcement Learning
Abstract:

This paper discusses the optimization of the confrontation position of agent in a two-dimensional game system with incomplete information, based on the random movements of agent relative to obstacles. Incomplete information games contain numerous unknown factors, adding complexity to intelligent control game systems. To address these unknown factors, intelligent control game systems often require substantial data and significant computational resources. However, real confrontation systems involve intelligence gathering, and neither side will have a complete information set about the confrontation situation. To tackle this issue, this paper proposes a Maximum Expected Stochastic Gradient Variational Inference Algorithm within the Q-learning framework, which can infer the position coordinates of obstacles in the confrontation plane. In this paper, the experimental data of Q-learning model of reinforcement learning,The Maximum Expectation Stochastic Gradient Variational Inference algorithm is then used to estimate the position coordinates of the obstacles arranged by player B.

Submission Number: 2
Loading