Train a snake with reinforcement learning algorithms

Ruikai Cai; Condi Zhang

14 Dec 2020 (modified: 05 May 2023), CUHK 2021 Course IERG5350 Blind Submission, Readers: Everyone

Keywords: Snake, Double-DQN, PPO

TL;DR: We try to use Double-DQN and PPO to train an agent to play the Snake game.

Abstract: Snake is a classic video game in which the player maneuvers a line that grows in length, with the line itself being a primary obstacle. Our goal is to train an agent that plays Snake more efficiently than a human. To approach this goal, we first use pygame to implement a simple Snake game as the environment. Snake has far too many states to represent with a tabular method, and humans learn to play it by looking at the game screen, so we believe that Double-DQN, a classic RL algorithm, is suitable for training our agent. In addition, playing Snake resembles a walking or climbing process, and we note that PPO performs well on such problems by restricting the variance during training. For this reason we want to compare Double-DQN with PPO in our environment, and we train our agent with both algorithms. So far we have results for the current environment: Double-DQN is more stable, but PPO reaches a higher peak performance.
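As a rough illustration of the two algorithms compared in the abstract, the sketch below computes the one-step Double-DQN target and the PPO clipped surrogate objective for a single transition. It is not the authors' implementation: the function names, the four-action layout, and the default values for `gamma` and `clip_eps` are assumptions chosen for the example, and the Q-values are passed in as plain lists rather than produced by a network.

```python
def double_dqn_target(reward, done, q_online_next, q_target_next, gamma=0.99):
    """One-step Double-DQN target for a single transition.

    q_online_next / q_target_next are the Q-values of the next state
    under the online and target networks (hypothetical plain lists here).
    """
    if done:
        # Terminal transition (e.g. the snake hit a wall or itself): no bootstrap.
        return reward
    # The online network selects the greedy action ...
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    # ... and the target network evaluates it, which reduces overestimation.
    return reward + gamma * q_target_next[best_action]


def ppo_clipped_objective(ratio, advantage, clip_eps=0.2):
    """PPO clipped surrogate for one sample.

    ratio is pi_new(a|s) / pi_old(a|s); advantage is the advantage estimate.
    """
    clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    # Taking the minimum removes any incentive to push the ratio outside the clip range.
    return min(ratio * advantage, clipped_ratio * advantage)


if __name__ == "__main__":
    # Assumed action order: up, down, left, right.
    q_online = [0.2, 0.9, 0.1, 0.4]
    q_target = [0.5, 0.3, 0.8, 0.1]
    print(double_dqn_target(1.0, False, q_online, q_target, gamma=0.9))
    print(ppo_clipped_objective(1.5, 2.0))
```

In the Double-DQN call, the online values pick action index 1, so the target uses the target network's value for that action (0.3) rather than its own maximum (0.8). In the PPO call, a ratio of 1.5 is clipped to 1.2 before being multiplied by the advantage.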
