Training an AI agent to play a Snake Game via Deep Reinforcement Learning
Keywords:
Deep reinforcement learning, Snake Game, Autonomous agent, Deep Learning, Experience replayAbstract
Deep Reinforcement Learning (DRL) has become a normally adopted methodology to alter the agents to be told complex management policies in varied video games, after Deep-Mind used this technique to play Atari games. In this paper, we will develop a Deep Reinforcement Learning Model along with Deep Q-Learning Algorithm that will enable our autonomous agent to play the classical snake game. Specifically, we will employ a Deep Neural Network (DNN) trained with a variant of Q-Learning. No rules about the game are mentioned, and initially the agent is provided with no information on what it needs to do. The goal for the system is to figure out the rules and elaborate a method to maximize the score or reward.
References
- G. Tesauro, “Temporal difference learning and TDgammon,” Communications of the ACM, vol. 38, no. 3, pp. 58–68, 1995.
- V. Mnih, K. Kavukcuoglu, D. Silver, A. Graves, I. Antonoglou, D. Wierstra, and M. Riedmiller, “Playing atari with deep reinforcement learning,” ArXiv e-prints, 2013.
- E. A. O. Diallo, A. Sugiyama, and T. Sugawara, “Learning to coordinate with deep reinforcement learning in doubles pong game,” in Proceedings of IEEE International Conference on Machine Learning and Applications (ICMLA), 2017, pp. 14–19.
- S. Yoon and K. J. Kim, “Deep Q networks for visual fighting game AI,” in Proceedings of IEEE Conference on Computational Intelligence and Games (CIG), 2017, pp. 306–308.
- M. Andrychowicz, D. Crow, A. Ray, J. Schneider, R. Fong, P. Welinder, B. McGrew, J. Tobin, P. Abbeel,and W. Zaremba, “Hindsight experience replay,” in Proceedings of Annual Conference on Neural Information Processing Systems, 2017, pp. 5055–5065.
- L.-J. Lin, “Reinforcement learning for robots using neural networks,” Ph.D. dissertation, Pittsburgh, PA, USA, 1992, UMI Order No. GAX93-22750.
- T. Schaul, J. Quan, I. Antonoglou, and D. Silver, “Prioritized experience replay,” Computing Research Repository, vol. abs/1511.05952, 2015.
- M. G. Bellemare, Y. Naddaf, J. Veness, and M. Bowling, “The arcade learning environment: An evaluation platform for general agents,” Journal of Artificial Intelligence Research, vol. 47, pp. 253–279, 2013.
- P. D. Christopher J. C. H. Watkins, “Q-learning,” Machine Learning, vol. 8, no. 3-4, pp. 279–292, 1992.
- M. Roderick, J. MacGlashan, and S. Tellex, “Implementing the deep q-network,” ArXiv e-prints, 2017.
- Ali Jaber Almalki, Pawel Wocjan, “Exploration of Reinforcement Learning to Play Snake Game” in Proceedings of International Conference on Computational Science and Computational Intelligence (CSCI), 2019, pp. 377-381.
Downloads
Published
Issue
Section
License
Copyright (c) IJSRST
This work is licensed under a Creative Commons Attribution 4.0 International License.