Solving Atari Pong - Reinforcement Learning

Using Open AI Gym library to solve Atari Pong game by implementing Reinforcement Learning techniques

Open Table of contents

Atari Pong Environment
Algorithm
Screenshots

Atari Pong Environment

Pong is a two-dimensional sports game that simulates table tennis. The player controls an in-game paddle by moving it vertically across the left or right side of the screen.

Players use the paddles to hit a ball back and forth. The goal is for each player to reach eleven points before the opponent;points are earned when one fails to return the ball to the other

Actions - There are 6 actions noop, fire, right, left, rightfire, leftfire
Reward - 10 points for passing the ball to the opponent, minus points for missing the ball
Goal – Make the agent play pong

Algorithm

The model is trained using the Actor Critic method. Actor critic: Actor critic combines both value-based and policy-based algorithms, here we have both networks actor and critic the actor-network is used to perform an action for a given state it chooses the best action. The critic network calculates the Q value for that state action and helps the actor to improve the policy for selecting the action Screenshot-2024-03-30-at-6.11.03PM81dcf73860cc3da8.png

Screenshots

Final trained model playing the pong game: