Skip to content

Implementation of Reinforcement Learning Algorithms(Q-Learning and DeepQLearning) on Open-AI Gym Environments

Notifications You must be signed in to change notification settings

haiderzm/ReinforcementLearningWithGym

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Q-Learning

Q-learning is a model-free reinforcement learning algorithm to learn the value of an action in a particular state. It does not require a model of the environment (hence "model-free"), and it can handle problems with stochastic transitions and rewards without requiring adaptations.

In Q learning

We replace iterative format of bellman equation

alt text

And use it in form of expectation

alt text

Equation for training of the RL agent

alt text

Q-Learning using Q Table

alt text



Deep Q-Learning

In Deep Q-learning, we use a neural network to approximate the Q-value function. The state is given as the input and the Q-value of all possible actions is generated as the output.
Deep Q Algorithm

alt text

Bellman Equation
In deep Q-learning, we use a neural network to approximate the Q-value function. The state is given as the input and the Q-value of all possible actions is generated as the output.
Deep Q-Learning using Neural Networks

alt text

Results

Result of Q-learning on MountainCar-v0 Gym Environment
mountain-car.mp4
Result of Q-learning on CartPole Gym Environment
CartPole_CartPole.mp4
Result of Deep Q-learning on Breakout Atari Gym Environment
breakout.mp4

About

Implementation of Reinforcement Learning Algorithms(Q-Learning and DeepQLearning) on Open-AI Gym Environments

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published