QLearning

Implementation of Q-Learning, SARSA and SARSA-λ on a grid-world test-bed

Requirements: pygame, gym, numpy, argparse, matplotlib

Usage:

python2 grid.py [-h] [--alpha ALPHA] [--epsilon EPSILON] [--gamma GAMMA]
           [--episodes EPISODES] [--verbose] [--grid GRID] [--render]
           [--show-policy] [--algo ALGO] --games GAMES [--lam LAM]

ALPHA : Learning rate [Default: 0.5]
EPSILON : Epsilon value for epsilon-greedy exploration [Default: 0.1]
GAMMA : Discount factor [Default: 0.9]
GAMES : Number of games [Default: 1]
--render : Render the game for the last 10% of the episodes
--show-policy : Show the learnt policy at the end
GRID : Grid to use (A/B/C)
ALGO : Algorithm to use (Q/SARSA/SARSAlam)
LAM : Lambda value for SARSAlam [Default: 0.5]
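
As a rough illustration of how ALPHA, EPSILON, GAMMA and LAM enter the three algorithms, here is a minimal tabular sketch. It is not the code in algos.py: the function names, the Q-table layout (a NumPy array indexed by state and action) and the use of accumulating eligibility traces are all assumptions made for illustration.

```python
import numpy as np


def epsilon_greedy(Q, state, epsilon, rng):
    """With probability epsilon pick a random action, otherwise the greedy one."""
    if rng.random_sample() < epsilon:
        return int(rng.randint(Q.shape[1]))
    return int(np.argmax(Q[state]))


def q_learning_update(Q, s, a, r, s_next, alpha=0.5, gamma=0.9, done=False):
    """Q-Learning (off-policy): bootstrap from the best action in s_next."""
    target = r if done else r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (target - Q[s, a])


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.5, gamma=0.9, done=False):
    """SARSA (on-policy): bootstrap from the action actually chosen in s_next."""
    target = r if done else r + gamma * Q[s_next, a_next]
    Q[s, a] += alpha * (target - Q[s, a])


def sarsa_lambda_update(Q, E, s, a, r, s_next, a_next,
                        alpha=0.5, gamma=0.9, lam=0.5, done=False):
    """SARSA-lambda with accumulating traces (an assumed variant).

    E has the same shape as Q and should be reset to zeros at the
    start of every episode.
    """
    target = r if done else r + gamma * Q[s_next, a_next]
    delta = target - Q[s, a]   # TD error for the current step
    E[s, a] += 1.0             # mark the visited state-action pair
    Q += alpha * delta * E     # every traced pair shares the TD error
    E *= gamma * lam           # decay all traces
```

The only difference between the first two updates is the bootstrap term: Q-Learning uses the maximum over next actions, while SARSA uses the next action selected by the epsilon-greedy policy. With LAM set to 0 the SARSA-lambda update reduces to the plain SARSA update.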

Example: To run a game with default settings on grid-A, rendering the environment and showing the learnt policy at the end:

python2 grid.py --episodes 1000 --render --show-policy

Reference: Reinforcement Learning: An Introduction, by Richard S. Sutton and Andrew G. Barto
