Skip to content

jaindeepali/Reinforcement-Learning-Algorithms

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Reinforcement Learning Algorithms

This is a collection of Python (numpy + tensorflow) implementations of common RL algorithms.

  • MDP Solutions - Value Iteration, Policy Iteration, Fitted Value iteration through function approximation, Policy Gradient
  • Model-free Solutions - Q-Iteration, Q-Learning, Monte-Carlo Policy iteration, REINFORCE (Vanilla policy gradient), SARSA, n-Step SARSA, SARSA-Lambda, Actor-Critic, Deep Q-Network

About

Python (Numpy + Tensorflow) implementations of common Reinforcement Learning algorithms.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published