Commit 9c24bd1

Policy Gradient refactor
1 parent 115dd42 commit 9c24bd1

1 file changed: +12 -6 lines changed

PolicyGradient/README.md

Lines changed: 12 additions & 6 deletions
@@ -48,9 +48,15 @@
 
 ### Exercises
 
-- Implement REINFORCE with Baseline (Exercise, [Solution](CliffWalk REINFORCE with Baseline Solution.ipynb))
-- Implement Actor Critic with Baseline (Exercise, [Solution](CliffWalk Actor Critic Solution.ipynb))
-- Implement Actor Critic with Baseline for Continuous Action Space (Exercise, [Solution](Continuous MountainCar Actor Critic Solution.ipynb))
-- Implement Deterministic Policy Gradients for Continuous Action Spaces (WIP)
-- Implement Deep Deterministic Policy Gradients (WIP)
-- Implement Asynchronous Advantage Actor Critic (A3C) (WIP)
+- REINFORCE with Baseline
+  - Exercise
+  - [Solution](CliffWalk REINFORCE with Baseline Solution.ipynb)
+- Actor Critic with Baseline
+  - Exercise
+  - [Solution](CliffWalk Actor Critic Solution.ipynb)
+- Actor Critic with Baseline for Continuous Action Spaces
+  - Exercise
+  - [Solution](Continuous MountainCar Actor Critic Solution.ipynb)
+- Deterministic Policy Gradients for Continuous Action Spaces (WIP)
+- Deep Deterministic Policy Gradients (WIP)
+- Asynchronous Advantage Actor Critic (A3C) (WIP)
