We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent f637c42 commit 1f2e2ebCopy full SHA for 1f2e2eb
PolicyGradient/README.md
@@ -53,10 +53,10 @@
53
- [Solution](CliffWalk%20REINFORCE%20with%20Baseline%20Solution.ipynb)
54
- Actor-Critic with Baseline
55
- Exercise
56
- - [Solution](CliffWalk%20Actor-Critic%20Solution.ipynb)
+ - [Solution](CliffWalk%20Actor%20Critic%20Solution.ipynb)
57
- Actor-Critic with Baseline for Continuous Action Spaces
58
59
- - [Solution](Continuous%20MountainCar%20Actor-Critic%20Solution.ipynb)
+ - [Solution](Continuous%20MountainCar%20Actor%20Critic%20Solution.ipynb)
60
- Deterministic Policy Gradients for Continuous Action Spaces (WIP)
61
- Deep Deterministic Policy Gradients (WIP)
62
- Asynchronous Advantage Actor-Critic (A3C)
0 commit comments