We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 240c989 commit 7aa7069Copy full SHA for 7aa7069
week09_policy_II/README.md
@@ -21,4 +21,4 @@ While you already know algorithms that will work with continuously many actions,
21
* Deterministic policy gradient - [article](https://arxiv.org/pdf/1512.07679.pdf), [post+code](https://yanpanlau.github.io/2016/10/11/Torcs-Keras.html)
22
* Stochastic value gradient - [article](https://arxiv.org/abs/1510.09142)
23
* Embedding large discrete action spaces for RL - [article](https://arxiv.org/pdf/1512.07679.pdf)
24
- * Lecture by A. Seleznev, 5vision (russian) - [video](www.youtube.com/watch?v=j1L2FnanXPo&t=119m45s)
+ * Lecture by A. Seleznev, 5vision (russian) - [video](https://www.youtube.com/watch?v=j1L2FnanXPo&t=119m45s)
0 commit comments