Commit 4d7085b

Add four papers
Control of Memory, Active Perception, and Action in Minecraft
Deep Successor Reinforcement Learning
Learning Deep Control Policies for Autonomous Aerial Vehicles with MPC-Guided Policy Search
Model-Free Episodic Control
1 parent 7f7dbf0 commit 4d7085b

1 file changed: +4 -0 lines changed

README.md

Lines changed: 4 additions & 0 deletions
@@ -61,12 +61,15 @@ If you want to inform the maintainer of a new paper, feel free to contact [@mooo
 - S. Gu, T. Lillicrap, I. Sutskever, and S. Levine, **Continuous Deep Q-Learning with Model-based Acceleration**, arXiv, 2016. [arXiv](http://arxiv.org/abs/1603.00748)
 - J. Heinrich and D. Silver, **Deep Reinforcement Learning from Self-Play in Imperfect-Information Games**, arXiv, 2016. [arXiv](http://arxiv.org/abs/1603.01121)
 - T. D. Kulkarni, K. R. Narasimhan, A. Saeedi, and J. B. Tenenbaum, **Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation**, arXiv, 2016. [arXiv](http://arxiv.org/abs/1604.06057)
+- J. Oh, V. Chockalingam, S. Singh, and H. Lee, **Control of Memory, Active Perception, and Action in Minecraft**, ICML, 2016. [arXiv](http://arxiv.org/abs/1605.09128)
+- T. D. Kulkarni, A. Saeedi, S. Gautam, and S. J. Gershman, **Deep Successor Reinforcement Learning**, arXiv, 2016. [arXiv](http://arxiv.org/abs/1606.02396)

 ### Deep Policy

 - S. Levine, C. Finn, T. Darrell, and P. Abbeel, **End-to-End Training of Deep Visuomotor Policies**, arXiv, 2015. [arXiv](http://arxiv.org/abs/1504.00702)
 - partially observed guided policy search
 - J. Schulman, S. Levine, P. Moritz, M. Jordan, and P. Abbeel, **Trust Region Policy Optimization**, ICML, 2015. [pdf](http://jmlr.org/proceedings/papers/v37/schulman15.pdf)
+- T. Zhang, G. Kahn, S. Levine, and P. Abbeel, **Learning Deep Control Policies for Autonomous Aerial Vehicles with MPC-Guided Policy Search**, ICRA, 2016. [arXiv](http://arxiv.org/abs/1509.06791)

 ### Deep Actor-Critic

@@ -98,6 +101,7 @@ If you want to inform the maintainer of a new paper, feel free to contact [@mooo
 - S. Mohamed and D. J. Rezende, **Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning**, arXiv, 2015. [arXiv](http://arxiv.org/abs/1509.08731)
 - Y. Liang, M. C. Machado, E. Talvitie, and M. Bowling, **State of the Art Control of Atari Games Using Shallow Reinforcement Learning**, arXiv, 2015. [arXiv](http://arxiv.org/abs/1512.01563)
 - A. Tamar, S. Levine, and P. Abbeel, **Value Iteration Networks**, arXiv, 2016. [arXiv](http://arxiv.org/abs/1602.02867)
+- C. Blundell, B. Uria, A. Pritzel, Y. Li, A. Ruderman, J. Z. Leibo, J. Rae, D. Wierstra, and D. Hassabis, **Model-Free Episodic Control**, arXiv, 2016. [arXiv](http://arxiv.org/abs/1606.04460)

 ## Talks/Slides