
Commit d28e044

Add three papers
Weight Normalization : A Simple Reparameterization to Accelerate Training of Deep Neural Networks
Continuous Deep Q-Learning with Model-based Acceleration
Deep Reinforcement Learning from Self-Play in Imperfect-Information Games
1 parent 75603b7 commit d28e044

1 file changed: +3 -0 lines changed

README.md

Lines changed: 3 additions & 0 deletions
@@ -57,6 +57,9 @@ If you want to inform the maintainer of a new paper, feel free to contact [@mooo
 - T. Zahavy, N. Ben Zrihem, and S. Mannor, **Graying the black box: Understanding DQNs**, arXiv, 2016. [arXiv](http://arxiv.org/abs/1602.02658)
 - J. N. Foerster, Y. M. Assael, N. de Freitas, and S. Whiteson, **Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks**, arXiv, 2016. [arXiv](http://arxiv.org/abs/1602.02672)
 - I. Osband, C. Blundell, A. Pritzel, and B. Van Roy, **Deep Exploration via Bootstrapped DQN**, arXiv, 2016. [arXiv](http://arxiv.org/abs/1602.04621)
+- T. Salimans and D. P. Kingma, **Weight Normalization : A Simple Reparameterization to Accelerate Training of Deep Neural Networks**, arXiv, 2016. [arXiv](http://arxiv.org/abs/1602.07868)
+- S. Gu, T. Lillicrap, I. Sutskever, and S. Levine, **Continuous Deep Q-Learning with Model-based Acceleration**, arXiv, 2016. [arXiv](http://arxiv.org/abs/1603.00748)
+- J. Heinrich and D. Silver, **Deep Reinforcement Learning from Self-Play in Imperfect-Information Games David Silve**, arXiv, 2016. [arXiv](http://arxiv.org/abs/1603.01121)
 
 ### Deep Policy
 