This repository implements the classic deep reinforcement learning algorithms:
- Deep Q-Network (DQN)
- Double DQN (DDQN)
- Dueling Network Architecture
- Deep Deterministic Policy Gradient (DDPG)
- Normalized Advantage Function (NAF)
- Asynchronous Advantage Actor-Critic (A3C)
- Trust Region Policy Optimization (TRPO)
- Proximal Policy Optimization (PPO)
- Actor Critic using Kronecker-Factored Trust Region (ACKTR)
I have already implemented five of these algorithms and will implement the rest, keeping them updated in the future.
In this repository, actions are sampled from a Beta distribution, which can improve performance in continuous control tasks; see [2], The Beta Policy for Continuous Control Reinforcement Learning.
However, I have not been able to back-propagate through the Beta distribution's entropy. If you have a solution, please contact me.
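Below is a minimal sketch of a Beta policy head, assuming PyTorch (the layer sizes, the `scale_action` helper, and the action bounds are illustrative assumptions, not this repository's actual code). It samples actions in (0, 1) and rescales them to the environment's range; note that `torch.distributions.Beta` provides a differentiable `entropy()`, which may be relevant to the entropy back-propagation issue mentioned above.

```python
# Illustrative sketch of a Beta policy head (assumes PyTorch; sizes are arbitrary).
import torch
import torch.nn as nn
from torch.distributions import Beta


class BetaPolicy(nn.Module):
    def __init__(self, obs_dim, act_dim, hidden=64):
        super().__init__()
        self.body = nn.Sequential(nn.Linear(obs_dim, hidden), nn.Tanh())
        self.alpha_head = nn.Linear(hidden, act_dim)
        self.beta_head = nn.Linear(hidden, act_dim)

    def forward(self, obs):
        h = self.body(obs)
        # softplus(.) + 1 keeps alpha, beta > 1 so the Beta density stays unimodal.
        alpha = nn.functional.softplus(self.alpha_head(h)) + 1.0
        beta = nn.functional.softplus(self.beta_head(h)) + 1.0
        return Beta(alpha, beta)


def scale_action(a01, low, high):
    # Map a sample in (0, 1) to the environment's action range [low, high].
    return low + (high - low) * a01


# Usage: sample an action and compute the entropy; entropy() is differentiable,
# so gradients flow back through alpha and beta into the network parameters.
policy = BetaPolicy(obs_dim=3, act_dim=1)
dist = policy(torch.randn(1, 3))
action = scale_action(dist.sample(), low=-2.0, high=2.0)
entropy = dist.entropy().sum(dim=-1)
```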
Instructions are provided for each algorithm in its own directory. In the future, I will revise them to use a common format.
[1] A Brief Survey of Deep Reinforcement Learning
[2] The Beta Policy for Continuous Control Reinforcement Learning
[3] Playing Atari with Deep Reinforcement Learning
[4] Deep Reinforcement Learning with Double Q-learning
[5] Dueling Network Architectures for Deep Reinforcement Learning
[6] Continuous control with deep reinforcement learning
[7] Continuous Deep Q-Learning with Model-based Acceleration
[8] Asynchronous Methods for Deep Reinforcement Learning
[9] Trust Region Policy Optimization
[10] Proximal Policy Optimization Algorithms
[11] Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation