Implementation of various reinforcement learning algorithms following OpenAI Spinning Up guide.
- Vanilla Policy Gradient (VPG)
- Proximal Policy Optimization (PPO)
- Twin Delayed Deep Deterministic Policy Gradient (TD3)
Here is a list of environments from OpenAI's gym toolkit solved with the above algorithms (link to video of solution in parenthesis).