Skip to content

Commit 6cfa6b3

Browse files
authored
Supported papers (vwxyzjn#291)
* update publication list * cleanup * highlight links
1 parent 42d21bd commit 6cfa6b3

File tree

5 files changed

+35
-33
lines changed

5 files changed

+35
-33
lines changed

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -153,8 +153,8 @@ We have a [Discord Community](https://discord.gg/D6RCjA6sVT) for support. Feel f
153153
If you use CleanRL in your work, please cite our technical [paper](https://arxiv.org/abs/2111.08819):
154154

155155
```bibtex
156-
@article{JMLR:v23:21-1342,
157-
author = {Shengyi Huang and Rousslan Fernand Julien Dossa and Chang Ye and Jeff Braga and Dipam Chakraborty and Kinal Mehta and João G.M. Araújo},
156+
@article{huang2022cleanrl,
157+
author = {Shengyi Huang and Rousslan Fernand Julien Dossa and Chang Ye and Jeff Braga and Dipam Chakraborty and Kinal Mehta and João G.M. Araújo},
158158
title = {CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms},
159159
journal = {Journal of Machine Learning Research},
160160
year = {2022},
Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,30 @@
1+
# CleanRL-supported Papers / Projects
2+
3+
CleanRL has become an increasingly popular deep reinforcement learning library, especially among practitioners who prefer more customizable code. Since its debut in July 2019, CleanRL has supported many open source projects and publications. Below are some CleanRL-supported projects and publications.
4+
5+
**Feel free to edit this list if your project or paper has used CleanRL.**
6+
7+
## Publications
8+
9+
* Centa, Matheus, and Philippe Preux. "Soft Action Priors: Towards Robust Policy Transfer." arXiv preprint arXiv:2209.09882 (2022). [https://arxiv.org/pdf/2209.09882.pdf](https://arxiv.org/pdf/2209.09882.pdf)
10+
11+
* Weng, Jiayi, Min Lin, Shengyi Huang, Bo Liu, Denys Makoviichuk, Viktor Makoviychuk, Zichen Liu et al. "Envpool: A highly parallel reinforcement learning environment execution engine." In Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track. [https://openreview.net/forum?id=BubxnHpuMbG](https://openreview.net/forum?id=BubxnHpuMbG)
12+
13+
* Huang, Shengyi, Rousslan Fernand Julien Dossa, Antonin Raffin, Anssi Kanervisto, and Weixun Wang. "The 37 Implementation Details of Proximal Policy Optimization." International Conference on Learning Representations 2022 Blog Post Track, [https://iclr-blog-track.github.io/2022/03/25/ppo-implementation-details/](https://iclr-blog-track.github.io/2022/03/25/ppo-implementation-details/)
14+
15+
* Huang, Shengyi, and Santiago Ontañón. "A closer look at invalid action masking in policy gradient algorithms." The International FLAIRS Conference Proceedings, 35. [https://](https://)journals.flvc.org/FLAIRS/article/view/130584
16+
17+
* Schmidt, Dominik, and Thomas Schmied. "Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari." Deep Reinforcement Learning Workshop at the 35th Conference on Neural Information Processing Systems, [https://arxiv.org/abs/2111.10247](https://arxiv.org/abs/2111.10247)
18+
19+
20+
* Dossa, Rousslan Fernand Julien, Shengyi Huang, Santiago Ontañón, and Takashi Matsubara. "An Empirical Investigation of Early Stopping Optimizations in Proximal Policy Optimization." IEEE Access 9 (2021): 117981-117992. [https://ieeexplore.ieee.org/abstract/document/9520424](https://ieeexplore.ieee.org/abstract/document/9520424)
21+
22+
* Huang, Shengyi, Santiago Ontañón, Chris Bamford, and Lukasz Grela. "Gym-µRTS: Toward Affordable Full Game Real-time Strategy Games Research with Deep Reinforcement Learning." In 2021 IEEE Conference on Games (CoG), pp. 1-8. IEEE, 2021. [https://ieeexplore.ieee.org/abstract/document/9619076](https://ieeexplore.ieee.org/abstract/document/9619076)
23+
24+
* Huang, Shengyi, and Santiago Ontañón. "Measuring Generalization of Deep Reinforcement Learning Applied to Real-time Strategy Games", AAAI 2021 Reinforcement Learning in Games Workshop, http://aaai-rlg.mlanctot.info/papers/AAAI21-RLG_paper_33.pdf
25+
26+
* Bamford, Chris, Huang, Shengyi, and Lucas, Simon, "Griddly: A platform for AI research in games", *AAAI 2021 Reinforcement Learning in Games Workshop*, [https://arxiv.org/abs/2011.](https://arxiv.org/abs/2011.)06363
27+
28+
* Huang, Shengyi, and Santiago Ontañón. "Action guidance: Getting the best of sparse rewards and shaped rewards for real-time strategy games." AIIDE Workshop on Artificial Intelligence for Strategy Games, [https://arxiv.org/abs/2010.03956](https://arxiv.org/abs/2010.03956)
29+
30+
* Huang, Shengyi, and Santiago Ontañón. "Comparing Observation and Action Representations for Deep Reinforcement Learning in $\mu $ RTS." AIIDE Workshop on Artificial Intelligence for Strategy Gamee, October 2019 [https://arxiv.org/abs/1910.12134](https://arxiv.org/abs/1910.12134)

docs/index.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -36,8 +36,8 @@ CleanRL only contains implementations of **online** deep reinforcement learning
3636
If you use CleanRL in your work, please cite our technical [paper](https://www.jmlr.org/papers/volume23/21-1342/21-1342.pdf):
3737

3838
```bibtex
39-
@article{JMLR:v23:21-1342,
40-
author = {Shengyi Huang and Rousslan Fernand Julien Dossa and Chang Ye and Jeff Braga and Dipam Chakraborty and Kinal Mehta and João G.M. Araújo},
39+
@article{huang2022cleanrl,
40+
author = {Shengyi Huang and Rousslan Fernand Julien Dossa and Chang Ye and Jeff Braga and Dipam Chakraborty and Kinal Mehta and João G.M. Araújo},
4141
title = {CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms},
4242
journal = {Journal of Machine Learning Research},
4343
year = {2022},

docs/made-with-cleanrl.md

Lines changed: 0 additions & 28 deletions
This file was deleted.

mkdocs.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -93,7 +93,7 @@ nav:
9393
- advanced/resume-training.md
9494
- Community:
9595
- contribution.md
96-
- made-with-cleanrl.md
96+
- cleanrl-supported-papers-projects.md
9797
- Cloud Integration:
9898
- cloud/installation.md
9999
- cloud/submit-experiments.md

0 commit comments

Comments
 (0)