1 parent dc243f9 commit d4b8619
Report.md
@@ -44,8 +44,8 @@ WEIGHT_DECAY = 0 # L2 weight decay
```
### 4. Training Scores
-The agent was able to solve the environment by achieving score of 30.0 over 100 consecutive episodes after 231 episodes.
-
+The agent was able to solve the environment by achieving score of 30.0 over 100 consecutive episodes after about 1000 episodes.
+
### 5. Training Output
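The episode count changed in this commit depends on how "solved" is measured, so a small sketch of the criterion may help when re-checking the number. It assumes the rule is "mean score over the last 100 consecutive episodes reaches 30.0" and reports the episode at which the window ends. The function name `first_solved_episode` and its parameters are hypothetical and are not taken from the repository.

```python
from collections import deque


def first_solved_episode(scores, target=30.0, window=100):
    """Return the 1-based episode at which the mean of the most recent
    `window` scores first reaches `target`, or None if it never does.

    Assumption: the reported episode is the one that closes the window.
    Some reports subtract `window` from this value instead.
    """
    recent = deque(maxlen=window)  # keeps only the last `window` scores
    for episode, score in enumerate(scores, start=1):
        recent.append(score)
        # Only test the criterion once a full window is available.
        if len(recent) == window and sum(recent) / window >= target:
            return episode
    return None
```

Running this over the per-episode scores logged during training would show whether a figure such as "about 1000 episodes" counts from the end of the 100-episode window or from its start.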