Commit 9e60f01
[doc,algo] feat: Rollout Correction - Fix Metrics, Add Documentation, and Add Batch Normalization (verl-project#4070)
## Overview
This PR fixes bugs, refactors configuration for semantic clarity, and
adds batch normalization support to the rollout correction
implementation introduced in PR verl-project#3984.
---
## Bug Fixes
### 1. Metrics Computation Running in Wrong Mode 1 parent 4ac02e6 commit 9e60f01
File tree
17 files changed
+988
-700
lines changed- docs
- advance
- algo
- examples/rollout_correction
- recipe/dapo
- verl
- trainer
- config
- algorithm
- ppo
- workers/actor
17 files changed
+988
-700
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
166 | 166 | | |
167 | 167 | | |
168 | 168 | | |
169 | | - | |
| 169 | + | |
170 | 170 | | |
171 | 171 | | |
172 | 172 | | |
173 | | - | |
| 173 | + | |
174 | 174 | | |
175 | 175 | | |
176 | 176 | | |
| |||
0 commit comments