[trainer, fsdp, vllm, recipe] feat: one step off async training recipe #2231
Merged: eric-haibin-lin merged 68 commits into verl-project:main from imh966:recipe/async_training on Jul 17, 2025
Commits
391a1fc  one step off async training recipe (imh966)
338c2a9  simplify trainer (imh966)
071ddc2  fix resource pool config and simplify the trainer yaml file (imh966)
71569f5  separate actor and rollout class (imh966)
78ef6f2  update name of recipe and add license (imh966)
e274747  Merge branch 'volcengine:main' into recipe/async_training (ArronHZG)
b9a9618  one_step_off_policy megatron (ArronHZG)
8dc0034  use fsdp2 and clear useless code (lalala-2)
5ea1c00  fix config (lalala-2)
69d58c4  fix (lalala-2)
6cdaf2e  one_step_off_policy dapo_7b 2 node (ArronHZG)
36ed4f6  recipe/one_step_off_policy (ArronHZG)
a1966ef  opt gen_next_batch (lalala-2)
40df88f  Merge branch 'recipe/async_training_megatron' of https://github.com/i… (lalala-2)
5d52efa  4_12_megatron (ArronHZG)
59f6be9  4_12_megatron (ArronHZG)
dfabe15  megatron config (ArronHZG)
40e8816  megatron config (ArronHZG)
fc76d4f  fix megatron (lalala-2)
dedc436  Merge branch 'recipe/async_training_megatron' of https://github.com/i… (lalala-2)
344581f  megatron config (ArronHZG)
0091f52  megatron config (ArronHZG)
283f7fd  megatron config (ArronHZG)
6871a29  cross epoch (ArronHZG)
1b96322  ruff format (ArronHZG)
652f91f  # Copyright 2025 Meituan Ltd. and/or its affiliates (ArronHZG)
b36918c  add Copyright (ArronHZG)
84b712d  optim sh (ArronHZG)
4685463  python3 (ArronHZG)
7f3d1db  update recipe (ArronHZG)
592f393  add doc (ArronHZG)
2fb1cd9  Merge branch 'volcengine:main' into recipe/async_training (ArronHZG)
dff8f56  update date (ArronHZG)
648cb44  update date (ArronHZG)
c2395f7  config (ArronHZG)
165c1b2  Revert "fix config" (lalala-2)
aaa356e  fix error (lalala-2)
03f1dec  update is_last_step (ArronHZG)
e2007ef  one_step_off_policy (ArronHZG)
204d624  update readme (ArronHZG)
19fac39  e2e_one_step_off_policy (ArronHZG)
c1b86ec  add e2e test for one_step_off_policy (ArronHZG)
492ff98  add e2e test for one_step_off_policy (ArronHZG)
1e7aa47  add e2e test for one_step_off_policy (ArronHZG)
8ab0834  add e2e test for one_step_off_policy (ArronHZG)
22dc212  format (ArronHZG)
dcbfb0c  ruff check (ArronHZG)
1e8cee3  add megatron test (ArronHZG)
27c9816  Merge pull request #2 from imh966/recipe/async_training_e2e_test (ArronHZG)
727320b  Merge branch 'volcengine:main' into recipe/async_training (ArronHZG)
8727916  rm spmd (ArronHZG)
42ddeed  CI check fix some error (ArronHZG)
5ffd8b4  merge main (ArronHZG)
1c9b6eb  change author (ArronHZG)
8772b14  update e2e_one_step_off_policy CI rule (ArronHZG)
c8468e6  update comments (ArronHZG)
d8dd8b0  Merge branch 'volcengine:main' into recipe/async_training (ArronHZG)
659b108  update ruff (ArronHZG)
9b5646a  Fix pre-commit error: sort imports in async_main_ppo.py (openhands-agent)
1ed49c7  rollout.nnodes (ArronHZG)
754cfae  update code and doc by comments (ArronHZG)
8df1c1b  ruff (ArronHZG)
1837fc7  update code and doc by comments (ArronHZG)
c56467f  update docs (ArronHZG)
174d94a  Merge branch 'recipe/async_training' of https://github.com/imh966/ver… (ArronHZG)
e3db358  Merge branch 'recipe/async_training' into recipe/async_training_rollo… (ArronHZG)
8e5b714  Merge pull request #3 from imh966/recipe/async_training_rollout_nodes (ArronHZG)
40b2ebe  Merge branch 'volcengine:main' into recipe/async_training (ArronHZG)
opt gen_next_batch
commit a1966ef4eff278521a65ff2648fe8bb366e6b7d5
Can we try to avoid using nested function definitions? For instance, move this to `def _create_continuous_iterator(self)` and `def _async_gen_next_batch(self, continuous_iterator)`.
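To make the suggestion concrete, here is a minimal sketch of what the two methods could look like once lifted out of the enclosing function. Only the two method names and the `continuous_iterator` parameter come from the comment above; the `TrainerSketch` class, its `dataloader` and `total_epochs` attributes, the `_generate` placeholder, and the use of a thread pool are assumptions made for illustration and are not the code in this PR.

```python
from concurrent.futures import ThreadPoolExecutor


class TrainerSketch:
    """Hypothetical stand-in for the trainer; not the code from this PR."""

    def __init__(self, dataloader, total_epochs):
        self.dataloader = dataloader
        self.total_epochs = total_epochs
        # Assumption: a single worker thread stands in for the async rollout.
        self._executor = ThreadPoolExecutor(max_workers=1)

    def _create_continuous_iterator(self):
        """Yield (epoch, batch) pairs across all epochs as one flat stream."""
        for epoch in range(self.total_epochs):
            for batch in self.dataloader:
                yield epoch, batch

    def _async_gen_next_batch(self, continuous_iterator):
        """Start generation for the next batch; return None when exhausted."""
        try:
            epoch, batch = next(continuous_iterator)
        except StopIteration:
            return None
        return self._executor.submit(self._generate, epoch, batch)

    def _generate(self, epoch, batch):
        # Placeholder for the real rollout step (hypothetical).
        return epoch, [x * 2 for x in batch]

    def fit(self):
        results = []
        continuous_iterator = self._create_continuous_iterator()
        future = self._async_gen_next_batch(continuous_iterator)
        while future is not None:
            current = future.result()
            # Kick off the next batch before consuming the current one,
            # so generation runs one step ahead of the consumer.
            future = self._async_gen_next_batch(continuous_iterator)
            results.append(current)  # stands in for the training step
        self._executor.shutdown()
        return results
```

With methods instead of closures, each piece can be called and tested on its own, for example `TrainerSketch([[1, 2], [3]], total_epochs=2).fit()`, and the state the helpers depend on is visible on `self` rather than captured implicitly from the enclosing scope.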