[rollout] feat: add async llm perf script by wuxibin89 · Pull Request #1930 · verl-project/verl

wuxibin89 · 2025-06-09T14:04:52Z

Checklist Before Starting

Search for similar PR(s).

What does this PR do?

Add perf scripts comparing AsyncLLM backend:

RayDistributedExecutor: default executor with compiled graph
ExternalRayDistributedExecutor: external executor with remote call

High-Level Design

Demonstrate the high-level design if this PR is complex.

Specific Changes

List the specific changes.

API

Demonstrate how the API changes if any.

Usage Example

Provide usage example(s) for easier usage.

# Add code snippet or script demonstrating how to use this

Test

For changes that can not be tested by CI (e.g., algorithm implementation, new model support), validate by experiment(s) and show results like training curve plots, evaluatuion results, etc.

Additional Info.

Issue Number: Fixes issue # or discussion # if any.
Training: [Note which backend this PR will affect: FSDP, Megatron, both, or none]
Inference: [Note which backend this PR will affect: vLLM, SGLang, both, or none]

Checklist Before Submitting

Read the Contribute Guide.
Apply pre-commit checks.
Add [BREAKING] to the PR title if it breaks any API.
Update the documentation about your changes in the docs.
New CI unit test(s) are added to cover the code path.
Rely on existing unit tests on CI that covers the code path.

eric-haibin-lin · 2025-06-09T22:30:01Z

tests/workers/rollout/perf/vllm_async_rollout.py

+        extra_headers = chat_complete_request.pop("extra_headers")
+        timeout = aiohttp.ClientTimeout(total=None)
+        session = aiohttp.ClientSession(timeout=timeout)
+        async with session.post(


i guess the difference is that the vllm sync mode in verl does not require aiohttp.post?

Let me add a perf test for sync mode.

Let me add a perf test for sync mode.

thumbs up

waleko · 2025-06-10T16:14:04Z

@wuxibin89 Hi, have you published the results of running these scripts? It would be very insightful 🙏

eric-haibin-lin · 2025-06-10T20:06:16Z

@wuxibin89 Hi, have you published the results of running these scripts? It would be very insightful 🙏

Here's some reference number from xibin:

bsz=128, n=16 on H20 GPUs

### Checklist Before Starting - [ ] Search for similar PR(s). ### What does this PR do? Add perf scripts comparing AsyncLLM backend: - RayDistributedExecutor: default executor with compiled graph - ExternalRayDistributedExecutor: external executor with remote call ### High-Level Design > Demonstrate the high-level design if this PR is complex. ### Specific Changes > List the specific changes. ### API > Demonstrate how the API changes if any. ### Usage Example > Provide usage example(s) for easier usage. ```python # Add code snippet or script demonstrating how to use this ``` ### Test > For changes that can not be tested by CI (e.g., algorithm implementation, new model support), validate by experiment(s) and show results like training curve plots, evaluatuion results, etc. ### Additional Info. - **Issue Number**: Fixes issue # or discussion # if any. - **Training**: [Note which backend this PR will affect: FSDP, Megatron, both, or none] - **Inference**: [Note which backend this PR will affect: vLLM, SGLang, both, or none] ### Checklist Before Submitting - [ ] Read the [Contribute Guide](https://github.com/volcengine/verl?tab=readme-ov-file#contribution-guide). - [ ] Apply [pre-commit checks](https://github.com/volcengine/verl?tab=readme-ov-file#code-linting-and-formatting). - [ ] Add `[BREAKING]` to the PR title if it breaks any API. - [ ] Update the documentation about your changes in the [docs](https://github.com/volcengine/verl/tree/main/docs). - [ ] New CI unit test(s) are added to cover the code path. - [ ] Rely on existing unit tests on CI that covers the code path.

[rollout] perf: add async llm perf script

d5db4a6

wuxibin89 requested review from chenhaiq, eric-haibin-lin and vermouth1992 June 9, 2025 14:04

eric-haibin-lin reviewed Jun 9, 2025

View reviewed changes

vermouth1992 changed the title ~~[rollout] perf: add async llm perf script~~ [rollout] feat: add async llm perf script Jun 10, 2025

vermouth1992 approved these changes Jun 10, 2025

View reviewed changes

vermouth1992 merged commit 1e1645d into main Jun 10, 2025
38 of 39 checks passed

vermouth1992 deleted the wuxibin/vllm_async_perf branch June 10, 2025 06:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[rollout] feat: add async llm perf script#1930

[rollout] feat: add async llm perf script#1930
vermouth1992 merged 1 commit intomainfrom
wuxibin/vllm_async_perf

wuxibin89 commented Jun 9, 2025

Uh oh!

eric-haibin-lin Jun 9, 2025

Uh oh!

wuxibin89 Jun 10, 2025

Uh oh!

litianjian Jun 10, 2025

Uh oh!

Uh oh!

waleko commented Jun 10, 2025

Uh oh!

eric-haibin-lin commented Jun 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

wuxibin89 commented Jun 9, 2025

Checklist Before Starting

What does this PR do?

High-Level Design

Specific Changes

API

Usage Example

Test

Additional Info.

Checklist Before Submitting

Uh oh!

eric-haibin-lin Jun 9, 2025

Choose a reason for hiding this comment

Uh oh!

wuxibin89 Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

litianjian Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

waleko commented Jun 10, 2025

Uh oh!

eric-haibin-lin commented Jun 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants