Skip to content

server based multi-turn training load balance #2520

@jQizhang

Description

@jQizhang

Hi, I have a question.

In verl/experimental/agent_loop/agent_loop.py, a heap data structure is used to implement load balancing. When a request is sent to a server, the corresponding server's request_num is +1. However, when the request is completed, the server's request_num is not -1.
Wouldn't this prevent it from reflecting the real-time number of requests being processed?

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions