Hi, I have a question.
In verl/experimental/agent_loop/agent_loop.py, a heap is used to implement load balancing across servers. When a request is sent to a server, that server's request_num is incremented by 1. However, when the request completes, the server's request_num is never decremented.
Doesn't this mean request_num tracks the cumulative number of requests sent to each server, rather than the number of requests it is processing in real time?
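
To make the question concrete, here is a minimal sketch of the behavior I expected. This is not the actual verl code: the class name `LeastRequestsBalancer` and the methods `acquire` and `release` are hypothetical, and I am assuming the number of servers is small enough that an O(n) scan on release is acceptable.

```python
import heapq


class LeastRequestsBalancer:
    """Hypothetical sketch of least-in-flight-requests balancing (not verl's code)."""

    def __init__(self, server_ids):
        # Each heap entry is a mutable [request_num, server_id] pair.
        self._heap = [[0, sid] for sid in server_ids]
        heapq.heapify(self._heap)

    def acquire(self):
        # Take the server with the fewest in-flight requests and count the new one.
        entry = heapq.heappop(self._heap)
        entry[0] += 1
        heapq.heappush(self._heap, entry)
        return entry[1]

    def release(self, server_id):
        # Called when a request finishes: decrement that server's count.
        for entry in self._heap:
            if entry[1] == server_id:
                if entry[0] == 0:
                    raise ValueError(f"no in-flight requests on {server_id!r}")
                entry[0] -= 1
                break
        else:
            raise KeyError(server_id)
        # Restore the heap invariant after the in-place change.
        heapq.heapify(self._heap)
```

With two servers "a" and "b", three calls to `acquire()` leave "a" with 2 requests and "b" with 1. If both requests on "a" then finish and `release("a")` is called twice, the next `acquire()` picks "a". Without the decrement it would pick "b", even though "a" is idle.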