-
-
Notifications
You must be signed in to change notification settings - Fork 10.5k
Closed
Labels
feature requestNew feature or requestNew feature or request
Description
Hi vLLM genius @zhuohan123 @WoosukKwon
I find a new project https://github.com/ModelTC/lightllm
After reading their blog, the performance advantage on the 7b model is not very obvious, but the gap is larger on the 65b. We will also do some verification and comparison later. The reason for bringing up this issue is to hope that we may see what the LightLLM does well, so that we can refer to and port similar optimizations to vLLM. Cheers.
Metadata
Metadata
Assignees
Labels
feature requestNew feature or requestNew feature or request