Large Model High-Concurrency Deployment Investigate and Discuss #12113
xueshuai0922
started this conversation in
General
Replies: 1 comment 1 reply
-
anyone have a idea which will solve this problem? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Large Model High-Concurrency Deployment Investigate and Discuss
Overview
Prerequisites:
Key Points:
VRAM Requirement Analysis:
Hardware Configuration Recommendations:
Deployment Solutions:
Comparison of Inference Tools:
LMDeploy vs vLLM:
Beta Was this translation helpful? Give feedback.
All reactions