-
Notifications
You must be signed in to change notification settings - Fork 6.1k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[LLM APIs] Fast follow ups for 2.44 (1/N) (#51042)
Some of these changes came from bug bash and dogfooding: - [x] Rename to VLLMService to VLLMServer - [x] Remove the extra name space hierarchy for most common import path to make things more consistent with `data.llm` - [x] Some other inconsitency stuff - [x] Use vLLM everywhere (instead of VLLM). Most of these changes should happen on serve side. - [x] in ray.data.llm use model_source in the vllm_config (model_id is a serve only concept as it refers to the model name available to the model discovery layer) from ray.serve.llm - [ ] support vllm v1 [in followup] - [ ] allow a single deployment to llm router (not force people to pass a list with one item) [in follow up] - [x] Update the ray serve docs structure to be more flat based on the dogfooding feedback. serve llm docs now look more flat and consistent with serve docs. Serving LLMs is a single page on the side bar and the apis are a sub-header under the ray serve apis page. Overview page <img width="1446" alt="image" src="https://github.com/user-attachments/assets/db850f35-3a3d-46e1-9892-b3cd17681b98" /> API page <img width="1433" alt="image" src="https://github.com/user-attachments/assets/03a669d1-12c2-46c1-9616-484ea95f7082" /> --------- Signed-off-by: Kourosh Hakhamaneshi <[email protected]>
- Loading branch information
1 parent
c6b06e1
commit e151dc2
Showing
39 changed files
with
482 additions
and
478 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file was deleted.
Oops, something went wrong.
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.