[CI/Build] Replace `vllm.entrypoints.openai.api_server` entrypoint with `vllm serve` command #25967
Conversation
Signed-off-by: DarkLight1337 <[email protected]>
LGTM
Code Review
This pull request updates the Dockerfiles to use the `vllm serve` command as the entrypoint, replacing the direct call to the Python module `vllm.entrypoints.openai.api_server`. This change aligns the Docker images with the recommended command-line interface, improving consistency and user experience. The changes are applied correctly and consistently across the CUDA, CPU, and XPU Dockerfiles. The new entrypoint is backward compatible with existing argument-passing conventions. The changes look good and I don't see any issues.
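For illustration only, here is a minimal sketch of the kind of entrypoint change being described; the exact stages, interpreter, and paths in the actual CUDA, CPU, and XPU Dockerfiles are assumptions, not the literal diff:

```dockerfile
# Old style (illustrative): invoke the API server module directly.
# ENTRYPOINT ["python3", "-m", "vllm.entrypoints.openai.api_server"]

# New style: use the vllm CLI. Arguments appended to `docker run <image> ...`
# are still forwarded to the serve command, so existing invocations keep working.
ENTRYPOINT ["vllm", "serve"]
```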
Signed-off-by: DarkLight1337 <[email protected]>
Thanks @DarkLight1337!
For the Docker image to be a drop-in replacement, we need the old flow to work as well.
Oh, I forgot about that.
Signed-off-by: DarkLight1337 <[email protected]>
I have updated the CLI parsing logic to deprecate instead of erroring out when `--model` is passed.
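As a rough sketch of what such a deprecation path can look like (a hypothetical argparse example, not vLLM's actual CLI code; the parser and argument names here are assumptions), the idea is to keep accepting `--model`, emit a deprecation warning, and fall back to using its value as the positional model:

```python
import argparse
import warnings


def parse_serve_args(argv=None):
    # Hypothetical sketch, not vLLM's real parser: `model_tag` stands in for
    # the positional model argument of `vllm serve`.
    parser = argparse.ArgumentParser(prog="vllm serve")
    parser.add_argument("model_tag", nargs="?", default=None,
                        help="Model name or path (positional).")
    parser.add_argument("--model", default=None,
                        help="Deprecated; use the positional argument instead.")
    args = parser.parse_args(argv)

    if args.model is not None:
        # Warn instead of erroring out, so the old Docker flow keeps working.
        warnings.warn(
            "With `vllm serve`, you should provide the model as a "
            "positional argument or in a config file instead of via "
            "the `--model` option. "
            "The `--model` option will be removed in v0.13.",
            DeprecationWarning,
            stacklevel=2,
        )
        if args.model_tag is None:
            args.model_tag = args.model

    return args


if __name__ == "__main__":
    # An old-style invocation still resolves to the same model.
    print(parse_serve_args(["--model", "facebook/opt-125m"]).model_tag)
```

A real implementation would also need to decide what happens when both the positional model and `--model` are provided (for example, error out or prefer the positional argument).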
Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: DarkLight1337 <[email protected]>
@simon-mo can you check again? The Mamba kernels test failure is happening on main as well, so it can be ignored.
Thanks @DarkLight1337!
"With `vllm serve`, you should provide the model as a " | ||
"positional argument or in a config file instead of via " | ||
"the `--model` option.") | ||
"the `--model` option. " | ||
"The `--model` option will be removed in v0.13.") |
Is there any harm in allowing both indefinitely?
I am not sure who removed the ability to use `--model` in the first place. Maybe @mgoin?
Signed-off-by: DarkLight1337 <[email protected]>
Purpose

`vllm serve <model>` works now, without having to add `--model`. So I think it's safe to just change the command directly.

Test Plan

Test Result

Essential Elements of an Effective PR Description Checklist

Update `supported_models.md` and `examples` for a new model.