[MM] Add text-only mode for Qwen3-VL #26000

ywang96 · 2025-10-01T01:24:28Z

Purpose

Since this model is performing pretty well on text only tasks we might want to allow people to serve it as a text-only model.

Test Plan

Test Result

Running vllm serve Qwen/Qwen3-VL-235B-A22B-Instruct --limit-mm-per-prompt.image 0 --limit-mm-per-prompt.video 0 --load-format dummy -tp 8 shows the following in the logs

(APIServer pid=6540) INFO 10-01 01:53:40 [registry.py:117] All limits of multimodal modalities supported by the model are set to 0, running in text-only mode.

Confirm vision model weights are not loaded:
Without setting limit:

(Worker_TP4 pid=7041) INFO 10-01 01:54:18 [gpu_model_runner.py:2758] Model loading took 55.4919 GiB and 0.376107 seconds

Without setting limit + DP ViT:

(Worker_TP3 pid=21254) INFO 10-01 02:11:51 [gpu_model_runner.py:2758] Model loading took 56.4331 GiB and 0.335513 seconds

Setting all limits to 0:

(Worker_TP6 pid=17841) INFO 10-01 02:09:06 [gpu_model_runner.py:2758] Model loading took 55.1608 GiB and 0.320821 seconds

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: Roger Wang <[email protected]>

gemini-code-assist

Code Review

This pull request aims to add text-only support for Qwen3-VL by conditionally initializing the visual components. The changes correctly identify the parts of the code that need to be conditional (Qwen3_VisionTransformer initialization, deepstack_input_embeds initialization, and weight loading). However, there is a critical logic error in the condition for initializing the visual model, which inverts the intended behavior. This would cause the model to initialize visual components in text-only mode and skip them in multimodal mode. I've provided a suggestion to fix this. The other related changes are correct, assuming this primary logic is fixed.

vllm/model_executor/models/qwen3_vl.py

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by: Roger Wang <[email protected]>

Signed-off-by: Roger Wang <[email protected]>

Signed-off-by: simon-mo <[email protected]>

ywang96 added 2 commits September 30, 2025 18:21

update

d5540cd

Signed-off-by: Roger Wang <[email protected]>

update

4530461

Signed-off-by: Roger Wang <[email protected]>

ywang96 requested a review from sighingnow as a code owner October 1, 2025 01:24

ywang96 changed the title ~~[MM] Add text-only model for Qwen3-VL~~ [MM] Add text-only mode for Qwen3-VL Oct 1, 2025

mergify bot added the qwen Related to Qwen models label Oct 1, 2025

gemini-code-assist bot reviewed Oct 1, 2025

View reviewed changes

vllm/model_executor/models/qwen3_vl.py Outdated Show resolved Hide resolved

ywang96 and others added 2 commits September 30, 2025 18:26

Update vllm/model_executor/models/qwen3_vl.py

e0d4d89

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by: Roger Wang <[email protected]>

add to moe

ad04a4e

Signed-off-by: Roger Wang <[email protected]>

ywang96 added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 1, 2025

simon-mo approved these changes Oct 1, 2025

View reviewed changes

simon-mo enabled auto-merge (squash) October 1, 2025 02:16

simon-mo added this to the v0.11.0 Cherry Picks milestone Oct 1, 2025

DarkLight1337 approved these changes Oct 1, 2025

View reviewed changes

simon-mo merged commit 66bca9b into vllm-project:main Oct 1, 2025
54 of 56 checks passed

simon-mo pushed a commit that referenced this pull request Oct 1, 2025

[MM] Add text-only mode for Qwen3-VL (#26000)

a1825fe

Signed-off-by: simon-mo <[email protected]>

pdasigi pushed a commit to pdasigi/vllm that referenced this pull request Oct 2, 2025

[MM] Add text-only mode for Qwen3-VL (vllm-project#26000)

35fd946

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[MM] Add text-only mode for Qwen3-VL #26000

[MM] Add text-only mode for Qwen3-VL #26000

ywang96 commented Oct 1, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[MM] Add text-only mode for Qwen3-VL #26000

[MM] Add text-only mode for Qwen3-VL #26000

Conversation

ywang96 commented Oct 1, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ywang96 commented Oct 1, 2025 •

edited by github-actions bot

Loading