Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[MM] Add text-only mode for Qwen3-VL qwen Related to Qwen models
#26000 opened Oct 1, 2025 by ywang96 Loading…
5 tasks
[Deepseek v3.2] Support indexer prefill chunking deepseek Related to DeepSeek models v1
#25999 opened Oct 1, 2025 by heheda12345 Loading…
5 tasks
[NVIDIA] flashinfer TRTLLM attention prefill token limit ready ONLY add when PR is ready to merge/full CI is needed
#25998 opened Oct 1, 2025 by jasonlizhengjian Loading…
5 tasks
[Bugfix] Fix __syncwarp on ROCM deepseek Related to DeepSeek models ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm
#25996 opened Sep 30, 2025 by zhewenl Loading… v0.11.0 Cherry Picks
[Doc] updating torch.compile doc link documentation Improvements or additions to documentation
#25989 opened Sep 30, 2025 by nadathurv Loading…
[Bugfix] Allow skipping MoE in NVFP4 (fix for MTP) deepseek Related to DeepSeek models ready ONLY add when PR is ready to merge/full CI is needed speculative-decoding
#25987 opened Sep 30, 2025 by benchislett Loading…
[Model] MTP fallback to eager for DeepSeek v32 deepseek Related to DeepSeek models ready ONLY add when PR is ready to merge/full CI is needed speculative-decoding v1
#25982 opened Sep 30, 2025 by luccafong Loading…
5 tasks
v0.11.0 Cherry Picks
[P/D] Support async transfer for P2P NCCL connector documentation Improvements or additions to documentation kv-connector
#25976 opened Sep 30, 2025 by ruisearch42 Draft
5 tasks
Add more tests for batch invariant kernel-override logic [3/n] v1
#25975 opened Sep 30, 2025 by bwasti Loading…
3 of 5 tasks
[Misc] Add penalties sampling parameters to serve tool performance Performance-related issues ready ONLY add when PR is ready to merge/full CI is needed
#25974 opened Sep 30, 2025 by southfreebird Loading…
[WIP][Core/DBO][4/N] Support alternate low-latency schedule deepseek Related to DeepSeek models documentation Improvements or additions to documentation v1
#25972 opened Sep 30, 2025 by LucasWilkinson Loading…
[torchao] safetensors integration
#25969 opened Sep 30, 2025 by liangel-02 Loading…
[CI/Build] Update the Dockerfile to use vllm serve command ci/build ready ONLY add when PR is ready to merge/full CI is needed
#25967 opened Sep 30, 2025 by DarkLight1337 Loading…
5 tasks
Feature/video support in random mm dataset performance Performance-related issues
#25963 opened Sep 30, 2025 by BloodAxe Draft
5 tasks
ProTip! What’s not been updated in a month: updated:<2025-08-30.