-
Notifications
You must be signed in to change notification settings - Fork 629
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Adapt dispatch_ffn_combine for decoding
module:core
module:ops
module:quantization
module:tests
#4762
opened Dec 6, 2025 by
kiscad
Loading…
[Perf] Update causal conv1d fn for better perf
module:ops
#4759
opened Dec 6, 2025 by
SunnyLee151064
Loading…
refactor rejection sampler
merge-conflicts
#4758
opened Dec 6, 2025 by
realliujiaxu
•
Draft
1 of 3 tasks
[Feature] Reduce the cost of torchair
ready
read for review
ready-for-test
start test by label for PR
#4756
opened Dec 5, 2025 by
jianzs
Loading…
[CI] Fix ngram & suffix test oom
module:tests
ready
read for review
ready-for-test
start test by label for PR
#4755
opened Dec 5, 2025 by
fluctlux
Loading…
[Bugfix] Prevent engine hang during KVCacheSendingThread startup
ready
read for review
ready-for-test
start test by label for PR
#4754
opened Dec 5, 2025 by
jianzs
Loading…
[Bugfix] Disable the dispatch_ffn_combine kernel in MTP path
#4751
opened Dec 5, 2025 by
kiscad
Loading…
[CustomOp][MM] Register AscendMMEncoderAttention CustomOp and remove related patch
module:core
module:ops
#4750
opened Dec 5, 2025 by
shen-shanshan
Loading…
[Bugfix] Add the check for a null VllmConfig
module:core
#4749
opened Dec 5, 2025 by
gcanlin
Loading…
add qwen3_next ops: fused_qkvzba_split_reshape and rope_forward_triton
module:ops
#4747
opened Dec 5, 2025 by
ZT-AIA
Loading…
[Bugfix] Fix MoE MLP related issues (ref #4490)
module:ops
module:tests
#4743
opened Dec 5, 2025 by
Clorist33
Loading…
[Bugfix] Fix eplb device transfer loader issues (ref #4490)
module:tests
#4742
opened Dec 5, 2025 by
Clorist33
Loading…
BugFix: Resolve PolicyFlashlb warm up function attribute error
#4741
opened Dec 5, 2025 by
Mercykid-bash
Loading…
[Misc] Rope optimize
merge-conflicts
module:core
module:ops
#4740
opened Dec 5, 2025 by
Angazenn
Loading…
[WIP][perf] replace all_reduce for kv_consumer and support different num_tokens among all ranks
merge-conflicts
module:ops
module:quantization
#4736
opened Dec 5, 2025 by
linfeng-yuan
Loading…
[Fix] fix llava-1.5-7b-hf & Qwen2-Audio-7B-Instruct accuracy test
module:tests
#4734
opened Dec 5, 2025 by
zhangxinyuehfad
Loading…
[Misc] Upgrade vllm commit to 12_05
documentation
Improvements or additions to documentation
ready
read for review
ready-for-test
start test by label for PR
#4733
opened Dec 5, 2025 by
Potabk
Loading…
feat: implement high-performance Triton kernels for rejection sampling
#4732
opened Dec 5, 2025 by
yuxingcyx
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.