-
Notifications
You must be signed in to change notification settings - Fork 1.8k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[fix] illegal memory in _fwd_kernel_ep_scatter_2
#6348
opened May 16, 2025 by
xutizhou
Loading…
6 tasks
fix: remove content=none test when tool called
#6347
opened May 16, 2025 by
shuaills
Loading…
6 tasks
Add fp8 shared_expert kernel for CPU in sgl-kernel and add UT
#6339
opened May 16, 2025 by
chunyuan-w
•
Draft
WIP: [feat] Support different attention backends for prefill and decode
#6338
opened May 16, 2025 by
Qiaolin-Yu
Loading…
6 tasks
[Fix] Improve dependencies for Blackwell image
#6334
opened May 15, 2025 by
Fridge003
Loading…
6 tasks
misc: Implement RankZeroFilter for rank-specific logging in model_runner.py
#6333
opened May 15, 2025 by
CatherineSue
Loading…
3 of 6 tasks
kernel: Remove unnecessary arguments from
moe_align_block_size
#6332
opened May 15, 2025 by
tanruixiang
Loading…
2 of 6 tasks
The Gemma template is missing a newline after the user role.
#6331
opened May 15, 2025 by
ysulsky
Loading…
1 of 6 tasks
fix: allow
launch_dummy_health_check_server
to start inside of running asyncio loop
#6330
opened May 15, 2025 by
ishandhanani
Loading…
6 tasks
add fused moe tuning config for Llama-4-Maverick-17B-128E-Instruct
wip
#6329
opened May 15, 2025 by
BBuf
Loading…
6 tasks
Refactor DeepSeek logic into atomic operations
#6326
opened May 15, 2025 by
fzyzcjy
Loading…
6 tasks
Refactor DeepSeek MoE layer to unify the two forward branches
#6325
opened May 15, 2025 by
fzyzcjy
Loading…
6 tasks
Minor code cleanup refactor for DeepSeek models
#6324
opened May 15, 2025 by
fzyzcjy
Loading…
6 tasks
refactor: Extract repeated member variables in KVCache subclasses to base class.
#6323
opened May 15, 2025 by
wangxiyu191
Loading…
1 of 6 tasks
Refactor communication logic of DeepSeek for extensibility and understandability
#6321
opened May 15, 2025 by
fzyzcjy
Loading…
6 tasks
fix: limit peak memory usage when computing logprobs
#6318
opened May 15, 2025 by
aftersnow
Loading…
3 of 6 tasks
Fix one wasted kernel in DeepSeek and minor refactor
#6316
opened May 15, 2025 by
fzyzcjy
Loading…
6 tasks
[RL] allow weight updation with dp attention enabled
#6311
opened May 15, 2025 by
zhuzilin
Loading…
1 of 6 tasks
[RL] Remove the w13 weight_scale and input_scale for UnquantizedEPMoE…
#6308
opened May 15, 2025 by
zhuzilin
Loading…
1 of 6 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.