-
Notifications
You must be signed in to change notification settings - Fork 660
Pull requests: PaddlePaddle/FastDeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Polish] Simplify __repr__ method in Request class
#5153
opened Nov 20, 2025 by
Jiang-Jia-Jun
Loading…
5 tasks done
[PD Disaggregation] [Refine] Refine splitwise deployment
#5151
opened Nov 20, 2025 by
juncaipeng
Loading…
5 tasks done
[Feature] The 45VL supports prompt_token_ids + messages input.
contributor
External developers
#5148
opened Nov 20, 2025 by
kxz2002
Loading…
5 tasks done
[XPU] [Optimization] [EP] EP communication optimization.
#5145
opened Nov 20, 2025 by
zccjjj
Loading…
5 tasks
[Feature] Supports separate loading of offline quantization for moe.
#5142
opened Nov 20, 2025 by
xiaoxiaohehe001
Loading…
5 tasks done
[Feature] Support offline quantization for moe and tp4 eplb
#5141
opened Nov 20, 2025 by
xiaoxiaohehe001
Loading…
5 tasks done
[Feature] enable guided decoding ENABLE_V1_KVCACHE_SCHEDULER = 1
#5140
opened Nov 20, 2025 by
ST-XX
Loading…
5 tasks done
[Models] Add forward_meta to moe models' forward function
#5138
opened Nov 20, 2025 by
Wanglongzhi2001
Loading…
5 tasks done
[feat] add port conflict detection for cache-queue-port and engine-wor…
contributor
External developers
#5135
opened Nov 19, 2025 by
sunlei1024
Loading…
3 of 4 tasks
[Feature] support flash_mask_attention backend
#5134
opened Nov 19, 2025 by
lizhenyun01
Loading…
5 tasks
[BugFix] [PD Disaggregation] fix v1 scheduler prefill node profile run & ipc transfer protocol
#5132
opened Nov 19, 2025 by
liyonghua0910
Loading…
5 tasks done
[CI] Unified diff coverage upload logic
#5127
opened Nov 19, 2025 by
EmmonsCurse
Loading…
5 tasks done
[Feature] Guided Decoding add LLguidance backend
#5124
opened Nov 19, 2025 by
ST-XX
Loading…
5 tasks done
[PD Disaggregation][XPU] Add XPU support for PD disaggregation
#5113
opened Nov 18, 2025 by
ddchenhao66
Loading…
5 tasks
[CI] [DEBUG] Coverage merge xpu and gpu
#5111
opened Nov 18, 2025 by
EmmonsCurse
Loading…
5 tasks done
[Optimize] Reduce comm overhead of engine-worker by obtaining requests asynchronously
#5105
opened Nov 18, 2025 by
Jiang-Jia-Jun
Loading…
5 tasks
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.