Pull requests: PaddlePaddle/FastDeploy
【Sync develop】support vl model name_mapping and ori_vocab_size
#2915
opened Jul 18, 2025 by
gzy19990617
[Executor] Avoid OOM when starting the service with Chunked Prefill + CudaGraph enabled
contributor
#2914
opened Jul 18, 2025 by
littledgg
[BugFix] Deepseek renaming cum_offsets to cu_seqlens_q
contributor
#2895
opened Jul 17, 2025 by
K11OntheBoat
[Fix] fix empty prompt_token_ids, update the parser's triggering condit…
contributor
#2891
opened Jul 17, 2025 by
luukunn
[Feature] support c16 prefix_cache in flash_attention_v3
#2766
opened Jul 9, 2025 by
lizhenyun01
[Feature] mm and thinking model support structured output
#2749
opened Jul 8, 2025 by
kevincheng2
Support use safetensors with paddle.MmapStorage to load model files
contributor
#2730
opened Jul 7, 2025 by
zeroRains
1 of 2 tasks