Pull requests: NVIDIA/TensorRT-LLM

Pull requests list

Fix docker cache mount
#5763 opened Jul 4, 2025 by MartinMarciniszyn
Add wide-ep benchmarking scripts
#5760 opened Jul 4, 2025 by qiaoxj07
Fix --image_path param error in multimodal run.py tests (label: Community want to contribute)
#5757 opened Jul 4, 2025 by pandalee99
Fix cancel request bug in attentiondp
#5754 opened Jul 4, 2025 by Shunkangz
chore: log stack trace on error in openai server
#5749 opened Jul 4, 2025 by zhengd-nv
Update transformers to 4.53.0
#5747 opened Jul 4, 2025 by Wanli-Jiang
feat: moe prepare support topk % 4 != 0
#5742 opened Jul 4, 2025 by WeiHaocheng
[feat] Support nvidia/Cosmos-Reason1-7B
#5739 opened Jul 4, 2025 by meatybobby
chores: merge examples for v1.0 doc
#5736 opened Jul 3, 2025 by hchings Draft
1 of 3 tasks
chore: some refactor on WideEP
#5727 opened Jul 3, 2025 by dongxuy04
[ci] speedup fused moe tests
#5726 opened Jul 3, 2025 by omera-nv