Pull requests: vllm-project/llm-compressor

Pull requests list

[AutoRound] Update autoround to the release version
#2062 opened Nov 21, 2025 by yiliu30
[bugfix] saving model without model.name_or_path (labels: bug, ready)
#2061 opened Nov 20, 2025 by HDCharles
support npu platform
#2058 opened Nov 20, 2025 by LHXuuu
[Misc] Remove is_moe_model
#2053 opened Nov 20, 2025 by kylesayrs
[Document] Fix formatting and typos in modifiers.
#2047 opened Nov 19, 2025 by mutichung
Testing Clean-up
#2045 opened Nov 18, 2025 by dsikka Draft
Support RHAIIS images for the e2e tests
#2032 opened Nov 13, 2025 by dhuangnm
Support wInt4aFp8 for moe
#2027 opened Nov 12, 2025 by Wangzheee
[Bugfix] IntermediatesCache nested model inputs (labels: ready)
#2015 opened Nov 10, 2025 by kylesayrs
[TypeHint] Fix format_calibration_data type hint
#2012 opened Nov 10, 2025 by kylesayrs
Implement propagate_error argument (labels: ready)
#2008 opened Nov 10, 2025 by kylesayrs
Granite4 FP8 Block Quantization
#2001 opened Nov 6, 2025 by krishnateja95
[model_free_ptq] NVFP4A16 (labels: ready)
#1988 opened Nov 3, 2025 by kylesayrs
[Kimi Linear] FP8 Example
#1986 opened Oct 31, 2025 by dsikka Draft
[AWQ] Generalize AWQ quantization
#1961 opened Oct 22, 2025 by kylesayrs Draft
2 of 3 tasks
[Attention] Support FP4 attention quantization (labels: nvfp4)
#1924 opened Oct 14, 2025 by kylesayrs
add gpt oss nvfp4 example
#1885 opened Sep 30, 2025 by shanjiaz Draft