-
Notifications
You must be signed in to change notification settings - Fork 114
Pull requests: vllm-project/llm-compressor
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add traceable mistral and mistral3 classes
ready
When a PR is ready for review
#1343
opened Apr 9, 2025 by
anmarques
Loading…
Update test_oneshot_and_finetune.py to use pytest.approx
ready
When a PR is ready for review
#1339
opened Apr 9, 2025 by
markurtz
Loading…
[Tracing][Testing] Add tracing tests
ready
When a PR is ready for review
#1335
opened Apr 8, 2025 by
kylesayrs
Loading…
[Compression] Update sparsity calculation lifecycle when fetching the compressor
ready
When a PR is ready for review
#1332
opened Apr 8, 2025 by
dsikka
Loading…
[Sequential] Support models with nested
_no_split_modules
#1329
opened Apr 6, 2025 by
kylesayrs
Loading…
fix: Make Recipe.model_dump() output compatible with model_validate()
#1328
opened Apr 6, 2025 by
ved1beta
Loading…
bugfix kv cache quantization with ignored layers
ready
When a PR is ready for review
#1312
opened Apr 1, 2025 by
brian-dellabetta
Loading…
[Tracing] Remove
TraceableWhisperForConditionalGeneration
#1310
opened Apr 1, 2025 by
kylesayrs
Loading…
[Tracing] Better runtime error messages
ready
When a PR is ready for review
#1307
opened Apr 1, 2025 by
kylesayrs
Loading…
Reduce SmoothQuant Repr
ready
When a PR is ready for review
#1289
opened Mar 27, 2025 by
kylesayrs
Loading…
Smoothquant typehinting and onloading context
ready
When a PR is ready for review
#1285
opened Mar 26, 2025 by
kylesayrs
Loading…
[Tests] Add mark skip for GPU
ready
When a PR is ready for review
#1264
opened Mar 18, 2025 by
kylesayrs
Loading…
[Performance] Sequential onloading
ready
When a PR is ready for review
#1263
opened Mar 18, 2025 by
kylesayrs
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.