4 changes: 2 additions & 2 deletions src/compressed_tensors/compressors/format.py
@@ -34,13 +34,13 @@ def infer_model_format(
"""
Infers the quantization format for a model based on its modules

-For a summary of the formats, see `docs/guides/compression_formats.md`.
+For a summary of the formats, see https://docs.vllm.ai/projects/llm-compressor/en/latest/guides/compression_schemes/.

:param model: model to check for quantization
:param quantization_format: optional global format to override
the per module formats
:return: list of formats applied to modules (excluding dense format)
-"""
+""" # noqa: E501
formats = set()

for _, module in model.named_modules(remove_duplicate=True):