Skip to content

Add HQQ KV cache quantization for Transformers

244d3cb
Select commit
Loading
Failed to load commit list.
Open

Add Quanto4,2, HQQ4,2 KV cache quantization support to Transformers loader #6768

Add HQQ KV cache quantization for Transformers
244d3cb
Select commit
Loading
Failed to load commit list.

Workflow runs completed with no jobs