We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent c4ec631 commit 0becc72Copy full SHA for 0becc72
examples/llama7b_sparse_quantized/README.md
@@ -2,7 +2,7 @@
2
3
This example uses SparseML and Compressed-Tensors to create a 2:4 sparse and quantized Llama2-7b model.
4
The model is calibrated and trained with the ultachat200k dataset.
5
-At least 75GB of GPU memory is required to run this example.
+At least 85GB of GPU memory is required to run this example.
6
7
Follow the steps below one by one in a code notebook, or run the full example script
8
as `python examples/llama7b_sparse_quantized/llama7b_sparse_w4a16.py`
0 commit comments