# Double sparse pruning

Code for the ICLR 2025 paper *Two Sparse Matrices are Better than One: Sparsifying Neural Networks with Double Sparse Factorization*.

The repository is based on the SparseGPT code.
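
As the paper's title suggests, the core idea is to replace a single sparse weight matrix with a product of two sparse matrices. The sketch below is only a toy illustration of that idea, not the paper's actual algorithm: it alternates least-squares refits with magnitude pruning, and the function names, factor shapes, and optimization loop are our own assumptions.

```python
import torch

def magnitude_prune(m: torch.Tensor, sparsity: float) -> torch.Tensor:
    """Zero out the smallest-magnitude entries so a `sparsity` fraction of `m` is zero."""
    k = int(sparsity * m.numel())
    if k == 0:
        return m
    threshold = m.abs().flatten().kthvalue(k).values
    return m * (m.abs() > threshold)

def double_sparse_factorize(w: torch.Tensor, rank: int, sparsity: float, iters: int = 20):
    """Approximate a dense weight `w` (out_dim x in_dim) by a product a @ b of two sparse factors.

    Illustrative only: alternating least squares + magnitude pruning is an assumed
    procedure, not the method from the paper.
    """
    out_dim, in_dim = w.shape
    a = torch.randn(out_dim, rank) / rank ** 0.5
    b = torch.randn(rank, in_dim) / rank ** 0.5
    for _ in range(iters):
        # Refit one factor by least squares while the other is fixed,
        # then re-impose the sparsity pattern on the refit factor.
        b = magnitude_prune(torch.linalg.lstsq(a, w).solution, sparsity)
        a = magnitude_prune(torch.linalg.lstsq(b.T, w.T).solution.T, sparsity)
    return a, b

if __name__ == "__main__":
    torch.manual_seed(0)
    w = torch.randn(64, 64)
    a, b = double_sparse_factorize(w, rank=64, sparsity=0.5)
    print(f"relative error: {(w - a @ b).norm() / w.norm():.3f}")
```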

## Dependencies

- `torch`: tested on v2.2.1
- `transformers`: tested on v4.35.2
- `datasets`: tested on v2.16.1
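
For convenience, a matching environment can be installed with pip; the pins below are simply the tested versions listed above, not strict requirements:

```bash
pip install torch==2.2.1 transformers==4.35.2 datasets==2.16.1
```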

## Usage

We provide a LLaMA pruning script with the same interface as the original SparseGPT scripts:

```bash
# Sparsify LLaMA with SparseGPT
python llama.py meta-llama/Llama-2-7b-hf c4 --sparsity 0.5
```
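
As in the SparseGPT interface this mirrors, the positional arguments are the Hugging Face model identifier (`meta-llama/Llama-2-7b-hf`) and the calibration dataset (`c4`), while `--sparsity 0.5` requests 50% unstructured sparsity.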

## Other experiments

To replicate the other experiments (comparison with OBC and post-training pruning with finetuning), see the `other_experiments` directory.