Skip to content

xinhaoH/SoLA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SoLA

[AAAI 2025] SoLA: Leveraging Soft Activation Sparsity and Low-Rank Decomposition for Large Language Model Compression

image

Usage

Construct Calibration Data

sola_neuron_idx/utils.py get_wikitext2() --> data/wiki_256_4096.json

Compute Neurons Norm

bash sola_neuron_idx/playground.sh --> data/hitter_dict_15_256_4096_wiki_13b.json

Low-Rank Decomposition and Evaluation

bash sola/playground_sola.sh

Llama-2-7B/13B low-rank decomposition and evaluation

Remember modify file path to your own 

Others

sola_neuron_idx/hook.py llama_self_attn() past_key_value.update_get() in transformers-4.37.1/src/transformers/cache_utils.py

You can install modified transformers package: cd transformers-4.37.1; pip install -e .

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors