Name	Name	Last commit message	Last commit date
parent directory ..
.gitkeep	.gitkeep
README.md	README.md
README.zh.md	README.zh.md
prepare_data.py	prepare_data.py

Name

Last commit message

Last commit date

README.md

README.zh.md

prepare_data.py

scripts/

Utility scripts. Run once to set up the environment.

Files

`prepare_data.py`

Downloads calibration and evaluation data from HuggingFace and saves them as JSONL files in data/calibration/ and data/eval/.

Run once after cloning:

python scripts/prepare_data.py

Download specific datasets only:

python scripts/prepare_data.py --dataset gsm8k alpaca humaneval

Available dataset names: wikitext2, alpaca, gsm8k, humaneval, qa, sharegpt, sum

Output files (per dataset):

data/calibration/{name}_128.jsonl — 128 samples for AWQ/GPTQ calibration
data/eval/{name}_eval.jsonl — full eval split for PPL measurement

Requirements: Internet connection (one-time). The downloaded files are committed to the repository, so collaborators and CI don't need to run this again.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

scripts/

Files

`prepare_data.py`

FilesExpand file tree

scripts

Directory actions

More options

Directory actions

More options

Latest commit

History

scripts

Folders and files

parent directory

README.md

scripts/

Files

prepare_data.py

`prepare_data.py`