Smolmodels

Collection of experiments related to tuning small language models for specific tasks.

Installation

pip install uv
uv venv
source .venv/bin/activate
uv sync --group torch
uv sync --no-build-isolation --group training
pyright --createstub transformers

vLLM

CUDA:

export CUDA_HOME=/usr/local/cuda
sudo apt purge cmake
uv pip install setuptools_scm cmake
uv pip install vllm -vv --no-build-isolation

Mac:

uv pip install pip
pip install vllm==0.7.0 --use-deprecated=legacy-resolver

llama-cpp can also be installed with:

uv pip install "llama-cpp-python[server]" --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/metal

Pyright dependencies

pyright --createstub transformers

Modal

# Training
uv run modal run -d  modal_entrypoint.py::training
# Generation
uv run modal run -d modal_entrypoint.py::generation
# Inference
modal deploy modal_vllm.py
python util_scripts.py test_openai_api

Generation

python generate.py --task_name roleplaying_game --batch_size 1

Utils

python util_scripts.py download_dataset gutenberg_backtranslate_from_txt

Name		Name	Last commit message	Last commit date
Latest commit History 854 Commits
.claude		.claude
.vscode		.vscode
.zed		.zed
chat_templates		chat_templates
data		data
dataset_files		dataset_files
finetuning		finetuning
gyms/twenty_questions		gyms/twenty_questions
notebooks		notebooks
scripts		scripts
seed_data_files		seed_data_files
synthetic_data		synthetic_data
tests		tests
trl_wrapper		trl_wrapper
.gitattributes		.gitattributes
.gitignore		.gitignore
.python-version		.python-version
CLAUDE.md		CLAUDE.md
README.md		README.md
evaluate.py		evaluate.py
generate.py		generate.py
get_logprobs.py		get_logprobs.py
gguf_inference.py		gguf_inference.py
gguf_inference.sh		gguf_inference.sh
gradio_ui.py		gradio_ui.py
modal_entrypoint.py		modal_entrypoint.py
modal_vllm.py		modal_vllm.py
pyproject.toml		pyproject.toml
submit.sh		submit.sh
train_lightning.py		train_lightning.py
train_sequence_rank.py		train_sequence_rank.py
util_scripts.py		util_scripts.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Smolmodels

Installation

vLLM

Pyright dependencies

Modal

Generation

Utils

About

Uh oh!

Releases 1

Packages

Uh oh!

Languages

brianfitzgerald/smolmodels

Folders and files

Latest commit

History

Repository files navigation

Smolmodels

Installation

vLLM

Pyright dependencies

Modal

Generation

Utils

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Languages

Packages