This repository hosts LLM tutorials based on JAX.
- 01.miniGPT: build a miniGPT model from scratch and pretrain it on the TinyStories dataset (a minimal attention sketch follows this list)
- 02.GPT2 pretraining: pretrain 124M and 354M GPT2 models on the OpenWebText dataset (inspired by nanoGPT)
- 03.GPT2 instruction tuning: instruction tune the 124M pretrained GPT2 from above
- 04.Loading the Llama 3.2 1B Instruct model from Hugging Face: load an existing model from HF and run inference
- DPO (WIP)
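
For a flavor of what the from-scratch tutorials cover, below is a minimal sketch of single-head causal self-attention in pure JAX. The function name, parameter layout, and dimensions here are illustrative assumptions, not the tutorials' actual code (the notebooks may, for example, use Flax modules and multi-head attention).

```python
# Hypothetical sketch: single-head causal self-attention in pure JAX.
import jax
import jax.numpy as jnp


def causal_self_attention(params, x):
    """Single-head causal self-attention over a (seq_len, d_model) input."""
    seq_len, d_model = x.shape
    q = x @ params["w_q"]                         # (seq_len, d_model)
    k = x @ params["w_k"]
    v = x @ params["w_v"]
    scores = q @ k.T / jnp.sqrt(d_model)          # (seq_len, seq_len)
    # Causal mask: position i may only attend to positions <= i.
    mask = jnp.tril(jnp.ones((seq_len, seq_len), dtype=bool))
    scores = jnp.where(mask, scores, -jnp.inf)
    weights = jax.nn.softmax(scores, axis=-1)
    return weights @ v                            # (seq_len, d_model)


if __name__ == "__main__":
    d_model, seq_len = 16, 8
    key = jax.random.PRNGKey(0)
    k1, k2, k3, k4 = jax.random.split(key, 4)
    params = {
        "w_q": jax.random.normal(k1, (d_model, d_model)) / jnp.sqrt(d_model),
        "w_k": jax.random.normal(k2, (d_model, d_model)) / jnp.sqrt(d_model),
        "w_v": jax.random.normal(k3, (d_model, d_model)) / jnp.sqrt(d_model),
    }
    x = jax.random.normal(k4, (seq_len, d_model))
    out = jax.jit(causal_self_attention)(params, x)
    print(out.shape)  # (8, 16)
```

The same attention pattern underlies the miniGPT and GPT2 tutorials; the full notebooks add multi-head projections, MLP blocks, embeddings, and the training loop.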