Instruction Tuning with Increasing Scale

This repository (work in progress) explores instruction tuning techniques for large language models. It starts with a minimal single-GPU setup and gradually introduces more advanced parallelization strategies to enable efficient large-scale training on an NVIDIA DGX system.

Project Structure

Each stage of the project is organized in its own directory:

├── 01-single-gpu/ # Instruction tuning on a single GPU
└── 02-dataparallel-single-node/ # Instruction tuning with Data Parallelism on up to 8 GPUs
└── 03-fsdp-single-node/ # Instruction tuning with Fully Sharded Data Parallelism on up to 8 GPUs

To run a specific stage, navigate to the corresponding folder (e.g., 01-single-gpu) and follow the usage instructions in that folder’s README.md.

Requirements

Dependencies and environment setup are described in the subdirectory READMEs. Typically, you'll need:

Python 3.10+
PyTorch
CUDA-compatible GPU drivers
Hugging Face Transformers (for most examples)

Acknowledgements

The code builds upon:

LLMs from Scratch by Sebastian Raschka
Distributed training guide from Lambda Labs

**Note: All training hyperparameters such as learning rate, batch size, number of epochs were chosen for illustration purposes and not further optimized.

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
01-single-gpu		01-single-gpu
02-dataparallel-single-node		02-dataparallel-single-node
03-fsdp-single-node		03-fsdp-single-node
04-fsdp-dpo-single-node		04-fsdp-dpo-single-node
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Instruction Tuning with Increasing Scale

Project Structure

Requirements

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Instruction Tuning with Increasing Scale

Project Structure

Requirements

Acknowledgements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages