ai-field-notes

A repository to consolidate my explorations and learnings in the field of artificial intelligence.

Fundamentals

CNN: Backpropagation Applied to Handwritten Zip Code Recognition
AlexNet: ImageNet Classification with Deep Convolutional Neural Networks
Transformer Architecture: Attention Is All You Need

Training

Pretraining

The Ultra-Scale Playbook: Training LLMs on GPU Clusters

Training Frameworks

Distributed Training & 5D Parallelism

Tensor Parallelism (TP)
Sequence/Context Parallelism (SP/CP)
Expert Parallelism (EP)
- Mixtral of Experts
Pipeline Parallelism (PP)
- DualPipe
Data Parallelism (DP)
- Fully Sharded Data Parallelism (FSDP)
- Zero Redundancy Optimizer (ZeRO)

Kernels & Compute

DeepGEMM

Post-training

Supervised Fine-Tuning (SFT)
Parameter-Efficient Fine-Tuning (PEFT)
- Low-Rank Adaptation (LoRA)
Reinforcement Learning (RL)
- Reinforcement Learning from Human Feedback (RLHF)
- Proximal Policy Optimization (PPO)
- Group Relative Policy Optimization (GRPO)
- Direct Preference Optimization (DPO)

Inference

KV Cache
PagedAttention: Efficient Memory Management for Large Language Model Serving with PagedAttention
Speculative Decoding: Speculative Decoding for Efficient LLM Inference; Better & Faster Large Language Models via Multi-token Prediction
Batching
Quantization

Serving Frameworks

Agentic AI

Prompt Engineering

Retrieval Augmented Generation (RAG)

Context Engineering

Model Context Protocol (MCP)

Agent Frameworks

Mechanistic Interpretability

Explorations
