Skip to content

Latest commit

 

History

History
69 lines (43 loc) · 2.05 KB

File metadata and controls

69 lines (43 loc) · 2.05 KB

Supported by JetBrains Supported by Manning Publications Pull Shark

Systematic CUDA Learning

Learning CUDA properly: from fundamentals to real-world GPU systems.

This repo documents my journey of learning CUDA from scratch.

Just real understanding, built step by step.

This repo is different.

It focuses on:

  • understanding how GPUs actually work
  • building intuition before writing code
  • connecting CUDA to real-world systems (Kubernetes, AI workloads, etc.)

Repo Structure

The project is split into two main parts:

GPU/

This is where everything starts.

It covers the fundamentals:

  • GPU architecture
  • memory model
  • compute capability
  • performance fundamentals
  • hardware evolution (up to 2026 architectures)

Each section is structured step by step.

Each folder contains: notes (clear explanations), visual summaries and structured learning progression.

CUDA/

This is where theory turns into practice.

What makes this different

  • No noise, no fluff
  • No blind copy from docs
  • Built like an engineer, not a tutorial

Everything here is written to answer one question:

“Do I actually understand what’s happening?”

Sponsorship

This project is supported by JetBrains. JetBrains provides professional developer tools that I actively use for CUDA development, experimentation, and documentation.

This project is also supported by Manning Publishing. They provide high-quality technical books that I use to deepen my understanding of CUDA, GPU systems, and parallel computing.

Special thanks to Manning Publishing for providing CUDA for Deep Learning (by Elliot Arledge).