AI By Hand - DeepSeek solution

The original blank file can be downloaded from ai-by-hand.
This file provides a hands-on walkthrough of the following concepts used in DeepSeek:

  • Multi-head Latent Attention
  • RoPE (Rotary Position Embedding)
  • Mixture of Experts
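
As a small illustration of one of the concepts above, here is a minimal sketch of RoPE in plain Python. It is not taken from the worksheet: the function names `rope` and `dot`, the base of 10000, and the choice to rotate adjacent feature pairs are all assumptions made for this example, and real implementations may pair features differently. The sketch shows the property that makes RoPE useful for attention: the score between a rotated query and a rotated key depends only on the distance between their positions.

```python
import math


def rope(vec, pos, base=10000.0):
    """Rotate adjacent feature pairs of `vec` by position-dependent angles.

    Hypothetical helper for illustration; assumes an even-length vector.
    """
    dim = len(vec)
    if dim % 2 != 0:
        raise ValueError("RoPE needs an even number of features")
    out = []
    for i in range(0, dim, 2):
        # Each pair gets its own frequency; later pairs rotate more slowly.
        theta = pos * base ** (-i / dim)
        c, s = math.cos(theta), math.sin(theta)
        a, b = vec[i], vec[i + 1]
        # Standard 2D rotation of the pair (a, b) by angle theta.
        out.extend([a * c - b * s, a * s + b * c])
    return out


def dot(u, v):
    """Plain dot product, standing in for an attention score."""
    return sum(x * y for x, y in zip(u, v))


q = [1.0, 2.0, 3.0, 4.0]
k = [0.5, -1.0, 2.0, 0.0]

# Both pairs of positions are 2 apart, so the two scores should agree
# up to floating-point error.
score_near = dot(rope(q, 5), rope(k, 3))
score_far = dot(rope(q, 12), rope(k, 10))
```

Because each pair is only rotated, the length of the vector is unchanged, and position 0 leaves the vector as it was.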

Preview