Skip to content

Chunkwise gated linear attention reaching 60~80 TFLOP/s, with step-by-step optimization records#88

Open
learning-chip wants to merge 31 commits into
mainfrom
linear_attn
Open

Chunkwise gated linear attention reaching 60~80 TFLOP/s, with step-by-step optimization records#88
learning-chip wants to merge 31 commits into
mainfrom
linear_attn

Commits

Commits on Apr 15, 2026