High-perf SPMD paged attention (GQA) matching CANN IFA performance#655
Draft
learning-chip wants to merge 2 commits into
Draft
High-perf SPMD paged attention (GQA) matching CANN IFA performance#655learning-chip wants to merge 2 commits into
learning-chip wants to merge 2 commits into