-
Notifications
You must be signed in to change notification settings - Fork 11
Pull requests: huawei-csl/pto-kernels
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add Ascend950 pure-vector simulator examples for SiLU and SwiGLU.
#172
opened May 27, 2026 by
learning-chip
Collaborator
Loading…
msprof simulator on custom A5 kernel + torch_npu wrapper launch
#169
opened May 18, 2026 by
learning-chip
Collaborator
Loading…
Minimum demo to highlight cross-core sync API differences
#158
opened May 11, 2026 by
learning-chip
Collaborator
Loading…
1 task
[Feat] Implement doubly-stochastic Sinkhorn normalization kernel
#134
opened Apr 21, 2026 by
Mocchibird
Contributor
•
Draft
Complete chunkwise GatedDeltaNet
#91
opened Apr 7, 2026 by
learning-chip
Collaborator
Loading…
7 tasks done
Chunkwise gated linear attention reaching 60~80 TFLOP/s, with step-by-step optimization records
#88
opened Apr 5, 2026 by
learning-chip
Collaborator
Loading…
9 of 17 tasks
compare host vs device-side chunk metadata computation
#84
opened Apr 1, 2026 by
learning-chip
Collaborator
•
Draft
c2v sync example using TSYNC or TPUSH/TPOP
#65
opened Mar 23, 2026 by
learning-chip
Collaborator
Loading…
2 tasks
Code hygiene remove membase define
Under Discussion
The issue/pull request is still under discussion
ProTip!
Adding no:label will show everything without a label.