Minimum demo to highlight cross-core sync API differences by learning-chip · Pull Request #158 · huawei-csl/pto-kernels

learning-chip · 2026-05-11T19:13:13Z

This PR demos two cases:

Pure data streaming along Cube (L0C) -> Vector (UB) and Vector (UB) -> Cube (L1), to measure bandwidth, with no compute
Fused (pipelined) matmul-add (C2V) and add-matmul (V2C) as minimal mixed-kernel examples

Each case is reimplemented in 3 to 4 different API styles, including raw flags, simple push, advanced push, ...
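To make the contrast between the "raw flag" and "push" styles concrete, here is a host-side analogy in plain Python threads. It is only a sketch of the two synchronization patterns and does not use or imitate the pto-isa API: the function names, `NUM_TILES`, and `NUM_SLOTS` are hypothetical, `threading.Event` stands in for a hardware set/wait flag pair, and `queue.Queue` stands in for a push/pop FIFO.

```python
import queue
import threading

NUM_TILES = 8   # hypothetical number of tiles streamed producer -> consumer
NUM_SLOTS = 2   # hypothetical double buffer


def stream_with_raw_flags():
    """Handoff with one explicit ready/free flag pair per buffer slot."""
    slots = [None] * NUM_SLOTS
    ready = [threading.Event() for _ in range(NUM_SLOTS)]  # producer -> consumer
    free = [threading.Event() for _ in range(NUM_SLOTS)]   # consumer -> producer
    for flag in free:
        flag.set()  # every slot starts empty
    received = []

    def producer():
        for i in range(NUM_TILES):
            s = i % NUM_SLOTS
            free[s].wait()    # wait until the consumer has released the slot
            free[s].clear()
            slots[s] = i      # "write the tile"
            ready[s].set()    # signal that the data is ready

    def consumer():
        for i in range(NUM_TILES):
            s = i % NUM_SLOTS
            ready[s].wait()   # wait until the producer has filled the slot
            ready[s].clear()
            received.append(slots[s])
            free[s].set()     # hand the slot back

    threads = [threading.Thread(target=producer), threading.Thread(target=consumer)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return received


def stream_with_fifo():
    """Same handoff, with the flag bookkeeping hidden behind a bounded FIFO."""
    fifo = queue.Queue(maxsize=NUM_SLOTS)
    received = []

    def producer():
        for i in range(NUM_TILES):
            fifo.put(i)       # blocks while the FIFO is full

    def consumer():
        for _ in range(NUM_TILES):
            received.append(fifo.get())  # blocks while the FIFO is empty

    threads = [threading.Thread(target=producer), threading.Thread(target=consumer)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return received
```

Both functions deliver the tiles in order. The flag version spells out every wait/set pair, while the FIFO version trades that control for less bookkeeping, which is roughly the trade-off the different API styles in this PR expose.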

Required dependency to run: pto-isa. Tested on pto-isa commit 933ad5d8 (05/12). The pto-isa version should be at least newer than commit aef3a004 (05/07), i.e. after PR 895.

TODO:

Test on A5/950. The C-V bandwidth should increase from 1 TB/s to the order of 5-10 TB/s.
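For reference, a bandwidth figure like the ones above is just bytes moved divided by elapsed time. The helper below is a minimal sketch and not the benchmark script from this PR: the function name is hypothetical, and it assumes 1 TB = 10^12 bytes.

```python
def bandwidth_tb_per_s(bytes_moved: int, elapsed_s: float) -> float:
    """Return the achieved bandwidth in TB/s, assuming 1 TB = 1e12 bytes."""
    if elapsed_s <= 0:
        raise ValueError("elapsed_s must be positive")
    return bytes_moved / elapsed_s / 1e12


# Hypothetical numbers, not measurements: 1 GiB streamed in 1 ms is about 1.07 TB/s.
example = bandwidth_tb_per_s(2**30, 1e-3)
```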

learning-chip added 6 commits May 11, 2026 18:22

readme for cross-core sync demo

9c23866

minimum clean matmul_add using explicit sync flags

cc7fd75

minimum add_matmul v2c example

415ede6

minimum code demo for C<->V data streaming

2fcf504

remove GEMM operation and input matrix B for v2c demo

7ad0e9c

finish push-pop version of cv sync demos

08ca6a7

learning-chip marked this pull request as ready for review May 12, 2026 07:05

learning-chip added 6 commits May 12, 2026 08:36

replace pipe_barrier all by set-wait pairs

4e5f0c7

add "naive separate stage" baseline to highlight pipeline benefit

2f18dab

fix sync error, update gm_pipe numbers for matmul_add

fa21496

remove legacy code, add missing benchmark script

bd53fcd

fix sync error and refresh all README

10ae5d8

update pto-isa dependency to fix c2c push fifo

7d3f471
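The "naive separate stage" baseline commit above is there to show the pipeline benefit. A back-of-the-envelope model of that benefit, assuming an idealized two-stage pipeline with constant per-tile stage times, enough buffering, and zero sync overhead, can be sketched as follows. The function names and timings are hypothetical and are not taken from the PR's measurements.

```python
def naive_time(num_tiles: int, t_cube: float, t_vector: float) -> float:
    """Stages run back to back for every tile, with no overlap."""
    return num_tiles * (t_cube + t_vector)


def pipelined_time(num_tiles: int, t_cube: float, t_vector: float) -> float:
    """Ideal two-stage pipeline: after the first tile, the slower stage sets the pace."""
    if num_tiles == 0:
        return 0
    return t_cube + t_vector + (num_tiles - 1) * max(t_cube, t_vector)


# Hypothetical example: 8 tiles, Cube stage 2 time units, Vector stage 1 time unit.
naive = naive_time(8, 2, 1)          # 8 * (2 + 1) = 24
pipelined = pipelined_time(8, 2, 1)  # 2 + 1 + 7 * 2 = 17
```

In this model the speedup approaches `(t_cube + t_vector) / max(t_cube, t_vector)` for many tiles, so it is at most 2x when the two stages are balanced.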

learning-chip wants to merge 12 commits into huawei-csl:main from learning-chip:cv_comm_demo
