Add DCU backend and related support #197

Open
bjzhjing wants to merge 6 commits into bytedance:main from bjzhjing:add-DCU-backend

bjzhjing commented May 6, 2026

This PR adds DCU integration for micro_perf (DCU backend), extends llm_sim with DCU deploy configs, adds DCU vendor op implementations and vendor_test_DCU workloads, and includes small Python typing / micro_perf CLI fixes.

bjzhjing and others added 5 commits May 6, 2026 16:51
Fix invalid ClassVar annotation for Python 3.10.
Correct moe_gating_gemm to read dst_dtype properly.
Minor cleanup in launch.py/server.py initialization order (no behavior change intended).

Signed-off-by: Zhang Jing <[email protected]>
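
The commit message above does not show the annotation that was invalid, so the following is only a general sketch of a `ClassVar` form that is valid on Python 3.10. `ToyOpConfig`, its attributes, and `field_names` are hypothetical names chosen for illustration and are not taken from the PR.

```python
from dataclasses import dataclass, fields
from typing import ClassVar, List, Tuple


@dataclass
class ToyOpConfig:
    # Hypothetical config class, not part of the PR.
    # ClassVar wraps the whole annotation at the top level, so dataclasses
    # treats the attribute as class-level state rather than an instance field.
    supported_dtypes: ClassVar[Tuple[str, ...]] = ("float16", "bfloat16")

    # Ordinary instance fields.
    name: str = "moe_gating_gemm"
    dst_dtype: str = "bfloat16"


def field_names(cls: type) -> List[str]:
    """Return the instance field names that dataclasses generated for cls."""
    return [f.name for f in fields(cls)]
```

With this layout, `field_names(ToyOpConfig)` lists only `name` and `dst_dtype`, and `supported_dtypes` is shared by every instance.
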
Introduce BackendDCU and DCU backend entrypoint under src/ following
existing backend structure.

Signed-off-by: Zhang Jing <[email protected]>
Signed-off-by: Zhang Jing <[email protected]>
Co-authored-by: Zhu Fuzhu <[email protected]>
Add DCU vendor provider packages (lightop/custom_ops/lmslim/flash_attn/torch)
and their op implementations.

Fix vendor ops to follow upstream lifecycle (vendor_parser/vendor_impl/vendor_impl_run)
and ensure correct tensor mapping/output allocation where required.

Signed-off-by: Zhang Jing <[email protected]>
Co-authored-by: Zhu Fuzhu <[email protected]>
Co-authored-by: Wang Sen <[email protected]>
Co-authored-by: Zhuang Luo <[email protected]>
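
To make the lifecycle named in the commit message above easier to picture, here is a minimal pure-Python sketch. Only the three method names (`vendor_parser`, `vendor_impl`, `vendor_impl_run`) come from the PR text. The class `ToyVendorAddOp`, the argument keys, the method signatures, and the split of responsibilities between the phases are all assumptions and may differ from the real micro_perf base class.

```python
from typing import Callable, Dict, List


class ToyVendorAddOp:
    """Hypothetical op; plain Python lists stand in for device tensors."""

    def __init__(self, args: Dict[str, object]) -> None:
        self.args = args
        self.vendor_parser()  # phase 1: read and validate arguments
        self.vendor_impl()    # phase 2: map tensors and pick the kernel

    def vendor_parser(self) -> None:
        # Assumed argument keys; the real ops may use different ones.
        self.num_elements = int(self.args["num_elements"])
        self.dst_dtype = str(self.args.get("dst_dtype", "float32"))

    def vendor_impl(self) -> None:
        # Tensor mapping: record which named tensors the kernel consumes
        # and which one it produces.
        self.input_names: List[str] = ["a", "b"]
        self.output_name = "c"
        self._kernel: Callable[[float, float], float] = lambda x, y: x + y

    def vendor_impl_run(self, tensors: Dict[str, List[float]]) -> List[float]:
        # phase 3: allocate the output if the caller did not, then run.
        a, b = (tensors[name] for name in self.input_names)
        out = tensors.setdefault(self.output_name, [0.0] * self.num_elements)
        for i in range(self.num_elements):
            out[i] = self._kernel(a[i], b[i])
        return out
```

In this sketch, parsing and kernel selection happen once at construction, so a timed loop would only call `vendor_impl_run`.
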
Add DCU-specific deploy configs for seed-oss-36b, qwen3-32b, and
qwen3-235b-a22b tp-ep.

Signed-off-by: Zhang Jing <[email protected]>
Co-authored-by: Zhu Fuzhu <[email protected]>
Co-authored-by: Wang Sen <[email protected]>

CLAassistant commented May 6, 2026

CLA assistant check
All committers have signed the CLA.

Signed-off-by: Zhang Jing <[email protected]>
Co-authored-by: Zhu Fuzhu <[email protected]>
Co-authored-by: Wang Sen <[email protected]>
bjzhjing force-pushed the add-DCU-backend branch from ecd7405 to 8addf97 May 7, 2026 03:42
2 participants