[ExecuTorch][WebGPU] Add sigmoid op (aten.sigmoid.default) by JulianCloudNTH · Pull Request #20390 · pytorch/executorch

JulianCloudNTH · 2026-06-18T21:35:23Z

Stack from ghstack (oldest at bottom):

Adds aten.sigmoid.default to the WebGPU delegate: element-wise 1/(1+exp(-x)) over a flat fp32 buffer. On the Llama critical path (F.silu -> sigmoid + mul).

Composition (single dispatch):

sigmoid/UnaryOp.cpp — binds input (storage, read-only) + output (storage) + a Params{num_elements} uniform, 1D-dispatches over num_elements with override wg_size (clamped to the device limit); mirrors the add op (uniform mapped-at-creation, released after the bind group).
sigmoid/sigmoid.wgsl — guards idx >= num_elements and writes the logistic of each element.
@exported-using-ghexport

Differential Revision: D108793157

[ghstack-poisoned]

pytorch-bot · 2026-06-18T21:35:27Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20390

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

[ROCm] MI350 CI runner label rename: rebase PRs using old linux.rocm.gpu.gfx950.* labels

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-06-18T21:36:10Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

JulianCloudNTH · 2026-06-18T21:51:51Z

@claude review

claude · 2026-06-18T21:52:37Z

Claude finished @JulianCloudNTH's task in 2m 3s —— View job

Code Review: WebGPU sigmoid op

Read PR diff
Compare against sibling ops (add/mul)
Review correctness, resource management, edge cases
Post review

Clean, well-scoped op that faithfully mirrors the add/mul pattern. WGSL is correct (1/(1+exp(-x)) with the idx >= num_elements guard), the generated header's wgsl-sha256 matches sigmoid.wgsl, and resource lifetimes match the established convention (shader/bgl/pipeline-layout released after the bind group is built; uniform buffer dropped after the bind group takes a ref). Nice cleanup using the shared utils::make_uniform helper instead of inlining the mapped-at-creation dance like add does.

Main issue — op won't actually be delegated / no test

aten.sigmoid.default is registered in the runtime via WEBGPU_REGISTER_OP, but it is not added to WEBGPU_SUPPORTED_OPS in backends/webgpu/test/tester.py:21, and there's no test/ops/sigmoid/test_sigmoid.py. Every other op in the runtime (add, mul, view_copy, select_copy, rms_norm) is present in that allowlist and has a matching test. As-is, nothing exercises this code path.

If a follow-up PR in the ghstack adds the partitioner allowlist entry + test, this is fine — please confirm. Otherwise this op is dead code in this PR. Note also the allowlist gates the Vulkan partitioner (WebGPU consumes the VK00 serialization), so delegation also depends on the Vulkan partitioner lowering sigmoid into the subgraph.

Add sigmoid test + allowlist entry →

Minor

Dtype guard is approximate. The nbytes % sizeof(float) != 0 check (UnaryOp.cpp:35) rejects misaligned buffers but doesn't truly verify fp32 — any 4-byte-divisible dtype (e.g. int32) slips through and gets logistic'd as float. add doesn't validate at all, so this is already stricter than the baseline; just flagging that the error message ("expected fp32") overstates what's checked. Fine to leave.
Workgroup size differs from add (64 vs 256). Not wrong — clamp_workgroup_size handles device limits either way — but if there's a rationale for the smaller size on the unary path it'd be worth a one-line comment; otherwise consider matching add for consistency.
SigmoidParams carries a single num_elements + _pad[3]. Correct for the 16-byte uniform alignment requirement and matches the WGSL Params struct. No change needed.

Nits

UnaryOp.cpp uses the modern nested namespace executorch::backends::webgpu while add/BinaryOp.cpp still uses the old three-line form — your version is the better style, just inconsistent across the directory.
Directory is sigmoid/ but the file is UnaryOp.cpp (generic). Reasonable if more unary ops will share this scaffold later; otherwise Sigmoid.cpp would match the select/Select.cpp convention.

No correctness or memory-management bugs found. The one thing to resolve before merge is whether the partitioner allowlist + test land here or later in the stack.

[ghstack-poisoned]

Update

7242a7d

[ghstack-poisoned]

JulianCloudNTH requested review from kirklandsign and larryliu0820 as code owners June 18, 2026 21:35

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 18, 2026

JulianCloudNTH temporarily deployed to cadence June 18, 2026 21:35 — with GitHub Actions Inactive

Update

8f3a7e5

[ghstack-poisoned]

meta-codesync Bot added the meta-exported label Jun 18, 2026

JulianCloudNTH temporarily deployed to cadence June 18, 2026 22:25 — with GitHub Actions Inactive

Update

f4f4492

[ghstack-poisoned]

JulianCloudNTH temporarily deployed to cadence June 22, 2026 20:40 — with GitHub Actions Inactive

JulianCloudNTH mentioned this pull request Jun 22, 2026

[ExecuTorch][WebGPU] Flatten landed-op test dirs to test/ops/test_<op>.py #20435

Open

Update

0444e23

[ghstack-poisoned]

JulianCloudNTH temporarily deployed to cadence June 23, 2026 20:35 — with GitHub Actions Inactive

This was referenced Jun 23, 2026

[ExecuTorch][WebGPU] Add clone op (aten.clone.default) #20463

Open

[ExecuTorch][WebGPU] Add aten.index.Tensor (1D-self gather) #20464

Open

[ExecuTorch][WebGPU] aten.index.Tensor test suite (export + native golden) #20465

Open

Update

1674bd6

[ghstack-poisoned]

JulianCloudNTH temporarily deployed to cadence June 23, 2026 22:25 — with GitHub Actions Inactive

Update

9016b7f

[ghstack-poisoned]

JulianCloudNTH temporarily deployed to cadence June 25, 2026 17:24 — with GitHub Actions Inactive

JulianCloudNTH requested a deployment to cadence June 25, 2026 17:24 — with GitHub Actions In progress

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ExecuTorch][WebGPU] Add sigmoid op (aten.sigmoid.default)#20390

[ExecuTorch][WebGPU] Add sigmoid op (aten.sigmoid.default)#20390
JulianCloudNTH wants to merge 6 commits into
gh/JulianCloudNTH/39/basefrom
gh/JulianCloudNTH/39/head

JulianCloudNTH commented Jun 18, 2026 •

edited

Loading

Uh oh!

pytorch-bot Bot commented Jun 18, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 18, 2026

Uh oh!

JulianCloudNTH commented Jun 18, 2026

Uh oh!

claude Bot commented Jun 18, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

JulianCloudNTH commented Jun 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented Jun 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20390

❗ 1 Active SEVs

Uh oh!

github-actions Bot commented Jun 18, 2026

This PR needs a release notes: label

Uh oh!

JulianCloudNTH commented Jun 18, 2026

Uh oh!

claude Bot commented Jun 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Review: WebGPU sigmoid op

Main issue — op won't actually be delegated / no test

Minor

Nits

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

JulianCloudNTH commented Jun 18, 2026 •

edited

Loading

pytorch-bot Bot commented Jun 18, 2026 •

edited

Loading

This PR needs a `release notes:` label

claude Bot commented Jun 18, 2026 •

edited

Loading