[ET-VK][qconv] Add layout-agnostic general shader for quantized conv by pytorchbot · Pull Request #17265 · pytorch/executorch

pytorchbot · 2026-02-05T23:28:55Z

This PR was created by the merge bot to help merge the original PR into the main branch.
ghstack PR number: #17219 by @SS-JIA
^ Please use this as the source of truth for the PR details, comments, and reviews
ghstack PR base: https://github.com/pytorch/executorch/tree/gh/SS-JIA/408/base
ghstack PR head: https://github.com/pytorch/executorch/tree/gh/SS-JIA/408/head
Merge bot PR base: https://github.com/pytorch/executorch/tree/gh/SS-JIA/406/orig
Merge bot PR head: https://github.com/pytorch/executorch/tree/gh/SS-JIA/408/orig
Differential Revision: D92307252
@diff-train-skip-merge

pytorch-bot · 2026-02-05T23:28:59Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17265

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Pull Request resolved: #17219 The existing quantized conv2d implementation (`conv2d_q8ta_q8csw_q8to`) only supports the 4W4C memory layout. This limits its use when models require different tensor layouts. This change introduces a new general-purpose quantized conv2d shader (`q8ta_conv2d`) that works with any memory layout by using BufferMetadata for tensor indexing. The routing logic determines which implementation to use based on input/output layouts: when both are 4W4C, the existing optimized path is used; otherwise, the new general shader handles the computation. This enables quantized conv2d to work seamlessly across 4C1W, 4W4C, and 4C memory layouts. Key changes: - New GLSL shader `q8ta_conv2d.glsl` using layout specialization constants - New `Q8taConv2d.cpp` with operator registration and workgroup size heuristics - Refactored routing in QuantizedConvolution.cpp to dispatch based on layout - Extended test coverage to validate all three memory layouts Authored with Claude. ghstack-source-id: 338638545 @exported-using-ghexport Differential Revision: [D92307252](https://our.internmc.facebook.com/intern/diff/D92307252/)

github-actions · 2026-02-06T00:37:48Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

pytorchbot requested review from SS-JIA, kirklandsign and larryliu0820 as code owners February 5, 2026 23:28

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 5, 2026

SS-JIA force-pushed the gh/SS-JIA/406/orig branch from 013230f to 9f3ac65 Compare February 6, 2026 00:29

Base automatically changed from gh/SS-JIA/406/orig to main February 6, 2026 00:36

SS-JIA force-pushed the gh/SS-JIA/408/orig branch from a1e4685 to 23849ec Compare February 6, 2026 00:37

SS-JIA approved these changes Feb 6, 2026

View reviewed changes

SS-JIA merged commit 67ff1b8 into main Feb 6, 2026
27 of 28 checks passed

SS-JIA deleted the gh/SS-JIA/408/orig branch February 6, 2026 00:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ET-VK][qconv] Add layout-agnostic general shader for quantized conv#17265

[ET-VK][qconv] Add layout-agnostic general shader for quantized conv#17265
SS-JIA merged 1 commit intomainfrom
gh/SS-JIA/408/orig

pytorchbot commented Feb 5, 2026

Uh oh!

pytorch-bot bot commented Feb 5, 2026 •

edited

Loading

Uh oh!

Uh oh!

github-actions bot commented Feb 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

pytorchbot commented Feb 5, 2026

Uh oh!

pytorch-bot bot commented Feb 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17265

Uh oh!

Uh oh!

github-actions bot commented Feb 6, 2026

This PR needs a release notes: label

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pytorch-bot bot commented Feb 5, 2026 •

edited

Loading

This PR needs a `release notes:` label