Fix Windows unittest CI: force CPU-only build (CUDA 13.2 toolkit on runner breaks _portable_lib load) by Gasoonjia · Pull Request #20527 · pytorch/executorch

Gasoonjia · 2026-06-26T04:37:51Z

Summary

Fixes the Windows unittest CI breakage introduced by #20440 (Add CUDA 13.2 support and drop unsupported 12.8/12.9).

unittest / windows, unittest-editable / windows, and unittest-release / windows have been red on main since c0643f5 (parent was green).

Root cause

The Windows CI image ships CUDA toolkits on PATH (it has both v13.2 and v13.0; nvcc resolves to 13.2.78).

install_executorch auto-enables the CUDA backend when install_utils.is_cuda_available() returns True (setup.py ~L882-889), and that check is driven purely by the nvcc version being in SUPPORTED_CUDA_VERSIONS.

Before Add CUDA 13.2 support and drop unsupported 12.8/12.9 #20440: 13.2 ∉ SUPPORTED_CUDA_VERSIONS → is_cuda_available() = False → CPU-only build → green.
After Add CUDA 13.2 support and drop unsupported 12.8/12.9 #20440: adding (13, 2) makes is_cuda_available() = True on the Windows runner → setup.py flips -DEXECUTORCH_BUILD_CUDA=ON. But the unittest jobs install CPU torch, so the CUDA build of _portable_lib can't find its CUDA DLLs:

ImportError: DLL load failed while importing _portable_lib: The specified module could not be found.

That aborts pytest collection (24 errors during collection) and fails the job.

Fix

Add a -cpuOnly switch to the shared .ci/scripts/setup-windows.ps1 that forces -DEXECUTORCH_BUILD_CUDA=OFF via CMAKE_ARGS, and pass it from the CPU unittest workflow (_unittest.yml). This restores the pre-#20440 CPU-only behavior for these jobs.

The CUDA Windows jobs (cuda-windows.yml) call the same script without -cpuOnly, so they are unaffected and keep building CUDA.

Note / follow-up

The deeper issue is that the auto-detection keys off nvcc presence rather than whether the installed torch is actually a CUDA build. A more general fix would be to only enable EXECUTORCH_BUILD_CUDA when torch.version.cuda is set. Left out here to keep the unblock low-risk; happy to follow up.

pytorch-bot · 2026-06-26T04:37:55Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20527

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 Unclassified Failures

As of commit d2b3fac with merge base 6021a58 ():

UNCLASSIFIED FAILURES - DrCI could not classify the following jobs because the workflow did not run on the merge base. The failures may be pre-existing on trunk or introduced by this PR:

Build Aarch64 Linux Wheels / pytorch/executorch / build-wheel-py3_10-cpu-aarch64 (gh) (this job did not run on the merge base, so DrCI cannot tell whether the failure is pre-existing)
/__w/executorch/executorch/pytorch/executorch/backends/apple/coreml/runtime/inmemoryfs/inmemory_filesystem.cpp:722:48: error: ‘inmemoryfs::InMemoryFileSystem::InMemoryNode::Kind’ has not been declared
Build Aarch64 Linux Wheels / pytorch/executorch / upload / upload-wheel-py3_10-cpu-aarch64 (gh) (this job did not run on the merge base, so DrCI cannot tell whether the failure is pre-existing)
Unable to download artifact(s): Artifact not found for name: pytorch_executorch__3.10_cpu_aarch64

This comment was automatically generated by Dr. CI and updates every 15 minutes.

linux-foundation-easycla · 2026-06-26T04:38:02Z

The committers listed above are authorized under a signed CLA.

✅ login: Gasoonjia / name: gasoonjia (952e121)

github-actions · 2026-06-26T04:38:38Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

…failure The Windows CI image ships CUDA toolkits on PATH. After adding (13, 2) to SUPPORTED_CUDA_VERSIONS (#20440), install_executorch's auto-detection (setup.py: is_cuda_available() via nvcc) started returning True on the Windows runner (which has the CUDA 13.2 toolkit), so it flipped EXECUTORCH_BUILD_CUDA=ON. But the unittest jobs install CPU torch, so the resulting CUDA build of _portable_lib fails to load its CUDA DLLs at import time: ImportError: DLL load failed while importing _portable_lib causing all pytest collection to error out (unittest / unittest-editable / unittest-release on windows). Add a -cpuOnly switch to setup-windows.ps1 that forces -DEXECUTORCH_BUILD_CUDA=OFF via CMAKE_ARGS, and pass it from the CPU unittest workflow. The CUDA Windows jobs (cuda-windows.yml) keep the default and are unaffected.

…LL load failure Same root cause as the unittest fix in this PR, second site. The Windows wheel build (build-wheels-windows.yml -> .ci/scripts/wheel/) does not go through setup-windows.ps1. The Windows CI image has the CUDA 13.2 toolkit on PATH, so after #20440 added (13, 2) to SUPPORTED_CUDA_VERSIONS, install_executorch's auto-detection enables EXECUTORCH_BUILD_CUDA and bakes a CUDA _portable_lib + aoti_cuda_shims.lib into the CPU wheel. The smoke test then fails with: ImportError: DLL load failed while importing _portable_lib Windows wheels are CPU-only (with-cuda: disabled), so force -DEXECUTORCH_BUILD_CUDA=OFF via CMAKE_ARGS in pre_build_script.sh on Windows.

digantdesai · 2026-06-26T14:59:51Z

Arm Linux Wheels - preexisting
unittest sam export - transient - rerunning it

Gasoonjia temporarily deployed to cadence June 26, 2026 04:37 — with GitHub Actions Inactive

Gasoonjia had a problem deploying to cadence June 26, 2026 04:37 — with GitHub Actions Error

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 26, 2026

Gasoonjia force-pushed the fix-windows-cuda-autodetect branch from 747da69 to 952e121 Compare June 26, 2026 04:58

Gasoonjia temporarily deployed to cadence June 26, 2026 04:59 — with GitHub Actions Inactive

Gasoonjia temporarily deployed to cadence June 26, 2026 05:29 — with GitHub Actions Inactive

digantdesai approved these changes Jun 26, 2026

View reviewed changes

digantdesai merged commit 16ecb3f into main Jun 26, 2026
603 of 607 checks passed

digantdesai deleted the fix-windows-cuda-autodetect branch June 26, 2026 15:00

Reubend added a commit to Reubend/executorch that referenced this pull request Jun 26, 2026

added Vulkan guard to mirror pytorch#20527

3c8d7b2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix Windows unittest CI: force CPU-only build (CUDA 13.2 toolkit on runner breaks _portable_lib load)#20527

Fix Windows unittest CI: force CPU-only build (CUDA 13.2 toolkit on runner breaks _portable_lib load)#20527
digantdesai merged 2 commits into
mainfrom
fix-windows-cuda-autodetect

Gasoonjia commented Jun 26, 2026

Uh oh!

pytorch-bot Bot commented Jun 26, 2026 •

edited

Loading

Uh oh!

linux-foundation-easycla Bot commented Jun 26, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 26, 2026

Uh oh!

digantdesai commented Jun 26, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

Gasoonjia commented Jun 26, 2026

Summary

Root cause

Fix

Note / follow-up

Uh oh!

pytorch-bot Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20527

❌ 2 Unclassified Failures

Uh oh!

linux-foundation-easycla Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 26, 2026

This PR needs a release notes: label

Uh oh!

digantdesai commented Jun 26, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pytorch-bot Bot commented Jun 26, 2026 •

edited

Loading

linux-foundation-easycla Bot commented Jun 26, 2026 •

edited

Loading

This PR needs a `release notes:` label