Fix C++ -Werror regressions in llama runner #19326
psiddh wants to merge 1 commit into pytorch:main
Conversation
Summary: Fixes 3 `-Werror` diagnostics that broke the qualcomm llama runner build on `cfg:android-arm64-clang19-no-san` and disabled the following test infra targets:

- `xplat/executorch/examples/qualcomm/oss_scripts/llama:runner_lib`
- `xplat/executorch/examples/qualcomm/oss_scripts/llama:runner_lib_static`
- `xplat/executorch/examples/qualcomm/oss_scripts/llama:qnn_llama_runner`
- `xplat/executorch/examples/qualcomm/oss_scripts/llama:qnn_llama_runner_static`

Three diagnostics fixed:

1. `-Wreorder-ctor` in `runner.cpp`: `attention_sink_rope_module_` is declared as the 2nd field of `Runner<T>` (right after `module_`), but the constructor initializer list appended it last, after `tokenizer_`. Moved it to the correct position in the initializer list to match declaration order. Recent regression introduced in the attention-sink diff (pytorch#16574).
2. `-Woverloaded-virtual` in `lhd_token_generator.h` and `multimodal_lhd_token_generator.h`: the derived classes define a `prepare_io(std::vector<uint64_t>, std::vector<int32_t>)` overload that hides the base-class virtual `prepare_io(uint64_t, int64_t)`. Added a `using TokenGenerator<T>::prepare_io;` declaration (and the equivalent for the multimodal hierarchy) so the base virtual stays in scope and the warning is silenced without changing behavior. Latent bug surfaced by the clang19 toolchain bump.
3. `-Wdelete-non-abstract-non-virtual-dtor` in `prompt_processor.h`: `PromptProcessor<T>` has virtual member functions but no virtual destructor, so deleting through `std::unique_ptr<PromptProcessor<T>>` in `Runner` was undefined behavior. Added `virtual ~PromptProcessor() = default;`, mirroring the pattern already used in `TokenGenerator` (`token_generator.h`). Also transitively fixes `MultimodalPromptProcessor<T>`.

Differential Revision: D103991803
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19326
Note: Links to docs will display an error until the docs builds have been completed.

❌ 15 New Failures, 6 Unrelated Failures as of commit be95216 with merge base 1debeb6.

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@psiddh has exported this pull request. If you are a Meta employee, you can view the originating Diff in D103991803.
Pull request overview
This PR addresses clang19 -Werror build regressions in the Qualcomm llama runner by fixing constructor member initialization order, correcting virtual function hiding warnings, and ensuring safe polymorphic deletion via a virtual destructor.
Changes:
- Reordered the `Runner<T>` constructor initializer list to match member declaration order (`-Wreorder-ctor`).
- Added `using ...::prepare_io` declarations in the LHD token generator subclasses to avoid hiding base virtuals (`-Woverloaded-virtual`).
- Added a virtual destructor to `PromptProcessor<T>` to avoid UB when deleting through base pointers (`-Wdelete-non-abstract-non-virtual-dtor`).
Reviewed changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.
| File | Description |
|---|---|
| examples/qualcomm/oss_scripts/llama/runner/runner.cpp | Fixes member initializer order to satisfy -Wreorder-ctor. |
| examples/qualcomm/oss_scripts/llama/runner/prompt_processor.h | Adds virtual destructor to make polymorphic deletion well-defined. |
| examples/qualcomm/oss_scripts/llama/runner/lhd_token_generator.h | Brings base prepare_io into scope to avoid -Woverloaded-virtual. |
| examples/qualcomm/oss_scripts/llama/runner/multimodal_runner/multimodal_lhd_token_generator.h | Attempts to bring base prepare_io into scope for multimodal LHD (but see review comment re: private access). |
```cpp
// Bring base class's virtual prepare_io into scope so the overload below
// does not hide it (-Woverloaded-virtual).
using MultimodalTokenGenerator<T>::prepare_io;
```