KFI-203 Improve thread safety of packing in convolve_kleidiai.cpp #26575

Colm-in-Arm · 2025-11-14T10:13:28Z

Description

Make the cache objects that hold packed data thread_local rather than static.

Motivation and Context

Both LHS and RHS packing use a cache mechanism based on a static unordered map, so parallel inference sessions can interfere with each other. This change makes both caches thread_local.
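As a rough illustration of the pattern described above (not the code in convolve_kleidiai.cpp), the sketch below keeps a packed-data cache in a thread_local unordered map. Every name here (`PackedBuffer`, `PackWeights`, `GetPackedWeights`, `CachesAreIndependent`) and the choice of a raw pointer as the cache key are assumptions made for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <thread>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for a packed-weights buffer.
using PackedBuffer = std::vector<float>;

// Hypothetical packing routine: a real kernel would reorder the data,
// this one only copies it so the example stays small.
static PackedBuffer PackWeights(const float* data, std::size_t n) {
    return PackedBuffer(data, data + n);
}

// Returns the packed form of `data`, packing at most once per thread.
// The map is thread_local, so each thread owns a private instance and
// lookups and inserts never touch another thread's map.
const PackedBuffer& GetPackedWeights(const float* data, std::size_t n) {
    assert(data != nullptr);
    thread_local std::unordered_map<const float*, PackedBuffer> cache;
    auto it = cache.find(data);
    if (it == cache.end()) {
        it = cache.emplace(data, PackWeights(data, n)).first;
    }
    // References to unordered_map elements stay valid across rehashing.
    return it->second;
}

// Packs the same weights on two threads and reports whether each thread
// got its own cache entry (compared by address while both are alive).
bool CachesAreIndependent() {
    static const float weights[4] = {1.0f, 2.0f, 3.0f, 4.0f};
    const auto here =
        reinterpret_cast<std::uintptr_t>(&GetPackedWeights(weights, 4));
    std::uintptr_t there = 0;
    std::thread worker([&there] {
        there = reinterpret_cast<std::uintptr_t>(&GetPackedWeights(weights, 4));
    });
    worker.join();
    return here != there;
}
```

With a function-local `static` map instead, all threads would share one container, and concurrent inserts into an unsynchronized `std::unordered_map` are a data race. The trade-off of `thread_local` is that each thread packs and stores its own copy, and a thread's entries are destroyed when that thread exits.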

hariharans29 · 2025-11-14T18:13:00Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2025-11-14T18:13:23Z

Azure Pipelines successfully started running 4 pipeline(s).

Copilot

Pull Request Overview

This PR improves thread safety in the Kleidiai convolution implementation by converting cache storage from static to thread_local scope. This prevents potential data races and interference when multiple inference sessions run in parallel threads.

Key changes:

RHS (weights) cache converted from static to thread_local
LHS (input indirection) cache converted from static to thread_local
Updated comments to explain the thread_local rationale

onnxruntime/core/mlas/lib/kleidiai/convolve_kleidiai.cpp

hariharans29 · 2025-11-14T18:35:59Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2025-11-14T18:36:18Z

Azure Pipelines successfully started running 4 pipeline(s).

hariharans29

LGTM. Thanks.

hariharans29 · 2025-11-14T19:51:39Z

It will need this to fix the failing pipeline: #26559

hariharans29 · 2025-11-17T21:55:23Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2025-11-17T21:55:42Z

Azure Pipelines successfully started running 4 pipeline(s).

hariharans29 · 2025-11-18T20:34:53Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2025-11-18T20:35:12Z

Azure Pipelines successfully started running 4 pipeline(s).

hariharans29 · 2025-11-18T21:12:14Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2025-11-18T21:12:33Z

Azure Pipelines successfully started running 4 pipeline(s).

hariharans29 · 2025-11-19T22:46:04Z

/azp run Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2025-11-19T22:46:10Z

No pipelines are associated with this pull request.

* Both LHS and RHS packing utilize a cache mechanism based on a static unordered map. There's the potential for interference between parallel inference sessions. Made both structures thread_local. Signed-off-by: Colm Donelan <[email protected]>

hariharans29 · 2025-11-20T18:33:51Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2025-11-20T18:34:10Z

Azure Pipelines successfully started running 4 pipeline(s).

hariharans29 requested a review from Copilot November 14, 2025 18:13

Copilot started reviewing on behalf of hariharans29 November 14, 2025 18:14

Copilot finished reviewing on behalf of hariharans29 November 14, 2025 18:15

Copilot AI reviewed Nov 14, 2025

onnxruntime/core/mlas/lib/kleidiai/convolve_kleidiai.cpp

hariharans29 previously approved these changes Nov 14, 2025

hariharans29 requested a review from edgchen1 November 14, 2025 18:37

edgchen1 approved these changes Nov 15, 2025

edgchen1 previously approved these changes Nov 15, 2025

Colm-in-Arm dismissed stale reviews from edgchen1 and hariharans29 via e6f7b21 November 17, 2025 21:11

Colm-in-Arm force-pushed the KFI-203 branch from 1b60764 to e6f7b21 Compare November 17, 2025 21:11

hariharans29 approved these changes Nov 17, 2025

hariharans29 enabled auto-merge (squash) November 17, 2025 21:55

hariharans29 closed this Nov 18, 2025

auto-merge was automatically disabled November 18, 2025 20:34
Pull request was closed

hariharans29 reopened this Nov 18, 2025

hariharans29 closed this Nov 18, 2025

hariharans29 reopened this Nov 18, 2025

hariharans29 closed this Nov 18, 2025

hariharans29 reopened this Nov 18, 2025

hariharans29 enabled auto-merge (squash) November 19, 2025 17:48

hariharans29 closed this Nov 19, 2025

auto-merge was automatically disabled November 19, 2025 22:46
Pull request was closed

hariharans29 reopened this Nov 19, 2025

Colm-in-Arm force-pushed the KFI-203 branch from e6f7b21 to 475ee3d Compare November 20, 2025 16:40

hariharans29 closed this Nov 20, 2025

hariharans29 reopened this Nov 20, 2025

hariharans29 enabled auto-merge (squash) November 20, 2025 23:26

hariharans29 approved these changes Nov 20, 2025

3 participants