[CIR][CUDA][HIP] Set internal linkage for device variable shadows #2041

RiverDave · 2025-11-29T01:10:20Z

Device variables (__device__, __constant__) now have internal linkage for their host-side shadow variables (non-RDC mode), matching OG behavior.

github-actions · 2025-11-29T01:12:15Z

✅ With the latest revision this PR passed the C/C++ code formatter.

koparasy · 2025-12-01T18:06:59Z

I have some concerns with this. Some of the functionality you perform here should be done during lowering to LLVM IR (

clangir/clang/lib/CIR/Dialect/Transforms/LoweringPrepare.cpp

Line 1414 in 16cabe9

void LoweringPreparePass::buildCUDARegisterVars(cir::CIRBaseBuilderTy &builder,

). I believe the code that you have regarding internalize is proper here.

koparasy · 2025-12-01T17:36:44Z

clang/lib/CIR/CodeGen/CIRGenCUDANV.cpp

+  /// Keeps track of variable containing handle of GPU binary. Populated by
+  /// ModuleCtorFunction() and used to create corresponding cleanup calls in
+  /// ModuleDtorFunction()
+  llvm::GlobalVariable *gpuBinaryHandle = nullptr;


I don't see this being used at CodeGen. We handle the "gpuBinaryHandle" during "lowering". Why do you think we need this here?

Good catch, this is merely an artifact from bringing the skeleton from OG, Will remove.

koparasy · 2025-12-01T17:39:09Z

clang/lib/CIR/CodeGen/CIRGenCUDANV.cpp

+    DeviceVarFlags flags;
+  };
+
+  llvm::SmallVector<VarInfo, 16> deviceVars;


Why do you need this? Does this exist in OG?

koparasy · 2025-12-01T17:40:35Z

clang/test/CIR/CodeGen/CUDA/global-vars.cu

I see you are mixing CUDA and HIP tests here. This is ok, but we had historically split them between CUDA/HIP directories.

Good Point, I'll make sure to split both things from now on.

koparasy · 2025-12-01T17:47:11Z

clang/test/CIR/CodeGen/HIP/registration.cpp

@@ -1,4 +1,4 @@
-#include "cuda.h"
+#include "../Inputs/cuda.h"


We prefer on not having a relative path here #include <cuda.h> :D. We give th path through the -I%S/../Inputs/ flag we pass to CC1.

RiverDave · 2025-12-02T00:02:58Z

I have some concerns with this. Some of the functionality you perform here should be done during lowering to LLVM IR (

clangir/clang/lib/CIR/Dialect/Transforms/LoweringPrepare.cpp

Line 1414 in 16cabe9

void LoweringPreparePass::buildCUDARegisterVars(cir::CIRBaseBuilderTy &builder,

). I believe the code that you have regarding internalize is proper here.

Okay, it took me some time. But I realize I misled with the initial title of this PR, Registration was recently handled in your PR (thanks for that!). The vector stored in the runtime deviceVars is metadata that we need to consume and utilize when making these registration calls. The problem I have with my PR is that we're not consuming that information at the loweringPrepare when we should make use of that.

See how OG handles the deviceVars:

clangir/clang/lib/CodeGen/CGCUDANV.cpp

Line 681 in 16cabe9

for (auto &&Info : DeviceVars) {

The equivalent we have to bookkeep globals in CIR is:

clangir/clang/lib/CIR/Dialect/Transforms/LoweringPrepare.cpp

Line 1466 in 16cabe9

for (auto &[deviceSideName, global] : cudaVarMap) {

If you look at the way we're currently handling the variables to be shadowed on the host in CIR:

clangir/clang/lib/CIR/Dialect/Transforms/LoweringPrepare.cpp

Line 131 in 16cabe9

llvm::StringMap<GlobalOp> cudaVarMap;

I believe we somehow need to preserve the information coming from VarInfo, specifically in DeviceVarFlags. Doing that allows us to give special handling to the different types of globals as seen in OG:

clangir/clang/lib/CodeGen/CGCUDANV.cpp

Line 687 in 16cabe9

switch (Info.Flags.getKind()) {

RiverDave · 2025-12-02T15:21:16Z

I have some concerns with this. Some of the functionality you perform here should be done during lowering to LLVM IR (

clangir/clang/lib/CIR/Dialect/Transforms/LoweringPrepare.cpp

Line 1414 in 16cabe9

void LoweringPreparePass::buildCUDARegisterVars(cir::CIRBaseBuilderTy &builder,

). I believe the code that you have regarding internalize is proper here.

Okay, it took me some time. But I realize I misled with the initial title of this PR, Registration was recently handled in your PR (thanks for that!). The vector stored in the runtime deviceVars is metadata that we need to consume and utilize when making these registration calls. The problem I have with my PR is that we're not consuming that information at the loweringPrepare when we should make use of that.

See how OG handles the deviceVars:

clangir/clang/lib/CodeGen/CGCUDANV.cpp

Line 681 in 16cabe9

for (auto &&Info : DeviceVars) {

The equivalent we have to bookkeep globals in CIR is:

clangir/clang/lib/CIR/Dialect/Transforms/LoweringPrepare.cpp

Line 1466 in 16cabe9

for (auto &[deviceSideName, global] : cudaVarMap) {

If you look at the way we're currently handling the variables to be shadowed on the host in CIR:

clangir/clang/lib/CIR/Dialect/Transforms/LoweringPrepare.cpp

Line 131 in 16cabe9

llvm::StringMap<GlobalOp> cudaVarMap;

I believe we somehow need to preserve the information coming from VarInfo, specifically in DeviceVarFlags. Doing that allows us to give special handling to the different types of globals as seen in OG:

clangir/clang/lib/CodeGen/CGCUDANV.cpp

Line 687 in 16cabe9

switch (Info.Flags.getKind()) {

The way I see it, we have two paths:

Utilizing a richer data structure to preserve those flags and consume that information in loweringPrepare
Preserving those flags through attributes attached to Global Ops, although the implementation would take longer.

Let me know what you think

[CIR][CUDA][HIP] Handle global variable registration

7e4a8c5

RiverDave requested review from andykaylor, bcardosolopes, lanza and xlauko as code owners November 29, 2025 01:10

fix format

1b48dc2

koparasy reviewed Dec 1, 2025

View reviewed changes

RiverDave changed the title ~~[CIR][CUDA][HIP] Handle global variable registration~~ [CIR][CUDA][HIP] Set internal linkage for device variable shadows Dec 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CIR][CUDA][HIP] Set internal linkage for device variable shadows #2041

[CIR][CUDA][HIP] Set internal linkage for device variable shadows #2041

RiverDave commented Nov 29, 2025

Uh oh!

github-actions bot commented Nov 29, 2025 •

edited

Loading

Uh oh!

koparasy commented Dec 1, 2025

Uh oh!

koparasy Dec 1, 2025

Uh oh!

RiverDave Dec 1, 2025

Uh oh!

koparasy Dec 1, 2025

Uh oh!

koparasy Dec 1, 2025

Uh oh!

RiverDave Dec 1, 2025

Uh oh!

koparasy Dec 1, 2025

Uh oh!

RiverDave commented Dec 2, 2025

Uh oh!

RiverDave commented Dec 2, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[CIR][CUDA][HIP] Set internal linkage for device variable shadows #2041

Are you sure you want to change the base?

[CIR][CUDA][HIP] Set internal linkage for device variable shadows #2041

Conversation

RiverDave commented Nov 29, 2025

Uh oh!

github-actions bot commented Nov 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

koparasy commented Dec 1, 2025

Uh oh!

koparasy Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

RiverDave Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

koparasy Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

koparasy Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

RiverDave Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

koparasy Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

RiverDave commented Dec 2, 2025

Uh oh!

RiverDave commented Dec 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

github-actions bot commented Nov 29, 2025 •

edited

Loading

RiverDave commented Dec 2, 2025 •

edited

Loading