Add NVTX annotations to cuda.compute user-facing APIs #6906

Copilot · 2025-12-08T12:59:42Z

Description

closes #3395

Instruments all user-facing functions in cuda.compute with NVTX annotations for Nsight Systems profiling visibility.

Implementation

Created _nvtx.py module with @annotate() decorator wrapping nvtx.annotate()
Applied decorator to:
- All algorithm functions: reduce_into, inclusive_scan, exclusive_scan, transforms, histogram, sorts, etc.
- All make_<algorithm> factory functions
- All __call__ methods of algorithm classes
- All iterator factory functions: CountingIterator, ConstantIterator, TransformIterator, etc.
Added nvtx dependency to pyproject.toml
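
A minimal sketch of what such a `_nvtx.py` module might look like, based only on the description above; it is not the code merged in this PR. The constant names `NVIDIA_GREEN` and `DOMAIN`, the `message` override, and the import fallback are assumptions added so the sketch runs even where `nvtx` is not installed. `scale_values` is a hypothetical stand-in for a user-facing function.

```python
try:
    import nvtx  # real package, see https://nvtx.readthedocs.io
except ImportError:
    nvtx = None  # assumption: fall back to a no-op so the sketch still runs

NVIDIA_GREEN = 0x76B900  # hex RGB color from the PR description
DOMAIN = "cuda.compute"  # passed as a plain string, not a Domain object


def annotate(message=None, domain=DOMAIN, color=NVIDIA_GREEN):
    """Return a decorator that wraps a function in an NVTX range.

    If no message is given, the function name is used as the range label.
    """

    def decorator(func):
        if nvtx is None:
            return func  # nothing to annotate with; leave the function as-is
        name = message or func.__name__
        return nvtx.annotate(message=name, color=color, domain=domain)(func)

    return decorator


@annotate()
def scale_values(values, factor):
    # Hypothetical user-facing function; the range would be labeled "scale_values".
    return [v * factor for v in values]


result = scale_values([1, 2, 3], 2)
```

Keeping the domain and color as defaults in one place means each call site needs only a bare `@annotate()`.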

Configuration

Domain: "cuda.compute"
Color: 0x76B900 (NVIDIA green)
Message: Function name (automatic)

Example

import cuda.compute as compute

# Now visible in Nsight Systems under "cuda.compute" domain
compute.reduce_into(d_in, d_out, op, num_items, init, stream)
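
To illustrate which calls would produce ranges when both a `make_<algorithm>` factory and the `__call__` of the object it returns are annotated, here is a rough sketch. It uses a stand-in decorator that records function names in a list instead of opening NVTX ranges, so it runs without a GPU or the `nvtx` package. `make_reduce`, `_Reduce`, and `recorded` are hypothetical names, not part of cuda.compute.

```python
import functools

recorded = []  # stand-in for the profiler timeline


def annotate(func):
    """Stand-in for the PR's decorator: records the name instead of opening an NVTX range."""

    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        recorded.append(func.__name__)
        return func(*args, **kwargs)

    return wrapper


class _Reduce:
    """Hypothetical algorithm object returned by the factory."""

    def __init__(self, op):
        self.op = op

    @annotate
    def __call__(self, values, init):
        return functools.reduce(self.op, values, init)


@annotate
def make_reduce(op):
    # Hypothetical make_<algorithm> factory: build once, call many times.
    return _Reduce(op)


reducer = make_reduce(lambda a, b: a + b)
total = reducer([1, 2, 3], 0)
```

With this arrangement the construction step and each later invocation are recorded separately, which is presumably why the issue asks for both to be annotated.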

Checklist

New or existing tests cover these changes.
The documentation is up to date with these changes.

Original prompt

This section details the original issue you should resolve.

<issue_title>Add nvtx annotations around all functions in cuda.compute.</issue_title>
<issue_description>The user facing APIs in cuda.compute should be annotated using nvtx, so that they show up in Nsight Systems</issue_description>

<agent_instructions>Use https://nvtx.readthedocs.io/en/latest/index.html, introducing nvtx as a dependency in pyproject.toml. For most use cases, the high-level decorator @nvtx.annotate() should suffice. Use the domain name cuda.compute and use NVIDIA green (76B900) as the color. Be sure to also annotate the user-facing iterator creation functions. Be sure to also annotate the make_<algorithm> functions and their __call__ methods.</agent_instructions>

Comments on the Issue (you are @copilot in this section)

Fixes Add nvtx annotations around all functions in cuda.compute. #6905

copy-pr-bot · 2025-12-08T12:59:46Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

shwina · 2025-12-08T14:59:05Z

/ok to test 4ad5fe0

shwina · 2025-12-08T19:59:39Z

/ok to test e3f280a

github-actions · 2025-12-08T21:06:31Z

🥳 CI Workflow Results

🟩 Finished in 1h 04m: Pass: 100%/48 | Total: 12h 16m | Max: 41m 52s

See results here.

Initial plan

37a6bc0

Copilot AI assigned Copilot and shwina Dec 8, 2025

github-project-automation bot added this to CCCL Dec 8, 2025

github-project-automation bot moved this to Todo in CCCL Dec 8, 2025

Copilot started work on behalf of shwina December 8, 2025 13:00 View session

Copilot AI and others added 4 commits December 8, 2025 13:10

Add NVTX annotations to all cuda.compute functions and iterators

7b88310

Co-authored-by: shwina <[email protected]>

Fix NVTX domain to use string instead of Domain object and update mypy config

7586e4f

Co-authored-by: shwina <[email protected]>

Fix comment clarity for NVIDIA_GREEN color

04439d8

Clarify color parameter format in docstring

4ad5fe0

Co-authored-by: shwina <[email protected]>

Copilot AI changed the title ~~[WIP] Add nvtx annotations around functions in cuda.compute~~ Add NVTX annotations to cuda.compute user-facing APIs Dec 8, 2025

Copilot AI requested a review from shwina December 8, 2025 13:21

Copilot finished work on behalf of shwina December 8, 2025 13:21

shwina marked this pull request as ready for review December 8, 2025 14:59

shwina requested a review from a team as a code owner December 8, 2025 14:59

cccl-authenticator-app bot moved this from Todo to In Review in CCCL Dec 8, 2025

Merge branch 'main' into copilot/add-nvtx-annotations-cuda-compute

e3f280a
