GitHub workflows improvements #259

sarahyurick · 2024-09-24T17:44:28Z

There are a couple of GitHub Actions I want to add to NeMo Curator:

1. GPU CI (Add GPU CI/CD #253 is already open for this)
2. Auto-label PRs with the gpuci label if the PR (1) is created by a user with write access and (2) modifies at least one Python file
3. Automate the Changelog build
4. Automate PyPI releases
5. Display suggested style changes when the pre-commit CI fails (e.g., with Ruff)

After the first one is merged, I can open separate PRs to add the others.
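
The auto-label rule in item 2 can be sketched as a small predicate. This is only an illustration, not the workflow itself: the function name is hypothetical, and it assumes the author's permission arrives as one of GitHub's collaborator permission strings ("admin", "write", "read", "none") and that the changed file paths are already known.

```python
from typing import Iterable

# Assumption: permission strings follow GitHub's collaborator permission
# values; only these two are treated as "write access" here.
WRITE_LEVELS = {"admin", "write"}


def should_add_gpuci_label(author_permission: str, changed_files: Iterable[str]) -> bool:
    """Return True when both conditions from item 2 hold (hypothetical helper)."""
    # Condition (1): the PR author has write access.
    has_write = author_permission in WRITE_LEVELS
    # Condition (2): at least one modified file is a Python file.
    touches_python = any(path.endswith(".py") for path in changed_files)
    return has_write and touches_python


if __name__ == "__main__":
    print(should_add_gpuci_label("write", ["nemo_curator/foo.py", "README.md"]))
```

Keeping the rule in one pure function makes it easy to unit test separately from whatever workflow ends up calling it.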


sarahyurick · 2024-09-24T17:46:53Z

Note from Oliver about item 2:
"This file didn’t work because using secrets.GITHUB_TOKEN is not permitted to trigger other workflows. GH disallows that to prevent spam. You would need to use your personal access token (PAT). However, since we’re in a public environment, you would need to limit that to a protected branch (via environments). It’s still doable (from branch main, react to all issue change events), but a bit more tricky. As long as the PAT is not exposed to all branches, this is fine."

sarahyurick · 2024-09-24T17:52:05Z

Additional tasks outside of GitHub:

Set up a nightly or weekly job that runs these same PyTests with RAPIDS Nightly Builds to identify any upstream breakages (closed with Run GPU Tests on Rapids Nightly at midnight UTC #463)
Releasebot on Slack
CI alerts on Slack

sarahyurick · 2024-09-25T19:16:01Z

NeMo files of interest:

sarahyurick · 2024-10-07T21:44:25Z

Suggestion from @praateekmahajan: parametrizing the GPU tests with commonly used clients.

get_client(cluster_type="gpu", protocol="ucx")
get_client(cluster_type="gpu", protocol="tcp")
Maybe even set_torch_to_use_rmm/enable_spilling (we shouldn't try all the permutations, just the ones where we expect different behavior)

Depending on how many GPUs the node has, we could even try a multi-GPU setup.
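
One possible shape for this parametrization is a hand-picked list of client configurations rather than a full cross product. The sketch below makes several assumptions: the names `GPU_CLIENT_CONFIGS` and `config_id` are hypothetical, `set_torch_to_use_rmm` and `enable_spilling` are treated as keyword arguments to `get_client`, and the particular combinations chosen are placeholders for "the ones where we expect different behavior".

```python
from typing import Any, Dict, List

# Hand-picked configurations, not every permutation. Which combinations
# actually behave differently is an assumption to be revisited.
GPU_CLIENT_CONFIGS: List[Dict[str, Any]] = [
    {"cluster_type": "gpu", "protocol": "ucx"},
    {"cluster_type": "gpu", "protocol": "tcp"},
    {"cluster_type": "gpu", "protocol": "tcp", "set_torch_to_use_rmm": True},
    {"cluster_type": "gpu", "protocol": "tcp", "enable_spilling": True},
]


def config_id(config: Dict[str, Any]) -> str:
    """Build a readable test ID from the options that vary between configs."""
    return "-".join(f"{key}={value}" for key, value in config.items() if key != "cluster_type")


if __name__ == "__main__":
    for config in GPU_CLIENT_CONFIGS:
        print(config_id(config))
```

In a test module, the list could be passed to `pytest.mark.parametrize("client_kwargs", GPU_CLIENT_CONFIGS, ids=config_id)`, with each test (or a fixture) calling `get_client(**client_kwargs)`.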

sarahyurick · 2024-10-30T18:42:43Z

Regarding the gpuCI label:

Always have it run (without having to re-add the label) for Nvidians/RAPIDS members
For non-verified users we could use /okay to test comments
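
The two rules above amount to a simple gate, sketched here under stated assumptions: the function name is hypothetical, "trusted" stands in for Nvidian/RAPIDS membership however that ends up being checked, and the sketch assumes an `/okay to test` comment only counts when a trusted member posts it.

```python
from typing import List, Tuple

APPROVAL_COMMENT = "/okay to test"


def should_run_gpu_ci(author_is_trusted: bool, comments: List[Tuple[str, bool]]) -> bool:
    """Decide whether GPU CI should run for a PR (hypothetical helper).

    comments holds (comment_body, commenter_is_trusted) pairs.
    """
    # Trusted authors always get a run, with no label re-adding needed.
    if author_is_trusted:
        return True
    # Otherwise, require an approval comment from a trusted commenter
    # (assumption: approval by untrusted users is ignored).
    return any(
        commenter_is_trusted and body.strip() == APPROVAL_COMMENT
        for body, commenter_is_trusted in comments
    )


if __name__ == "__main__":
    print(should_run_gpu_ci(False, [("/okay to test", True)]))
```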

sarahyurick · 2025-01-17T22:02:32Z

Another check: unused imports.

sarahyurick · 2025-01-18T00:38:24Z

Another: #488.

sarahyurick added the enhancement New feature or request label Sep 24, 2024

sarahyurick self-assigned this Sep 24, 2024

sarahyurick mentioned this issue Sep 24, 2024

Add GPU CI/CD #253

Merged

sarahyurick removed the enhancement New feature or request label Sep 24, 2024
