Skip to content

Implemented a new resource class for Openshift virtualization #203

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
May 6, 2025

Conversation

QuanMPhm
Copy link
Contributor

@QuanMPhm QuanMPhm commented Apr 3, 2025

Closes nerc-project/operations#820. This PR consists of 2 commits. The first one updates the account manager pin, and the second contains the implementation for the new resource class. More details in the commit message.

Copy link
Collaborator

@knikolla knikolla left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor comment, otherwise seems good to me.

Needs final review from Naved.

@QuanMPhm QuanMPhm force-pushed the add_vm_gpu_quotas branch from 7edfc77 to 76c11c9 Compare April 8, 2025 14:00
@QuanMPhm QuanMPhm force-pushed the add_vm_gpu_quotas branch from 76c11c9 to a1e9877 Compare April 10, 2025 15:15
@QuanMPhm QuanMPhm requested a review from naved001 April 10, 2025 18:41
Copy link
Contributor

@naved001 naved001 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good to me, but the tests are failing

@QuanMPhm QuanMPhm force-pushed the add_vm_gpu_quotas branch 2 times, most recently from 3aaaad7 to e9f1942 Compare April 16, 2025 21:06
@QuanMPhm
Copy link
Contributor Author

@naved001 @knikolla The CI error is caused by the fact that even though this PR updated the version pin for the account manager, the account manager's deployment manifest still uses an ancient image version. The exact reason why an error occured is because this PR introduced new quota attributes, and the account manager restricts quota attributes to a certain list (and this is, of course, out-dated)

I guess this means we can either make a new release of the account manger image, or update the CI file in this repo to build its own image of the account manager. I'll wait for your opinions before moving forward.

@QuanMPhm QuanMPhm requested a review from naved001 April 20, 2025 04:17
@knikolla
Copy link
Collaborator

@QuanMPhm I made a new release of acct-mgt v0.3.5

@QuanMPhm QuanMPhm force-pushed the add_vm_gpu_quotas branch 2 times, most recently from d748204 to abe82b6 Compare May 6, 2025 18:19
- The CI file has been changed to also run the openshift vm function test.
- New VM-specific GPU attributes have been added
- `add_openshift_resource` now has an `vm` option to create
Openshift Virtualization resources
- The `QUOTA_KEY_MAPPING` in `openshift.py` has been moved into
the allocator class, which is then overidden by the Openshift VM
allocator
@QuanMPhm QuanMPhm force-pushed the add_vm_gpu_quotas branch from abe82b6 to 4e216a5 Compare May 6, 2025 18:30
@QuanMPhm
Copy link
Contributor Author

QuanMPhm commented May 6, 2025

@knikolla @naved001 The CI errors have finally been fixed. I'll let you guys decide when to merge this.

@knikolla knikolla merged commit c533baa into nerc-project:main May 6, 2025
6 checks passed
@knikolla
Copy link
Collaborator

knikolla commented May 6, 2025

@knikolla @naved001 The CI errors have finally been fixed. I'll let you guys decide when to merge this.

No time like the present :)

@QuanMPhm
Copy link
Contributor Author

QuanMPhm commented May 6, 2025

@knikolla 🔥 ✍️ 🔥

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

OpenShift Virtualization: Update coldfront to create updated resourcequota for passthroughed GPUs.
3 participants