[WIP] Replace DCGM_FI_PROF_GR_ENGINE_ACTIVE with DCGM_FI_DEV_GPU_UTIL #69
base: main
[APPROVALNOTIFIER] This PR is NOT APPROVED
This pull-request has been approved by: empovit
The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing
/cc @mresvanis
* Profiling metrics DCGM_FI_PROF_* aren't available on pre-Volta GPUs
* Consolidate the default metrics into the configmap so that installing the console plugin doesn't break other tools
bf6a37a to 63e6e44
*/

/* Used metrics
DCGM_FI_PROF_GR_ENGINE_ACTIVE, gauge, gpu utilization.
II{U,R}C we chose DCGM_FI_PROF_GR_ENGINE_ACTIVE because it is also available when using MIG, while DCGM_FI_DEV_GPU_UTIL is not.
If we would like to drop MIG support for other NVIDIA GPU models, then no objection on my part (whatever makes more sense business-wise).
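To make the trade-off concrete, here is a minimal sketch of how a per-GPU choice between the two metrics could look. It assumes only what is stated in this thread: profiling metrics are unavailable on pre-Volta GPUs, and DCGM_FI_DEV_GPU_UTIL is reportedly not available with MIG. The `GpuInfo` type and `pickUtilizationMetric` function are hypothetical names for illustration and are not part of the plugin.

```typescript
// Hypothetical description of a GPU; these fields are illustrative
// assumptions, not something the plugin exposes.
interface GpuInfo {
  // false on pre-Volta GPUs, where DCGM_FI_PROF_* metrics are unavailable
  supportsProfiling: boolean;
  // true when the GPU is partitioned with MIG
  migEnabled: boolean;
}

type UtilizationMetric =
  | 'DCGM_FI_PROF_GR_ENGINE_ACTIVE'
  | 'DCGM_FI_DEV_GPU_UTIL';

// Prefer the profiling metric, since it also works with MIG; fall back to
// the device-level metric only when profiling is unavailable and MIG is off.
function pickUtilizationMetric(gpu: GpuInfo): UtilizationMetric | undefined {
  if (gpu.supportsProfiling) {
    return 'DCGM_FI_PROF_GR_ENGINE_ACTIVE';
  }
  if (!gpu.migEnabled) {
    return 'DCGM_FI_DEV_GPU_UTIL';
  }
  // Neither metric is expected to report for this combination.
  return undefined;
}
```

With this shape, replacing one metric with the other globally (as this PR does) corresponds to always taking a single branch, which is why the MIG case needs testing before merging.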
/hold until we test the changes with MIG-supported GPUs
There are two DCGM metrics for GPU utilization, depending on the GPU model/architecture:
The entire Profiling group (Group ID 8, |