-
Notifications
You must be signed in to change notification settings - Fork 12
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Move exporter to stable release 2.1.0
Includes the following changes: 1. Update DCGM version to 2.0.13 (includes MIG metrics) 2. Added error handling for setcap in the entrypoint script - this fixes an issue where the setcap command fails in OpenShift, in turn causing a CrashLoopBackoff error for the dcgm-exporter pod 3. Renamed entrypoint script 4. Fixed typo in the DCGM download URL for UBI8 5. Updated Helm chart versioning and metadata 6. Updated documentation
- Loading branch information
1 parent
488614d
commit 4d93acc
Showing
11 changed files
with
96 additions
and
65 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,8 +1,17 @@ | ||
apiVersion: v1 | ||
apiVersion: v2 | ||
name: dcgm-exporter | ||
description: A Helm chart for DCGM exporter | ||
version: "2.1.0" | ||
kubeVersion: ">= 1.13.0" | ||
appVersion: "2.1.0" | ||
sources: | ||
- https://gitlab.com/nvidia/container-toolkit/gpu-monitoring-tools | ||
version: "1.1.0" | ||
appVersion: "2.0.10" | ||
kubeVersion: ">= 1.13.0" | ||
home: https://github.com/nvidia/gpu-monitoring-tools/ | ||
icon: https://assets.nvidiagrid.net/ngc/logos/DCGM.png | ||
keywords: | ||
- gpu | ||
- cuda | ||
- compute | ||
- monitoring | ||
- telemetry | ||
- tesla |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,21 @@ | ||
#!/usr/bin/env bash | ||
set -euo pipefail | ||
|
||
# We want to setcap only when the container is started with the right caps | ||
DCGM_EXPORTER=$(readlink -f $(which dcgm-exporter)) | ||
if [ -z "$NO_SETCAP" ]; then | ||
setcap 'cap_sys_admin=+ep' $DCGM_EXPORTER | ||
if [ $? -eq 0 ]; then | ||
if ! $DCGM_EXPORTER -v 1>/dev/null 2>/dev/null; then | ||
>&2 echo "Warning #2: dcgm-exporter doesn't have sufficient privileges to expose profiling metrics. To get profiling metrics with dcgm-exporter, use --cap-add SYS_ADMIN" | ||
setcap 'cap_sys_admin=-ep' $DCGM_EXPORTER | ||
fi | ||
else | ||
>&2 echo "Warning #1: dcgm-exporter doesn't have sufficient privileges to expose profiling metrics. To get profiling metrics with dcgm-exporter, use --cap-add SYS_ADMIN" | ||
fi | ||
|
||
fi | ||
|
||
# Pass the command line arguments to dcgm-exporter | ||
set -- $DCGM_EXPORTER "$@" | ||
exec "$@" |
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters