Skip to content

monitoring: add consolidated workloads CPU/memory dashboard#209

Draft
ejahnGithub wants to merge 3 commits intosigstore:mainfrom
ejahnGithub:workloads-cpu-memory-dashboard
Draft

monitoring: add consolidated workloads CPU/memory dashboard#209
ejahnGithub wants to merge 3 commits intosigstore:mainfrom
ejahnGithub:workloads-cpu-memory-dashboard

Conversation

@ejahnGithub
Copy link
Copy Markdown

Summary

Adds a single GCP Monitoring dashboard Workloads CPU & Memory that consolidates CPU and memory across all Sigstore GKE workloads (grouped by namespace / container_name), so oncallers do not have to navigate multiple metric pages while investigating resource issues.

The dashboard includes:

  • CPU usage in cores (rate of kubernetes.io/container/cpu/core_usage_time)
  • Memory used (non-evictable bytes)
  • CPU & memory limit utilization (% of container limit, REDUCE_MAX so a single hot replica is visible)
  • CPU & memory request utilization (% of container request, REDUCE_MAX)
  • Container restart deltas (5m)
  • Node CPU allocatable utilization

Wired in via a new google_monitoring_dashboard.workloads resource in gcp/modules/monitoring/infra/dashboards.tf (same pattern as the existing clients, timestamp_authority, and rekor_v1 dashboards).

Testing

  • JSON syntax validated
  • terraform fmt -recursive clean

Will roll out to staging first via the usual Sigstore CI flow once merged.

Issue

Resolves sigstore/public-good-instance#1122

Adds a single GCP Monitoring dashboard that surfaces CPU and memory
across all Sigstore GKE workloads (grouped by namespace / container),
so oncall does not have to navigate multiple metric pages while
investigating resource issues.

The dashboard includes:
  - CPU usage in cores (rate of core_usage_time)
  - Memory used (non-evictable bytes)
  - CPU/memory limit utilization (REDUCE_MAX so a hot replica is visible)
  - CPU/memory request utilization (REDUCE_MAX)
  - Container restart deltas
  - Node CPU allocatable utilization

Resolves sigstore/public-good-instance#1122

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
@ejahnGithub ejahnGithub requested a review from a team as a code owner May 5, 2026 19:26
@ejahnGithub ejahnGithub marked this pull request as draft May 5, 2026 19:34
Eugene Jahn and others added 2 commits May 6, 2026 14:00
The xyChart threshold schema does not accept color/direction for these
chart types; the dashboard create rejects them. Keep just the value.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Heights of 16 in a 12-column mosaic produced very tall narrow tiles.
Use h=4 (standard) for charts and keep h=4 for the overview banner.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant