Skip to content

Commit 386e038

Browse files
authored
feat(SPRE-1918) Kueue metrics exposure Stage Part 1 (#9201)
We are building kueue dashboard in app-sre grafana instance based on the actual dashboard created per cluster, so we can centralize the view. There are few metrics to be exposed, this will be the first batch.
1 parent 1850848 commit 386e038

File tree

1 file changed

+8
-0
lines changed

1 file changed

+8
-0
lines changed

components/monitoring/prometheus/staging/base/monitoringstack/endpoints-params.yaml

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -180,6 +180,12 @@
180180
- '{__name__="kueue_cluster_queue_status"}'
181181
- '{__name__="kueue_admission_wait_time_seconds_bucket"}'
182182
- '{__name__="up", job=~".*kueue.*"}'
183+
- '{__name__="kueue_admitted_active_workloads", cluster_queue="cluster-pipeline-queue"}'
184+
- '{__name__="kueue_pending_workloads", cluster_queue="cluster-pipeline-queue"}'
185+
- '{__name__="kueue_admitted_workloads_total"}'
186+
- '{__name__="kueue_cluster_queue_resource_reservation", cluster_queue="cluster-pipeline-queue"}'
187+
- '{__name__="kueue_cluster_queue_nominal_quota", cluster_queue="cluster-pipeline-queue"}'
188+
- '{__name__="kueue_quota_reserved_wait_time_seconds_bucket", cluster_queue="cluster-pipeline-queue"}'
183189

184190
## ArgoCD Metrics
185191
- '{__name__="argocd_app_reconcile_bucket", namespace="gitops-service-argocd"}'
@@ -211,6 +217,7 @@
211217
- '{__name__="etcd_server_id", namespace="openshift-etcd"}'
212218
- '{__name__="etcd_server_quota_backend_bytes", namespace="openshift-etcd"}'
213219
- '{__name__="etcd_mvcc_db_total_size_in_bytes", namespace="openshift-etcd"}'
220+
- '{__name__="etcd_mvcc_db_total_size_in_use_in_bytes", namespace="openshift-etcd"}'
214221
- '{__name__="etcd_server_received_total", namespace="openshift-etcd"}'
215222
- '{__name__="etcd_network_active_peers", namespace="openshift-etcd"}'
216223
- '{__name__="etcd_network_peer_round_trip_time_seconds_bucket"}'
@@ -242,6 +249,7 @@
242249
- '{__name__="kube_node_role"}'
243250
- '{__name__="kube_node_status_allocatable", resource=~"cpu|memory"}'
244251
- '{__name__="kube_node_status_condition", condition="MemoryPressure", status="true"}'
252+
- '{__name__="kube_namespace_labels", label_konflux_ci_dev_type="tenant"}'
245253

246254
## Node Metrics
247255
- '{__name__="node_cpu_seconds_total", mode="idle"}'

0 commit comments

Comments
 (0)