* feat: Add initial Flow Control metrics
Introduces initial Prometheus metrics for the experimental Flow Control
layer in EPP.
This change adds the following metrics:
- inference_extension_flow_control_request_queue_duration_seconds:
A histogram to track the total time requests spend in the Flow
Control layer, from invocation of EnqueueAndWait to final outcome.
- inference_extension_flow_control_queue_size:
A gauge to track the number of requests currently being managed by
the Flow Control layer.
Both metrics are labeled by fairness_id and priority; the duration
metric is additionally labeled by outcome.
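
The measurement window described above (gauge incremented when EnqueueAndWait is entered, then gauge decremented and duration observed once a final outcome is known) can be sketched as follows. This is a minimal standard-library illustration, not the code in this change: `memRecorder`, `flowKey`, `track`, and the outcome strings are hypothetical stand-ins, and a real implementation would record into Prometheus collectors instead of in-memory maps.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// flowKey holds the labels shared by both metrics (hypothetical type).
type flowKey struct {
	fairnessID string
	priority   string
}

// durationKey adds the outcome label used only by the duration metric.
type durationKey struct {
	flowKey
	outcome string
}

// memRecorder is a hypothetical in-memory stand-in for a metrics backend.
type memRecorder struct {
	mu        sync.Mutex
	queueSize map[flowKey]int
	durations map[durationKey][]time.Duration
}

func newMemRecorder() *memRecorder {
	return &memRecorder{
		queueSize: make(map[flowKey]int),
		durations: make(map[durationKey][]time.Duration),
	}
}

// addQueueSize adjusts the gauge for one label set by delta.
func (r *memRecorder) addQueueSize(k flowKey, delta int) {
	r.mu.Lock()
	defer r.mu.Unlock()
	r.queueSize[k] += delta
}

// observeDuration records one duration sample for a label set and outcome.
func (r *memRecorder) observeDuration(k flowKey, outcome string, d time.Duration) {
	r.mu.Lock()
	defer r.mu.Unlock()
	dk := durationKey{flowKey: k, outcome: outcome}
	r.durations[dk] = append(r.durations[dk], d)
}

// size returns the current gauge value for a label set.
func (r *memRecorder) size(k flowKey) int {
	r.mu.Lock()
	defer r.mu.Unlock()
	return r.queueSize[k]
}

// count returns how many duration samples exist for a label set and outcome.
func (r *memRecorder) count(k flowKey, outcome string) int {
	r.mu.Lock()
	defer r.mu.Unlock()
	return len(r.durations[durationKey{flowKey: k, outcome: outcome}])
}

// track wraps a blocking call (a stand-in for EnqueueAndWait). The gauge is
// raised on entry; on return it is lowered and the elapsed time is recorded
// under whatever outcome the call reported.
func track(r *memRecorder, k flowKey, wait func() (string, error)) error {
	start := time.Now()
	r.addQueueSize(k, 1)
	outcome := "unknown" // kept if wait never reports an outcome
	defer func() {
		r.addQueueSize(k, -1)
		r.observeDuration(k, outcome, time.Since(start))
	}()
	var err error
	outcome, err = wait()
	return err
}

func main() {
	rec := newMemRecorder()
	k := flowKey{fairnessID: "tenant-a", priority: "0"}
	_ = track(rec, k, func() (string, error) {
		fmt.Println("in flight:", rec.size(k))
		return "dispatched", nil // hypothetical outcome value
	})
	fmt.Println("after:", rec.size(k))
	fmt.Println("samples:", rec.count(k, "dispatched"))
}
```

Doing the decrement and the observation in a single deferred function keeps the gauge and the histogram consistent on every return path, including early error returns.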
* Rebase onto HEAD and resolve conflicts.
Help: metricsutil.HelpMsgWithStability("Distribution of the total time requests spend in the EPP flow control layer, measured from the start of the EnqueueAndWait call until a final outcome is reached.", compbasemetrics.ALPHA),
Help: metricsutil.HelpMsgWithStability("Current number of requests being actively managed by the EPP flow control layer, from the start of the EnqueueAndWait call until a final outcome is reached.", compbasemetrics.ALPHA),