fix: ensure usage is requested if telemetry is enabled #3571

mhdawson · 2025-09-26T19:57:46Z

What does this PR do?

When telemetry is enabled the router uncondionally expects the usage attribute to be availble and fails if it is not present.

Usage is not currently being requested by litellm_openai_mixin.py for streaming requests when using the responses API which means that providers like vertexai fail if telemetry is enabled and streaming is used.

This is part of the required fix. Other part is in liteLLM, will plan to submit PR for that soon.

Test Plan

I applied this change along with the change for litellm in a llama stack deployment and validated that I could make streaming requests through the responses API to a gemini model and they would succeed instead of failing due to the missing usage attribute when telemetry is enabled.

Refs: llamastack#3420 When telemetry is enabled the router uncondionally expects the usage attribute to be availble and fails if it is not present. Telemetry is not currently being requested by litellm_openai_mixin.py for streaming requests which means that providers like vertexai fail if telemetry is enabled and streaming is used. This is part of the required fix. Other part is in litell, will plan to submit PR for that soon. Signed-off-by: Michael Dawson <[email protected]>

Refs: llamastack/llama-stack#3571 Llama stack unconditionally expects usage information when using Responses API and streaming when telemetry is enabled. For full details see llamastack/llama-stack#3571. Debugging that issue revealed that LiteLLM does not honour a request for usage when streaming and using the vertex api. This PR adds that reporting using the same function as used elsewhere. Signed-off-by: Michael Dawson <[email protected]>

ashwinb

seems reasonable to me. @ehhuang please take a look once.

cdoern

nice. thank you

ehhuang · 2025-09-29T21:09:04Z

LG

mhdawson · 2025-09-30T15:05:26Z

For reference here is the PR in liteLLM - BerriAI/litellm#14961

mhdawson requested review from ashwinb, yanxi0830, hardikjshah, raghotham, ehhuang, terrytangyuan, leseb, bbrowning, reluctantfuturist, mattf and slekkala1 as code owners September 26, 2025 19:57

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Sep 26, 2025

mhdawson mentioned this pull request Sep 26, 2025

fix: fix usage reporting with CustomWrapper BerriAI/litellm#14961

Open

4 tasks

ashwinb approved these changes Sep 27, 2025

View reviewed changes

cdoern approved these changes Sep 27, 2025

View reviewed changes

ehhuang merged commit ddf3f17 into llamastack:main Sep 29, 2025
23 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: ensure usage is requested if telemetry is enabled #3571

fix: ensure usage is requested if telemetry is enabled #3571

Uh oh!

mhdawson commented Sep 26, 2025 •

edited

Loading

Uh oh!

ashwinb left a comment

Uh oh!

cdoern left a comment

Uh oh!

ehhuang commented Sep 29, 2025

Uh oh!

Uh oh!

mhdawson commented Sep 30, 2025

Uh oh!

Uh oh!

fix: ensure usage is requested if telemetry is enabled #3571

fix: ensure usage is requested if telemetry is enabled #3571

Uh oh!

Conversation

mhdawson commented Sep 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Test Plan

Uh oh!

ashwinb left a comment

Choose a reason for hiding this comment

Uh oh!

cdoern left a comment

Choose a reason for hiding this comment

Uh oh!

ehhuang commented Sep 29, 2025

Uh oh!

Uh oh!

mhdawson commented Sep 30, 2025

Uh oh!

Uh oh!

mhdawson commented Sep 26, 2025 •

edited

Loading