
Model Costs and Cached Tokens #4835

Open
Leon0402 opened this issue Dec 27, 2024 · 1 comment
@Leon0402
What feature would you like to be added?

Previously there were fields client.total_usage_summary and planner.client.actual_usage_summary reporting the number of tokens used and their cost. There is a class

```python
from dataclasses import dataclass

@dataclass
class RequestUsage:
    prompt_tokens: int      # tokens sent in the prompt
    completion_tokens: int  # tokens generated in the response
```

but I think that, apart from the logic being flawed (see #4769, #4719), it also lacks important fields, most notably the cost and the number of cached tokens.
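
For illustration, here is a minimal sketch of what an extended usage record could look like. The field names `cached_tokens` and `cost`, the default values, and the helper `estimate_cost` are assumptions for this sketch, not AutoGen's actual API; per-token prices vary by provider and model.

```python
from dataclasses import dataclass

@dataclass
class RequestUsage:
    prompt_tokens: int
    completion_tokens: int
    cached_tokens: int = 0  # assumed field: prompt tokens served from the provider's cache
    cost: float = 0.0       # assumed field: total cost of the request in USD

def estimate_cost(usage: RequestUsage,
                  prompt_price: float,
                  completion_price: float,
                  cached_price: float) -> float:
    """Hypothetical cost estimate; prices are per token and provider-specific.

    Cached prompt tokens are typically billed at a discounted rate,
    so they are separated out from the uncached prompt tokens.
    """
    uncached_prompt = usage.prompt_tokens - usage.cached_tokens
    return (uncached_prompt * prompt_price
            + usage.cached_tokens * cached_price
            + usage.completion_tokens * completion_price)
```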

I think this should also be mentioned in the Migration Guide.

Why is this needed?

This was available previously in v2 and seems very useful to have in v4 as well.

ekzhu changed the title from LLM Costs in v4 to Model Costs and Cached Tokens on Dec 27, 2024
ekzhu added this to the 0.4.1 milestone on Dec 27, 2024
@ekzhu (Collaborator) commented Dec 27, 2024

Thanks @Leon0402 for the issue. Yes, it is important. We are planning to address this after the 0.4.0 release. For now, let's target 0.4.1 for this one.

#4769 and #4719 must be resolved before tackling this one.
