Fix BaseOpenAIChatCompletionClient token usage #4770

gziz · 2024-12-20T06:00:03Z

Why are these changes needed?

To correctly track token usage in BaseOpenAIChatCompletionClient. More information is in the linked issue.

Related issue number

#4769

Checks

I've included any doc changes needed for https://microsoft.github.io/autogen/. See https://microsoft.github.io/autogen/docs/Contribute#documentation to build and test documentation locally.
I've added tests (if relevant) corresponding to the changes introduced in this PR.
I've made sure all auto checks have passed.

gziz · 2024-12-20T17:34:39Z

python/packages/autogen-ext/src/autogen_ext/models/openai/_openai_client.py

@@ -561,8 +554,7 @@ async def create(
            logprobs=logprobs,
        )

-        _add_usage(self._actual_usage, usage)


I'm wondering what's the difference between these two...

jackgerrits · Dec 26, 2024

Traditionally, actual vs. total is a matter of whether the tokens were cached or not. But since we don't have caching at the moment, they should be identical.

gziz · Dec 26, 2024

Thanks @jackgerrits! I have updated the PR to use both actual_usage and total_usage.

jackgerrits · 2024-12-27T13:39:32Z

@gziz The original issue was that we're not using the returned value of the function where we add usage. I'd rather have a smaller change where we just assign that value back to the appropriate variable. This change couples actual and total usage into the same function.
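To illustrate the point about the discarded return value, here is a minimal sketch. It assumes the usage helper returns a new object rather than mutating its arguments; `Usage`, `add_usage`, and `Client` are hypothetical stand-ins, not the actual autogen definitions.

```python
from dataclasses import dataclass


@dataclass
class Usage:  # hypothetical stand-in for the client's usage record
    prompt_tokens: int
    completion_tokens: int


def add_usage(a: Usage, b: Usage) -> Usage:
    """Return a new Usage holding the sums; neither argument is modified."""
    return Usage(
        prompt_tokens=a.prompt_tokens + b.prompt_tokens,
        completion_tokens=a.completion_tokens + b.completion_tokens,
    )


class Client:  # hypothetical, only to show the two call patterns
    def __init__(self) -> None:
        self.actual_usage = Usage(0, 0)

    def record_buggy(self, usage: Usage) -> None:
        # The sum is computed and then thrown away, so the counter never moves.
        add_usage(self.actual_usage, usage)

    def record_fixed(self, usage: Usage) -> None:
        # The smaller fix: assign the returned value back.
        self.actual_usage = add_usage(self.actual_usage, usage)
```

With this shape, calling `record_buggy` leaves `actual_usage` at zero, while `record_fixed` accumulates as expected.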

If we want to make all this simpler, we can implement __add__ and __iadd__.

That way we can update the code to simply be:

self._actual_usage += usage
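The snippet that accompanied this suggestion is not preserved here. A rough sketch of what implementing `__add__` and `__iadd__` could look like follows, assuming the usage type is a dataclass with `prompt_tokens` and `completion_tokens` fields; the class name `Usage` is hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Usage:  # hypothetical name; the field names are an assumption
    prompt_tokens: int
    completion_tokens: int

    def __add__(self, other: "Usage") -> "Usage":
        # a + b builds a new object and leaves both operands untouched.
        if not isinstance(other, Usage):
            return NotImplemented
        return Usage(
            self.prompt_tokens + other.prompt_tokens,
            self.completion_tokens + other.completion_tokens,
        )

    def __iadd__(self, other: "Usage") -> "Usage":
        # a += b updates a in place and returns it.
        if not isinstance(other, Usage):
            return NotImplemented
        self.prompt_tokens += other.prompt_tokens
        self.completion_tokens += other.completion_tokens
        return self


total = Usage(0, 0)
total += Usage(10, 5)
total += Usage(3, 2)
```

One design point to keep in mind: because this `__iadd__` mutates in place, two attributes that point at the same object would both change on `+=`. Omitting `__iadd__` makes `+=` fall back to `__add__` and rebind the name to a fresh object instead.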

Fix openai client token usage (microsoft#4769)

f9cf23a

gziz changed the title ~~Fix openai client token usage (#4769)~~ Fix BaseOpenAIChatCompletionClient token usage Dec 20, 2024

gziz mentioned this pull request Dec 20, 2024

Total token usage and latency metrics should be reflected in TaskResult and Response #4719

Open


gziz requested a review from ekzhu December 21, 2024 01:26

ekzhu requested review from lspinheiro and jackgerrits and removed request for ekzhu December 22, 2024 02:03

gziz added 2 commits December 26, 2024 09:39

Include actual_usage in add_usage function

0136055

Merge branch 'main' into fix_client_token_track

813029c
