Deprecate OpenAI backend once we've tested that LiteLLM can reach 500 RPS #336

vutrung96 · 2025-01-10T00:59:34Z

We have a separate implementation for OpenAI because LiteLLM could not go above 50 RPS, but they seem to have fixed the issue BerriAI/litellm#6592.

We should test this out, if we can reach 500 RPS with LiteLLM then we can deprecate OpenAI backend

vutrung96 · 2025-01-10T01:05:25Z

On the other hand, LiteLLM also has some performance issues, so maybe we shouldn't do this.....

vutrung96 · 2025-01-10T01:09:54Z

Ok, consensus seems to be that we should keep relying on our own OpenAI backend code...

vutrung96 closed this as completed Jan 10, 2025

Provide feedback