Conversation

@junya-takayama (Collaborator)

OpenAIChatAPI uses long retry backoff durations, which is appropriate for a rate-limited commercial service.
VLLMServeLM, however, talks to a self-hosted server, where much shorter wait times are sufficient.
This PR reduces the retry wait time for VLLMServeLM accordingly.
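
For context, the change amounts to tuning the base and maximum wait of an exponential backoff per backend. The sketch below is illustrative only: the function name, parameters, and concrete wait values are assumptions, not the actual values changed in this PR.

```python
import random
import time
from functools import partial


def call_with_retries(request_fn, *, max_retries=3, base_wait=1.0, max_wait=8.0):
    """Call request_fn, retrying with capped exponential backoff on failure."""
    for attempt in range(max_retries + 1):
        try:
            return request_fn()
        except Exception:
            if attempt == max_retries:
                raise
            # Exponential backoff with a little jitter, capped at max_wait.
            wait = min(base_wait * (2 ** attempt), max_wait)
            time.sleep(wait + random.uniform(0.0, 0.1 * wait))


# Hosted API (e.g. OpenAI): long waits to respect rate limits.
openai_call = partial(call_with_retries, base_wait=4.0, max_wait=60.0)

# Self-hosted vLLM server: failures are usually transient, so retry quickly.
vllm_call = partial(call_with_retries, base_wait=0.5, max_wait=2.0)
```

Usage would look like `vllm_call(lambda: client.post(payload))`; only the backoff parameters differ between the two backends.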

@junya-takayama junya-takayama force-pushed the eliminate_wait_times_on_VLLMServeLM branch from a05cb1b to ddf90c1 on December 25, 2025 09:09
@junya-takayama junya-takayama requested a review from a team December 25, 2025 09:24
@junya-takayama junya-takayama merged commit 3b956be into main Dec 26, 2025
7 of 8 checks passed
@junya-takayama junya-takayama deleted the eliminate_wait_times_on_VLLMServeLM branch December 26, 2025 03:16