
Conversation

@doringeman (Contributor)

If the runner is defunct, then loader.load will evict it, which means it'll also remove the recorded OpenAI requests/responses for that model. Hence, record the request after loader.load returns so it doesn't get removed in the loading process.
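
Conceptually, the patch just moves the request recording to after `loader.load` returns. Below is a minimal sketch of that ordering; all names (`loader`, `recorder`, `openAIRequest`, `handle`) are hypothetical stand-ins and do not reflect the actual model-runner types or API.

```go
// Hypothetical sketch of the reordering; none of these types match the
// real model-runner code.
package sketch

import "context"

type openAIRequest struct{ Model string }
type openAIResponse struct{ Status int }

// loader.load may evict a defunct runner, which (per the description above)
// also drops that model's recorded request/response pairs.
type loader struct{}

func (l *loader) load(ctx context.Context, model string) error { return nil }

// recorder keeps the per-model request/response records served at
// /engines/requests.
type recorder struct{}

func (r *recorder) recordRequest(model string, req *openAIRequest)    {}
func (r *recorder) recordResponse(model string, resp *openAIResponse) {}

func handle(ctx context.Context, ld *loader, rec *recorder, req *openAIRequest) (resp *openAIResponse, err error) {
	// Before the patch the request was recorded *here*. If load evicted a
	// defunct runner, the fresh record was wiped along with it, and the
	// deferred response recording then found no matching request.
	if err = ld.load(ctx, req.Model); err != nil {
		return nil, err
	}

	// After the patch: record only once load (and any eviction) has finished,
	// so the deferred response recording always finds its matching request.
	rec.recordRequest(req.Model, req)
	defer func() {
		if resp != nil {
			rec.recordResponse(req.Model, resp)
		}
	}()

	resp = &openAIResponse{Status: 200}
	return resp, nil
}
```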

Without this patch, following these steps results in a lost request-response pair: the first one issued after the runner exited (or was killed) and had to be evicted because it was defunct.

$ MODEL_RUNNER_PORT=8080 make run # in a separate terminal

$ MODEL_RUNNER_HOST=http://localhost:8080 docker model run ai/smollm2 hi

$ curl -s http://localhost:8080/engines/requests\?model\=ai/smollm2 # all good, 1 record

$ pkill com.docker.llama-server

$ MODEL_RUNNER_HOST=http://localhost:8080 docker model run ai/smollm2 hi

$ # See `ERRO[0011] Model ai/smollm2 not found in records - 200` in the terminal where model-runner is running.
# This is because the request was recorded, then `loader.load` removed it,
# and then a deferred function attempted to record the response but found no matching request.
# Hence, we lost the first request-response pair.

$ curl -s http://localhost:8080/engines/requests\?model\=ai/smollm2
No records found for model 'ai/smollm2'

@doringeman requested a review from a team on June 26, 2025, 12:28

@p1-0tr left a comment:


LGTM

@doringeman merged commit fcf45f4 into docker:main on Jun 27, 2025
1 check passed
doringeman added a commit to doringeman/model-runner that referenced this pull request Oct 2, 2025
context: Use cli's current context instead of "desktop-linux"