feat: added option to select default inferencing runtime #2893
Conversation
Copilot wasn't able to review any files in this pull request.
Files not reviewed (1)
- packages/backend/package.json: Language not supported
Force-pushed from 105c01f to 26409bf.
We cannot have a single default across inference servers that do not handle the same kind of files; for example, you cannot default to llamacpp instead of whispercpp, because a model handled by whispercpp cannot be handled by llamacpp.
The default needs to be specific to GGUF models, or to safetensors. The choice should be between llamacpp, openvino, and vllm.
This means this PR should be delayed until after the following PRs.
In the meantime we cannot have a default, as the inference provider depends on the model, not the other way around.
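To make the constraint concrete, here is a minimal sketch of a per-format mapping (the type and constant names, and the exact format-to-runtime assignments, are illustrative assumptions, not the extension's actual data model):

```ts
// Illustrative sketch only: names and mappings are assumptions.
type ModelFormat = 'gguf' | 'safetensors';
type InferenceRuntime = 'llama-cpp' | 'whisper-cpp' | 'openvino' | 'vllm';

// A default runtime only makes sense per model format: a runtime can only be
// offered as a default for formats it can actually load.
const RUNTIMES_BY_FORMAT: Record<ModelFormat, InferenceRuntime[]> = {
  gguf: ['llama-cpp'],
  safetensors: ['openvino', 'vllm'],
};

function candidateDefaults(format: ModelFormat): InferenceRuntime[] {
  return RUNTIMES_BY_FORMAT[format] ?? [];
}
```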
Force-pushed from 964fb7f to df86e95.
Force-pushed from 1b11247 to df86e95.
Force-pushed from df86e95 to b790b86.
I did not configure anything, but when I go to the Settings page it tells me that llama-cpp is the preferred runtime (I would expect it to be none), and when I go to the Recipes page I see only one recipe (object detection).
So I think we are missing the no-value case.
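A minimal sketch of handling the no-value case on the backend side, assuming the preference is read through the podman-desktop configuration API (the section and key names here are assumptions):

```ts
import { configuration } from '@podman-desktop/api';

// 'ai-lab' and 'inferenceRuntime' are assumed names for illustration.
// When the user has not configured anything, treat the preference as 'none'
// so that neither the Settings page nor the Recipes page filters anything.
const preferredRuntime =
  configuration.getConfiguration('ai-lab').get<string>('inferenceRuntime') ?? 'none';

const shouldFilterByRuntime = preferredRuntime !== 'none';
```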
LGTM.
However, I thought you said you would add a select in ModelSelect so that we can select models even if they are not for the preferred runtime?
In ModelSelect there is a mechanism that filters the models based on the selected recipe, i.e. the runtime for that recipe.
I think ModelSelect is used in other contexts?
Yes, it is used in 3 or 4 places I think. Should I remove the filter from ModelSelect then?
Should I now also add the openvino filter, since the openvino PR is now closed?
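For reference, a rough sketch of the filtering being discussed (the helper and field names are hypothetical): filter the list by the recipe's backend, but fall back to the full list so that models for other runtimes remain selectable:

```ts
interface ModelInfo {
  id: string;
  name: string;
  backend?: string; // runtime serving this model, e.g. 'llama-cpp' or 'openvino'
}

// Hypothetical helper: keep the recipe-based filter, but never return an empty
// list, so the user can still pick a model targeting another runtime.
function selectableModels(models: ModelInfo[], recipeBackend?: string): ModelInfo[] {
  if (!recipeBackend) return models;
  const filtered = models.filter(model => model.backend === recipeBackend);
  return filtered.length > 0 ? filtered : models;
}
```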
Force-pushed from 8a848a6 to aa62f29.
LGTM
@axel7083 can you PTAL?
I think the behavior is fine, and we have a couple of PRs waiting to be merged after this one, which will make the experience more logical. What we are proposing to the user is to use different inferencing runtimes with AI Lab. With this ticket, the entire scope of AI Lab will default to a specific runtime. In other situations (and probably the most common case), users will most likely change the inferencing runtime at the time they start one: for example, when starting a recipe, serving a model, or starting a playground. We have the corresponding tickets for that: #2612, #2613. So the current behavior is good from my point of view.
Dismissing my change request as @slemeur agrees to go ahead with this change.
My personal opinion is that this feature will not be used, because it is a configuration deep in the settings, and having it increases the complexity of AI Lab for QE and maintenance.
What does this PR do?
Adds a setting to Preferences to choose the default inference runtime.
Implements the mockup in the issue.
Screenshot / video of UI
Screencast_20250428_150354.webm
What issues does this PR fix or reference?
Closes #2611
How to test this PR?
Select a different inference runtime -> models should be filtered accordingly.
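A rough sketch of how that expectation could be checked in a unit test, assuming vitest and the hypothetical selectableModels helper from the sketch above:

```ts
import { expect, test } from 'vitest';
import { selectableModels } from './modelSelect'; // hypothetical module path

test('models are filtered by the selected inference runtime', () => {
  const models = [
    { id: 'a', name: 'granite', backend: 'llama-cpp' },
    { id: 'b', name: 'resnet', backend: 'openvino' },
  ];
  // Only the openvino model remains when openvino is the selected runtime.
  expect(selectableModels(models, 'openvino').map(m => m.id)).toEqual(['b']);
  // No runtime selected: nothing is filtered out.
  expect(selectableModels(models, undefined)).toHaveLength(2);
});
```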