
Time-slicing with multiple GPUs - asking for ability to block single GPU #465

Open
Alexbay218 opened this issue Jan 4, 2023 · 3 comments

Comments

Alexbay218 commented Jan 4, 2023


1. Issue or feature description

I'm looking for the ability to configure the scheduler to perform the exact opposite of the behavior specified in #386.

Instead of grabbing GPU resources evenly from all GPUs on the node, I'd like a config option to grab from one GPU at a time. This would allow some applications to get exclusive access to a single GPU as needed, while letting the rest time-share.
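For illustration, here is the shape of the operator's time-slicing ConfigMap with a hypothetical `allocationPolicy: packed` field marking where such a knob could live; `allocationPolicy` is not an existing option, purely a sketch:

```yaml
apiVersion: v1
kind: ConfigMap
metadata:
  name: time-slicing-config
  namespace: gpu-operator
data:
  any: |-
    version: v1
    sharing:
      timeSlicing:
        # Hypothetical knob (does not exist today): fill one physical
        # GPU's replicas before spilling onto the next GPU.
        allocationPolicy: packed
        resources:
          - name: nvidia.com/gpu
            replicas: 2
```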

2. Steps to reproduce the issue

  1. Perform a fresh installation of the GPU operator in a cluster whose nodes have more than one GPU.
  2. Enable time-slicing and configure it to allow 2 replicas per GPU.
  3. Start a pod that requests 2 of the GPU extended resource (see the example manifest below).
  4. The pod should have exclusive access to a single GPU, but instead it gets access to two GPUs (as intended by #386, "Time-slicing with multiple GPUs - asking for two GPUs puts both slots on a single GPU").
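
A minimal pod manifest for step 3 might look like the following (the name and image are illustrative):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: gpu-exclusive-test   # illustrative name
spec:
  restartPolicy: Never
  containers:
    - name: cuda
      image: nvcr.io/nvidia/cuda:12.3.1-base-ubuntu22.04  # any CUDA-capable image
      command: ["nvidia-smi", "-L"]  # lists the GPUs visible inside the container
      resources:
        limits:
          nvidia.com/gpu: 2  # two time-sliced replicas
```

Per step 4, `nvidia-smi -L` in this pod currently lists two distinct physical GPUs rather than one GPU shared twice.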
@shivamerla
Contributor

@klueska does it make sense to introduce knobs (env vars/args) to control the allocation logic during GetPreferredAllocation within the device plugin?
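
A minimal sketch of what such a knob could do inside `GetPreferredAllocation` (kubelet device plugin API, `k8s.io/kubelet/pkg/apis/deviceplugin/v1beta1`), assuming time-sliced replica IDs take the form `<gpu-uuid>::<replica>` as in the plugin's annotated-ID convention; this is illustrative, not the plugin's actual code:

```go
package timeslicing

import (
	"context"
	"sort"
	"strings"

	pluginapi "k8s.io/kubelet/pkg/apis/deviceplugin/v1beta1"
)

// physicalGPU extracts the physical-GPU part of a replicated device ID.
// Assumption: replica IDs look like "<gpu-uuid>::<replica>"; adjust the
// separator if the real format differs.
func physicalGPU(id string) string {
	return strings.SplitN(id, "::", 2)[0]
}

// preferPacked picks `size` device IDs from `available`, filling one
// physical GPU's replicas before moving on to the next. This is the
// "packed" policy this issue asks for; the current plugin instead
// spreads allocations across GPUs.
func preferPacked(available []string, size int) []string {
	byGPU := map[string][]string{}
	for _, id := range available {
		gpu := physicalGPU(id)
		byGPU[gpu] = append(byGPU[gpu], id)
	}
	// Visit GPUs with the most free replicas first, so a request for a
	// whole GPU's worth of replicas lands on a single device when possible.
	gpus := make([]string, 0, len(byGPU))
	for gpu := range byGPU {
		gpus = append(gpus, gpu)
	}
	sort.Slice(gpus, func(i, j int) bool {
		return len(byGPU[gpus[i]]) > len(byGPU[gpus[j]])
	})

	picked := make([]string, 0, size)
	for _, gpu := range gpus {
		for _, id := range byGPU[gpu] {
			if len(picked) == size {
				return picked
			}
			picked = append(picked, id)
		}
	}
	return picked
}

type plugin struct{} // stand-in for the real device-plugin server type

// GetPreferredAllocation shows where the policy would plug in. Handling
// of MustIncludeDeviceIDs is omitted for brevity.
func (p *plugin) GetPreferredAllocation(
	ctx context.Context,
	req *pluginapi.PreferredAllocationRequest,
) (*pluginapi.PreferredAllocationResponse, error) {
	resp := &pluginapi.PreferredAllocationResponse{}
	for _, cr := range req.ContainerRequests {
		ids := preferPacked(cr.AvailableDeviceIDs, int(cr.GetAllocationSize()))
		resp.ContainerResponses = append(resp.ContainerResponses,
			&pluginapi.ContainerPreferredAllocationResponse{DeviceIDs: ids})
	}
	return resp, nil
}
```

An env var or CLI flag on the plugin could then select between this packed policy and the existing spread behavior.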

@anencore94

@shivamerla I believe there are cases where we would want distributed GPU scheduling, but there are also opposite scenarios. It would be great if this setting could be easily changed via a ConfigMap or similar.


zr-idol commented Aug 4, 2024

Any update regarding this issue?
We'd also like to allocate 2 or more slices on the same GPU.
