Conversation

@misiugodfrey (Contributor) commented Nov 24, 2025:

Adds support for running multiple workers in Docker containers on a single machine, each pinned to a separate GPU. You can now run multiple workers (up to 4), controlled through the NUM_WORKERS env variable.

I'm not aware of a way to clean up the duplication in the docker services (AFAIK you can't programmatically change the GPU parameters), so this setup declares 4 GPU services and then picks them based on the number of workers you want to run.
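Under that constraint, the duplicated services presumably look something like the sketch below. The `presto-native-worker-gpu-N` service names match the pattern in the diff; the image name and the exact environment keys are illustrative, not the PR's actual definitions:

```yaml
# Hypothetical sketch: one near-identical service per GPU, selected at
# `docker compose up` time based on NUM_WORKERS.
services:
  presto-native-worker-gpu-0:
    image: presto-native-worker:latest   # illustrative image name
    environment:
      - NVIDIA_VISIBLE_DEVICES=0         # pin this worker to GPU 0
  presto-native-worker-gpu-1:
    image: presto-native-worker:latest
    environment:
      - NVIDIA_VISIBLE_DEVICES=1         # pin this worker to GPU 1
  # ...gpu-2 and gpu-3 repeat the same pattern with their device index
```

Since the device index is the only thing that differs, the duplication is mechanical but, as noted, hard to eliminate in plain compose YAML.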

coordinator=false
# Worker REST/HTTP port for internal and admin endpoints.
-http-server.http.port=8080
+http-server.http.port=8081
misiugodfrey (author):

This port should be distinct from the port used to connect to the presto-coordinator if they are on the same machine.
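For illustration, a sketch of the two configs when coordinator and worker share a host (port values taken from the diff above; file paths are illustrative):

```properties
# etc_coordinator/config.properties (illustrative path)
http-server.http.port=8080

# etc_worker/config.properties -- must differ when co-located
http-server.http.port=8081
```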

Reviewer (Contributor):

Can you please expand on why this is required now? This should be running as a separate service on the docker network.

misiugodfrey (author):

I found it to be necessary when working in Slurm environments, but that is because there is no docker network. I'll remove this as it isn't a change that is required for the scope of this PR.

- ./config/generated/gpu/etc_worker/node.properties:/opt/presto-server/etc/node.properties
- ./config/generated/gpu/etc_worker/config_native.properties:/opt/presto-server/etc/config.properties

# These workers are available to run on one node with a single GPU pinned to each.
misiugodfrey (author):

It would have been nice to de-duplicate this code, but AFAIK we can't meta-program services with different NVIDIA_VISIBLE_DEVICES.

if [[ "$VARIANT_TYPE" == "java" ]]; then
DOCKER_COMPOSE_FILE="java"
conditionally_add_build_target $JAVA_WORKER_IMAGE $JAVA_WORKER_SERVICE "worker|w"
WORKERS="$JAVA_WORKER_SERVICE"
misiugodfrey (author):

Since we no longer necessarily want every service in a docker-compose file to run, we need to specify which default worker service should run.
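A minimal sketch of how that selection could work, assuming a helper that maps the variant and NUM_WORKERS to explicit service names (the function name is hypothetical; `presto-native-worker-gpu-N` follows the diff's naming, while `java-worker` stands in for `$JAVA_WORKER_SERVICE`):

```shell
# Hypothetical sketch: pick which worker services to start instead of
# bringing up every service defined in the compose file.
select_workers() {
  local variant="$1" num_workers="${2:-1}"
  if [ "$variant" = "java" ]; then
    echo "java-worker"   # illustrative stand-in for $JAVA_WORKER_SERVICE
    return
  fi
  # One pinned-GPU service name per requested worker.
  local services="" i
  for ((i = 0; i < num_workers; i++)); do
    services="$services presto-native-worker-gpu-$i"
  done
  echo "${services# }"   # trim the leading space
}

# docker compose only starts the named services (plus their dependencies):
# docker compose -f "$DOCKER_COMPOSE_FILE_PATH" up -d $(select_workers gpu "$NUM_WORKERS")
```

Passing explicit service names to `docker compose up` is what lets the compose file declare more services than actually run.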

fi

docker compose -f $DOCKER_COMPOSE_FILE_PATH up -d
function duplicate_worker_configs() {
misiugodfrey (author):

This is based on the way we do multiple configs for the Slurm clusters right now. This will probably be changed in the future.
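A rough sketch of what such a duplication helper could look like, assuming a template config directory is copied once per worker and each copy gets a unique HTTP port (the signature, paths, and base port are illustrative, not the PR's actual implementation):

```shell
# Hypothetical sketch: stamp out one etc_worker_<i> directory per worker
# from a shared template, bumping http-server.http.port so the workers
# can co-exist on one host.
duplicate_worker_configs() {
  local template_dir="$1" out_root="$2" num_workers="${3:-1}"
  local base_port=8081 i
  mkdir -p "$out_root"
  for ((i = 0; i < num_workers; i++)); do
    local dir="$out_root/etc_worker_$i"
    rm -rf "$dir"
    cp -r "$template_dir" "$dir"
    # Give each worker its own port so they can share a host.
    sed -i "s/^http-server\.http\.port=.*/http-server.http.port=$((base_port + i))/" \
      "$dir/config.properties"
  done
}
```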

- ./config/generated/gpu/etc_worker/config_native.properties:/opt/presto-server/etc/config.properties

# These workers are available to run on one node with a single GPU pinned to each.
presto-native-worker-gpu-0:
Reviewer (Contributor):

We seem to be hardcoding and deploying a fixed number of workers?


# Adds a cluster tag for gpu variant
-WORKER_CONFIG="${CONFIG_DIR}/etc_coordinator/config_native.properties"
+WORKER_CONFIG="${CONFIG_DIR}/etc_worker/config_native.properties"
misiugodfrey (author):

I don't think this was correct before, as we were referring to the coordinator's config as WORKER_CONFIG.


docker compose -f ../docker/docker-compose.java.yml -f ../docker/docker-compose.native-cpu.yml -f ../docker/docker-compose.native-gpu.yml down
OVERRIDE=""
[ -f ../docker/docker-compose.workers.override.yml ] && OVERRIDE="-f ../docker/docker-compose.workers.override.yml"
misiugodfrey (author):

If we generated a multi-worker override file, then we need to use it when stopping containers too, or we may leave some of them dangling.

YAML
}

function generate_worker_compose() {
misiugodfrey (author):

I've changed the setup so that the docker-compose.native-gpu file is always generated/overwritten based on the number of workers. If NUM_WORKERS is not specified, or is "1", then it should generate exactly what was there before. If NUM_WORKERS > 1 then it will generate separate services for each worker.
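A simplified sketch of that generation step, assuming a bash loop that emits one service per worker into the compose file (service names follow the `presto-native-worker-gpu-N` pattern from the diff; the image name, mounts, and function signature are illustrative):

```shell
# Hypothetical sketch: regenerate the GPU compose file from NUM_WORKERS.
# With num_workers=1 this emits a single service, matching the previous
# single-worker layout; with N > 1 it emits one pinned-GPU service each.
generate_worker_compose() {
  local num_workers="${1:-1}" out="$2"
  local i
  {
    echo "services:"
    for ((i = 0; i < num_workers; i++)); do
      cat <<YAML
  presto-native-worker-gpu-$i:
    image: presto-native-worker:latest
    environment:
      - NVIDIA_VISIBLE_DEVICES=$i
    volumes:
      - ./config/generated/gpu/etc_worker_$i:/opt/presto-server/etc
YAML
    done
  } > "$out"
}
```

Always overwriting the file, as the comment describes, keeps the generated compose file consistent with whatever NUM_WORKERS was last requested.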
