Conversation

@VaishnaviHire (Collaborator) commented Dec 5, 2025

Fixes #164
Exposes a workers field in the LlamaStackDistribution CR.

Example CR:

spec:
  server:
    containerSpec:
      env:
        - name: OLLAMA_INFERENCE_MODEL
          value: 'llama3.2:1b'
        - name: OLLAMA_URL
          value: 'http://ollama-server-service.ollama-dist.svc.cluster.local:11434'
      name: llama-stack
    distribution:
      name: starter
    workers: 2
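
For context, a minimal sketch of how the new field might be surfaced in the operator's Go API types. This is not the PR's actual code: the struct name and the placeholder types are assumptions; only the Workers doc comment and the *int32 shape come from the discussion below.

package v1alpha1 // hypothetical package name for the CRD types

// Placeholder types so the sketch compiles; the real operator defines these.
type (
	DistributionType struct {
		Name string `json:"name"`
	}
	ContainerSpec struct {
		Name string `json:"name"`
	}
)

// ServerSpec sketches the server section of the CR above.
type ServerSpec struct {
	Distribution  DistributionType `json:"distribution"`
	ContainerSpec ContainerSpec    `json:"containerSpec,omitempty"`
	// Workers configures the number of uvicorn worker processes to run.
	// See https://fastapi.tiangolo.com/deployment/server-workers/.
	// +optional
	Workers *int32 `json:"workers,omitempty"`
}

With workers: 2 in the CR above, the operator passes --workers 2 to uvicorn, per the entrypoint change later in this review.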

@VaishnaviHire (Collaborator Author) commented:

@mergify rebase

mergify bot commented Dec 8, 2025

rebase

✅ Branch has been successfully rebased

@leseb (Collaborator) left a comment:

I think this is the premise of adding more of run.yaml into the CRD, and I think we should think about how we want to lay things out.

Should we go with:

DistributionServer *DistributionServerSpec

And

type DistributionServerSpec struct {
  Workers  *int32
}

Something like this? I think we need a way to encapsulate workers into another section.

Thoughts?

EDIT: OK, discussed offline:

  • we will revisit this design when we introduce config properties in the CRD; that will be a new CRD version
  • we just need to make sure the workers value matches the resources/requests

Distribution DistributionType `json:"distribution"`
ContainerSpec ContainerSpec `json:"containerSpec,omitempty"`
PodOverrides *PodOverrides `json:"podOverrides,omitempty"` // Optional pod-level overrides
// Workers configures the number of uvicorn worker processes to run.
Collaborator commented:

Should we link https://fastapi.tiangolo.com/deployment/server-workers/ for more docs, to better explain the usage?

@VaishnaviHire (Collaborator Author) replied:

Done

@VaishnaviHire force-pushed the add_workers branch 3 times, most recently from 5f7df7e to dafd7d7 on December 9, 2025 at 17:31
@VaishnaviHire (Collaborator Author) commented:

> EDIT: OK, discussed offline:
>
>   • we will revisit this design when we introduce config properties in the CRD; that will be a new CRD version
>   • we just need to make sure the workers value matches the resources/requests

Updated

@leseb (Collaborator) left a comment:

One nit; thanks for the follow-up.

  // resolveContainerResources ensures the container always has CPU and memory
  // requests defined so that HPAs using utilization metrics can function.
- func resolveContainerResources(spec llamav1alpha1.ContainerSpec) corev1.ResourceRequirements {
+ func resolveContainerResources(spec llamav1alpha1.ContainerSpec, workers int32, workersSet bool) corev1.ResourceRequirements {
Collaborator commented:

I think we need to log somewhere that we are setting the resources based on the value of workers.

@VaishnaviHire (Collaborator Author) replied:

Included it in the API docs and added a log.
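
For illustration, a minimal sketch of what the resulting helper could look like, assuming default per-worker requests are multiplied by the worker count and a log line records the derivation. The base quantities, package name, and the simplified signature (taking corev1.ResourceRequirements instead of the operator's ContainerSpec) are assumptions, not the PR's actual implementation:

package controllers // hypothetical package name

import (
	"log"

	corev1 "k8s.io/api/core/v1"
	"k8s.io/apimachinery/pkg/api/resource"
)

// Illustrative per-worker baselines; not the operator's real defaults.
const (
	baseCPUMilli    = 500               // 500m CPU per worker
	baseMemoryBytes = 512 * 1024 * 1024 // 512Mi per worker
)

// resolveContainerResources (sketch): keep explicit user requests untouched,
// otherwise derive CPU/memory requests from the worker count so that HPAs
// using utilization metrics can function.
func resolveContainerResources(user corev1.ResourceRequirements, workers int32, workersSet bool) corev1.ResourceRequirements {
	if user.Requests != nil {
		return user // the user set requests explicitly; respect them
	}
	n := int64(1)
	if workersSet && workers > 0 {
		n = int64(workers)
	}
	log.Printf("setting default container resources based on workers=%d", n)
	return corev1.ResourceRequirements{
		Requests: corev1.ResourceList{
			corev1.ResourceCPU:    *resource.NewMilliQuantity(baseCPUMilli*n, resource.DecimalSI),
			corev1.ResourceMemory: *resource.NewQuantity(baseMemoryBytes*n, resource.BinarySI),
		},
	}
}

The real signature, per the diff above, takes llamav1alpha1.ContainerSpec; the multiplication and the log line here only illustrate the behavior discussed in this thread.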

@VaishnaviHire (Collaborator Author) commented:

@mergify rebase

mergify bot commented Dec 10, 2025

rebase

✅ Branch has been successfully rebased

@leseb (Collaborator) left a comment:

final nit 🙏🏻

- 2) llama stack run /etc/llama-stack/run.yaml ;;
- *) echo "Invalid version code: $VERSION_CODE, using new CLI"; llama stack run /etc/llama-stack/run.yaml ;;
+ 2) exec uvicorn llama_stack.core.server.server:create_app --host 0.0.0.0 --port "$PORT" --workers "$WORKERS" --factory ;;
+ *) exec uvicorn llama_stack.core.server.server:create_app --host 0.0.0.0 --port "$PORT" --workers "$WORKERS" --factory ;;
Collaborator commented:

Can we add the log back in here?

@VaishnaviHire (Collaborator Author) replied:

Np, done

@VaishnaviHire (Collaborator Author) commented:

@mergify rebase

mergify bot commented Dec 11, 2025

rebase

✅ Branch has been successfully rebased

Development

Successfully merging this pull request may close these issues:

Add LLS worker capability