Skip to content

Releases: ome-projects/ome

OME 0.1.5

30 Mar 20:18
957f541

Choose a tag to compare

OME 0.1.5

🚀 What's New

Accelerator Class Enhancements

  • Updating AC now directly overrides runtime resources, args, and commands logic.
  • Added select-by-policy with cheapest selection to complete the Accelerator Class Selection feature.
  • small fixes: PARALLELISM_SIZE env is skipped when AC is absent from runtime and lookup is skipped when name/policy is unspecified.
  • Improved model formats selection to consider all model formats in runtime when an Accelerator Class is specified.

Diffusers model & Runtime Support

Full support has been added for the DiffUsers model family, including model parsing and one the most advanced runtime configurations to data. This release integrated the latest version of SGLang and SMG Router.

Qwen model family runtime Enhancements

Consolidate Qwen model family config parsing. Fixed quantization config parsing for Qwen models to improve import correctness.

Runtime & InferenceService Enhancements

  • Added model ready label to nodeSelector to guarantee InferenceService readiness before serving traffic.
  • Consolidated model parsing by sanitizing config.json before parsing.
  • Cleanup blocklist volume/mount in ISVC reconcile
  • Skip model init injection for FT serving with merged weights

Optimize Model Artifact Download

Optimize downloading model artifact if an identical model artifact already exists in the node with HF download.

Bug Fixes

  • Replace ome-instance-type-map with model-agent-config-map
  • Fix reconcile scaling deployments down to 1 replica
  • Fix download progress update race condition

📦 Container Images

The following container images are available:

# OME Manager
docker pull ghcr.io/moirai-internal/ome-manager:v0.1.5

# Model Agent
docker pull ghcr.io/moirai-internal/model-agent:v0.1.5

# OME Agent
docker pull ghcr.io/moirai-internal/ome-agent:v0.1.5

# Multinode Prober
docker pull ghcr.io/moirai-internal/multinode-prober:v0.1.5

⎈ Helm Installation

Option 1: OCI Registry (Recommended)

# Install directly from OCI registry
helm install ome-crd oci://ghcr.io/moirai-internal/charts/ome-crd --version 0.1.5 --namespace ome --create-namespace
helm install ome oci://ghcr.io/moirai-internal/charts/ome-resources --version 0.1.5 --namespace ome

Option 2: GitHub Releases

# Add the OME Helm repository
helm repo add ome https://github.com/sgl-project/ome/releases/download/v0.1.5
helm repo update

# Install OME
helm install ome-crd ome/ome-crd --namespace ome --create-namespace
helm install ome ome/ome-resources --namespace ome

📋 Changelog

v0.1.4...v0.1.5

📄 Software Bill of Materials

SBOMs are available in both SPDX and CycloneDX formats for:

  • Source code
  • All container images

🔐 Signatures

All container images are signed with cosign. Verify with:

cosign verify ghcr.io/moirai-internal/ome-manager:v0.1.5   --certificate-identity-regexp=https://github.com/sgl-project/ome/.github/workflows/release.yaml@refs/tags/.*   --certificate-oidc-issuer=https://token.actions.githubusercontent.com

What's Changed

  • [Misc]use ac to override resource rather than get a larger resource number between runtime and ac by @pallasathena92 in #468
  • [core] Sanitize model config file before parsing by @XinyueZhang369 in #470
  • [doc] update readme by @slin1237 in #471
  • Bump golang.org/x/sys from 0.34.0 to 0.39.0 by @dependabot[bot] in #438
  • Bump sigs.k8s.io/kind from 0.24.0 to 0.31.0 in /hack/internal/tools by @dependabot[bot] in #472
  • Bump the k8s group with 4 updates by @dependabot[bot] in #436
  • [Mics]support for user-selected runtimes to consider all model formats by @pallasathena92 in #490
  • [Misc] Cleanup blocklist volume, volume mount in inference service reconcile by @beiguo218 in #491
  • Add Prom Cloud Auth and metrics validations to Keda by @Kangyan-Zhou in #488
  • [API] Adding support for serving of DiffUsers models. by @shenoyvvarun in #489
  • [Core] Add owner reference to RBAC resources for automatic cleanup by @bcfre in #492
  • Bump github.com/gohugoio/hugo from 0.153.0 to 0.154.2 in /hack/internal/tools by @dependabot[bot] in #486
  • [Docs] Adding guide for running OME on kind cluster. by @shenoyvvarun in #495
  • [Misc]update code owner for modelagent package by @truddy0 in #496
  • Fix progress update race condition in model download by @Kangyan-Zhou in #493
  • Bump github.com/aws/aws-sdk-go-v2/service/sts from 1.38.2 to 1.41.5 by @dependabot[bot] in #487
  • Bump google.golang.org/protobuf from 1.36.10 to 1.36.11 by @dependabot[bot] in #485
  • Skip to set PARALLELISM_SIZE env variable when AC not added in runtime by @beiguo218 in #503
  • [API] introduce the logic of artifact deletion considering artifact by @truddy0 in #502
  • [OEP] Add OEP-0006: Workload Policy Layer and Unified Workload Manage by @bcfre in #505
  • [Mics]imporve ac args and command override by @pallasathena92 in #509
  • [API] introduce logic of handling model CR update with DownloadPolicy by @truddy0 in #511
  • [Mics]skip accelerator class getting if not name and policy specified by @pallasathena92 in #512
  • [Mics]update runtime and inference service crd by @pallasathena92 in #513
  • [Misc] Fix kustomization.yaml; Decouple npm registry with Oracle by @heymrbox in #515
  • [Mics]add model ready label in podspec.nodeSelector to force pod sche… by @pallasathena92 in #514
  • Surface replication error to ome agent pod status after pod termination by @chengjieyao in #522
  • [Bugfix][Core] Skip model init injection for FT serving with merged weights by @beiguo218 in #524
  • [Misc] Fix make install errors; Force update cert manager in make intall by @heymrbox in #525
  • [Helm] Update default image version to v0.1.4 by @MohanKumar21 in #510
  • [Feature] Model Agent support diffusers models. by @shenoyvvarun in #520
  • Update model capability detection for Qwen3 embedding models by @chengjieyao in #527
  • [BUG] Replace ome-instance-type-map with model-agent-config-map by @YouNeedCryDear in #528
  • Do not add grafana related annotations to k8s service by @Kangyan-Zhou in #501
  • Fix OME scale down Deployments to 1 replica temporarily whenever it changes something by @fzyzcjy in #412
  • [Helm] Add OCI Compute Shape B200 support in model agent by @heymrbox in #529
  • (feat)(fix) Fix Qwen quantization parsing and add Qwen 3.5 model config tests by @YouNeedCryDear in #533
  • Bump docker/build-push-action from 5 to 7 by @dependabot[bot] in #539
  • Bump actions/download-artifact from 7 to 8 by @dependabot[bot] in #530
  • Bump actions/upload-artifact from 6 to 7 by @dependabot[bot] in #532
  • Bump docker/login-action from 3 to 4 by @dependabot[bot] in #537
  • [API]Change ApiCapabilities to array of strings by @XinyueZhang369 in #544
  • [Core]add select by policy for acceleratorclass by @pallasathena92 in #518
  • [Core]acceleratorclass cheapest selection policy logic update by @pallasathena92 in #547

New Contributors

OME 0.1.4

26 Nov 20:18
a26e43d

Choose a tag to compare

OME 0.1.4

📦 Container Images

The following container images are available:

# OME Manager
docker pull ghcr.io/moirai-internal/ome-manager:v0.1.4

# Model Agent
docker pull ghcr.io/moirai-internal/model-agent:v0.1.4

# OME Agent
docker pull ghcr.io/moirai-internal/ome-agent:v0.1.4

# Multinode Prober
docker pull ghcr.io/moirai-internal/multinode-prober:v0.1.4

⎈ Helm Installation

Option 1: OCI Registry (Recommended)

# Install directly from OCI registry
helm install ome-crd oci://ghcr.io/moirai-internal/charts/ome-crd --version 0.1.4 --namespace ome --create-namespace
helm install ome oci://ghcr.io/moirai-internal/charts/ome-resources --version 0.1.4 --namespace ome

Option 2: GitHub Releases

# Add the OME Helm repository
helm repo add ome https://github.com/sgl-project/ome/releases/download/v0.1.4
helm repo update

# Install OME
helm install ome-crd ome/ome-crd --namespace ome --create-namespace
helm install ome ome/ome-resources --namespace ome

📋 Changelog

📄 Software Bill of Materials

SBOMs are available in both SPDX and CycloneDX formats for:

  • Source code
  • All container images

🔐 Signatures

All container images are signed with cosign. Verify with:

cosign verify ghcr.io/moirai-internal/ome-manager:v0.1.4   --certificate-identity-regexp=https://github.com/sgl-project/ome/.github/workflows/release.yaml@refs/tags/.*   --certificate-oidc-issuer=https://token.actions.githubusercontent.com

What's Changed

Read more

OME v.0.1.3

16 Jul 17:07
578577a

Choose a tag to compare

🚀 What's New

Kimi K2 Model & Runtime Support

Full support has been added for the Kimi K2 model family, including model parsing and one of the most advanced runtime configurations to date. This release integrates the latest versions of SGLang and SGLang Router, with support for prefill and decode disaggregation, EPLB, and DeepEP.

Model Architecture Expansion

Added support for over 20 new model architectures in the Hugging Face model loader, significantly expanding the range of models that can be deployed and managed.

Multi-Cloud & CoreWeave Support

  • Introduced support for storing benchmark results across multiple cloud storage providers.
  • Added native support for CoreWeave infrastructure, including H200 instance shape configurations.
  • Refactored node shape utilities to support instance types across multiple cloud providers.

Runtime Selection and CRD Enhancements

Serving runtimes now support weighted selection logic based on model format and framework version. These enhancements make the runtime selection process more intelligent and adaptable across diverse model types.

Model Replication & Authentication

Introduced first-class support for replicating models directly from Hugging Face to OCI Object Storage. This includes robust handling of authentication and authorization using both Hugging Face tokens and OCI IAM credentials.

📦 Container Images

The following container images are available:

# OME Manager
docker pull ghcr.io/moirai-internal/ome-manager:v0.1.3

# Model Agent
docker pull ghcr.io/moirai-internal/model-agent:v0.1.3

# OME Agent
docker pull ghcr.io/moirai-internal/ome-agent:v0.1.3

# Multinode Prober
docker pull ghcr.io/moirai-internal/multinode-prober:v0.1.3

⎈ Helm Installation

Option 1: OCI Registry (Recommended)

# Install directly from OCI registry
helm install ome-crd oci://ghcr.io/moirai-internal/charts/ome-crd --version 0.1.3 --namespace ome --create-namespace
helm install ome oci://ghcr.io/moirai-internal/charts/ome-resources --version 0.1.3 --namespace ome

Option 2: GitHub Releases

# Add the OME Helm repository
helm repo add ome https://github.com/sgl-project/ome/releases/download/v0.1.3
helm repo update

# Install OME
helm install ome-crd ome/ome-crd --namespace ome --create-namespace
helm install ome ome/ome-resources --namespace ome

New Contributors

What's Changed

  • Renames SRT to SGLang for clarity by @JustinTong0323 in #125
  • [BUG] Fix Inference Service Status Propagation by @YouNeedCryDear in #89
  • [bugfix] fix: resolve model agent download loops and timeout issues by @slin1237 in #126
  • [core] feat: add multi-cloud storage support for benchmark results by @slin1237 in #127
  • [misc] Add support for 20+ new model architectures to HuggingFace mod… by @slin1237 in #128
  • feat(helm): Add imagePullSecrets support to ome-resources chart by @abatilo in #131
  • [docs] update intallation document by @slin1237 in #130
  • Fix YAML syntax error in helm workflow by @abatilo in #133
  • Fix typo: rename Scount to Scout by @abatilo in #135
  • fix: use non-deprecated node instance type label by @abatilo in #132
  • Fix model-agent health check timing to account for startup jitter by @abatilo in #136
  • Refactor node shape utilities to support multi-cloud instance types by @abatilo in #134
  • Add LWS worker-index 0 to sample DS manifest by @abatilo in #138
  • [bugfix] fix nodeshape for unknown vendors by @slin1237 in #141
  • [core] adds model node selector to components by @slin1237 in #142
  • [bugfix] fix ingress annotation reconciliation by @slin1237 in #144
  • [bugfix] Separate OCI Object Storage clients for ome agent by @chengjieyao in #143
  • [misc] refactor component cleanup by @slin1237 in #145
  • [docs] remove empty pages and fixed styling by @slin1237 in #146
  • [docs] fix prism style issue in docs page by @slin1237 in #147
  • [Bugfix] make install reports an erros : /bin/sh: 1: read: arg count by @mupeifeiyi in #150
  • [Bugfix] Fix models yaml file no such file or directory by @mupeifeiyi in #152
  • [DOC]Fix doc architecture diagram SVG title by @YouNeedCryDear in #154
  • add modelFormat, modelFramework version compare by @pallasathena92 in #148
  • [Misc] replica ome-agent: add support to replicate models from HF to OCI by @beiguo218 in #157
  • [Misc] replica ome-agent: minor improvements + cleanup by @beiguo218 in #158
  • [Bugfix] try to get baseModelName and protocolVersion from Model and … by @leprode in #156
  • [oep] add pvc storage support by @slin1237 in #169
  • [core] add pvc storage parsing supporting base model and cluster base… by @slin1237 in #170
  • [core] skip model agent task for pvc storage by @slin1237 in #171
  • [core] add model metadata parser to ome agent by @slin1237 in #172
  • [ci] update ci and makefile by @slin1237 in #173
  • [utils] Add CoreWeave H200 shape configuration by @abatilo in #177
  • Bump oraclelinux from 9-slim to 10-slim in /dockerfiles by @dependabot[bot] in #174
  • fix missing the artifacts dir during make artifacts by @Iceber in #178
  • [misc] fix incorrect integration path by @carlory in #179
  • Remove training-agent references and update docs by @carlory in #180
  • [core] add kimi k2 model support by @slin1237 in #182
  • imporve supported runtime sort logic by @pallasathena92 in #183
  • [misc] add kimi k2 runtime, isvc, and model by @slin1237 in #184
  • [ci] add dev artifact publishing and fix openvex by @slin1237 in #187
  • [core] add runtime selector package and deprecate runtime util by @slin1237 in #186
  • [chore] update code owner by @slin1237 in #188
  • fix: 🔧 Missing pkg/version package. by @yafengio in #190
  • fix: Update status using Status().Update method by @carlory in #192
  • fix: don't panic when webhook fails to get deploy config by @carlory in #193
  • [ci] Fix dev chart handling in workflow by @carlory in #194
  • [misc] fix external service being cleaned up and kimi runtime update by @slin1237 in #195
  • [chore] update image for 0.1.3 release by @slin1237 in #196

📋 Changelog

Full Changelog: v0.1.2...v0.1.3

📄 Software Bill of Materials

SBOMs are available in both SPDX and CycloneDX formats for:

  • Source code
  • All container images

🔐 Signatures

All container images are signed with cosign. Verify with:

cosign verify ghcr.io/moirai-internal/ome-manager:v0.1.3   --certificate-identity-regexp=https://github.com/sgl-project/ome/.github/workflows/release.yaml@refs/tags/.*   --certificate-oidc-issuer=https://token.actions.githubusercontent.com

OME v0.1.2

02 Jul 14:09
07085a3

Choose a tag to compare

OME v0.1.2

🚀 What's New

📦 Container Images

The following container images are available:

# OME Manager
docker pull ghcr.io/moirai-internal/ome-manager:v0.1.2

# Model Agent
docker pull ghcr.io/moirai-internal/model-agent:v0.1.2

# OME Agent
docker pull ghcr.io/moirai-internal/ome-agent:v0.1.2

# Multinode Prober
docker pull ghcr.io/moirai-internal/multinode-prober:v0.1.2

⎈ Helm Installation

Option 1: OCI Registry (Recommended)

# Install directly from OCI registry
helm install ome-crd oci://ghcr.io/moirai-internal/charts/ome-crd --version 0.1.2 --namespace ome --create-namespace
helm install ome oci://ghcr.io/moirai-internal/charts/ome-resources --version 0.1.2 --namespace ome

Option 2: GitHub Releases

# Add the OME Helm repository
helm repo add ome https://github.com/sgl-project/ome/releases/download/v0.1.2
helm repo update

# Install OME
helm install ome-crd ome/ome-crd --namespace ome --create-namespace
helm install ome ome/ome-resources --namespace ome

📋 Changelog

📄 Software Bill of Materials

SBOMs are available in both SPDX and CycloneDX formats for:

  • Source code
  • All container images

🔐 Signatures

All container images are signed with cosign. Verify with:

cosign verify ghcr.io/moirai-internal/ome-manager:v0.1.2   --certificate-identity-regexp=https://github.com/sgl-project/ome/.github/workflows/release.yaml@refs/tags/.*   --certificate-oidc-issuer=https://token.actions.githubusercontent.com

Full Changelog: v0.1.1...v0.1.2

OME v0.1.1

28 Jun 02:16
ed55a09

Choose a tag to compare

OME v0.1.1

🚀 What's New

📦 Container Images

The following container images are available:

# OME Manager
docker pull ghcr.io/moirai-internal/ome-manager:v0.1.1

# Model Agent
docker pull ghcr.io/moirai-internal/model-agent:v0.1.1

# OME Agent
docker pull ghcr.io/moirai-internal/ome-agent:v0.1.1

# Multinode Prober
docker pull ghcr.io/moirai-internal/multinode-prober:v0.1.1

⎈ Helm Installation

Option 1: OCI Registry (Recommended)

# Install directly from OCI registry
helm install ome-crd oci://ghcr.io/moirai-internal/charts/ome-crd --version 0.1.1 --namespace ome --create-namespace
helm install ome oci://ghcr.io/moirai-internal/charts/ome-resources --version 0.1.1 --namespace ome

Option 2: GitHub Releases

# Add the OME Helm repository
helm repo add ome https://github.com/sgl-project/ome/releases/download/v0.1.1
helm repo update

# Install OME
helm install ome-crd ome/ome-crd --namespace ome --create-namespace
helm install ome ome/ome-resources --namespace ome

📋 Changelog

Full Changelog: v0.1.0...v0.1.1

📄 Software Bill of Materials

SBOMs are available in both SPDX and CycloneDX formats for:

  • Source code
  • All container images

🔐 Signatures

All container images are signed with cosign. Verify with:

cosign verify ghcr.io/moirai-internal/ome-manager:v0.1.1   --certificate-identity-regexp=https://github.com/sgl-project/ome/.github/workflows/release.yaml@refs/tags/.*   --certificate-oidc-issuer=https://token.actions.githubusercontent.com

New Contributors

OME v0.1.0

26 Jun 22:21
c97e424

Choose a tag to compare

OME v0.1.0

🚀 What's New

First release of OME!!!!

📦 Container Images

The following container images are available:

# OME Manager
docker pull ghcr.io/moirai-internal/ome-manager:v0.1.0

# Model Agent
docker pull ghcr.io/moirai-internal/model-agent:v0.1.0

# OME Agent
docker pull ghcr.io/moirai-internal/ome-agent:v0.1.0

# Multinode Prober
docker pull ghcr.io/moirai-internal/multinode-prober:v0.1.0

⎈ Helm Installation

# Add the OME Helm repository
helm repo add ome https://github.com/sgl-project/ome/releases/download/v0.1.0
helm repo update

# Install OME
helm install ome-crd ome/ome-crd --namespace ome --create-namespace
helm install ome ome/ome-resources --namespace ome

📋 Changelog

📄 Software Bill of Materials

SBOMs are available in both SPDX and CycloneDX formats for:

  • Source code
  • All container images

🔐 Signatures

All container images are signed with cosign. Verify with:

cosign verify ghcr.io/moirai-internal/ome-manager:v0.1.0   --certificate-identity-regexp=https://github.com/sgl-project/ome/.github/workflows/release.yaml@refs/tags/.*   --certificate-oidc-issuer=https://token.actions.githubusercontent.com