Skip to content

Commit 257ad87

Browse files
committed
Schema order changed.
Signed-off-by: Andrews Arokiam <[email protected]>
1 parent 6cd625c commit 257ad87

File tree

1 file changed

+12
-12
lines changed

1 file changed

+12
-12
lines changed

docs/modelserving/autoscaling/autoscaling.md

+12-12
Original file line numberDiff line numberDiff line change
@@ -510,7 +510,7 @@ KServe supports `RawDeployment` mode to enable `InferenceService` deployment wit
510510
When using Kserve with the `RawDeployment` mode, Knative is not installed. In this mode, if you deploy an `InferenceService`, Kserve uses **Kubernetes’ Horizontal Pod Autoscaler (HPA)** for autoscaling instead of **Knative Pod Autoscaler (KPA)**. For more information about Kserve's autoscaler, you can refer [`this`](https://kserve.github.io/website/master/modelserving/v1beta1/torchserve/#knative-autoscaler)
511511

512512

513-
=== "Old Schema"
513+
=== "New Schema"
514514

515515
```yaml
516516
apiVersion: "serving.kserve.io/v1beta1"
@@ -524,11 +524,13 @@ When using Kserve with the `RawDeployment` mode, Knative is not installed. In th
524524
serving.kserve.io/targetUtilizationPercentage: "80"
525525
spec:
526526
predictor:
527-
sklearn:
527+
model:
528+
modelFormat:
529+
name: sklearn
528530
storageUri: "gs://kfserving-examples/models/sklearn/1.0/model"
529531
```
530532

531-
=== "New Schema"
533+
=== "Old Schema"
532534

533535
```yaml
534536
apiVersion: "serving.kserve.io/v1beta1"
@@ -542,17 +544,15 @@ When using Kserve with the `RawDeployment` mode, Knative is not installed. In th
542544
serving.kserve.io/targetUtilizationPercentage: "80"
543545
spec:
544546
predictor:
545-
model:
546-
modelFormat:
547-
name: sklearn
547+
sklearn:
548548
storageUri: "gs://kfserving-examples/models/sklearn/1.0/model"
549549
```
550550

551551
### Disable HPA in Raw Deployment
552552

553553
If you want to control the scaling of the deployment created by KServe inference service with an external tool like [`KEDA`](https://keda.sh/). You can disable KServe's creation of the **HPA** by replacing **external** value with autoscaler class annotaion that should be disable the creation of HPA
554554

555-
=== "Old Schema"
555+
=== "New Schema"
556556

557557
```yaml
558558
apiVersion: "serving.kserve.io/v1beta1"
@@ -564,11 +564,13 @@ If you want to control the scaling of the deployment created by KServe inference
564564
name: "sklearn-iris"
565565
spec:
566566
predictor:
567-
sklearn:
567+
model:
568+
modelFormat:
569+
name: sklearn
568570
storageUri: "gs://kfserving-examples/models/sklearn/1.0/model"
569571
```
570572

571-
=== "New Schema"
573+
=== "Old Schema"
572574

573575
```yaml
574576
apiVersion: "serving.kserve.io/v1beta1"
@@ -580,8 +582,6 @@ If you want to control the scaling of the deployment created by KServe inference
580582
name: "sklearn-iris"
581583
spec:
582584
predictor:
583-
model:
584-
modelFormat:
585-
name: sklearn
585+
sklearn:
586586
storageUri: "gs://kfserving-examples/models/sklearn/1.0/model"
587587
```

0 commit comments

Comments
 (0)