Commit 8135ecd

Schema order changed.
Signed-off-by: Andrews Arokiam <[email protected]>
1 parent c781ccf commit 8135ecd

1 file changed: 12 additions, 12 deletions

docs/modelserving/autoscaling/autoscaling.md
@@ -505,7 +505,7 @@ KServe supports `RawDeployment` mode to enable `InferenceService` deployment wit
 When using Kserve with the `RawDeployment` mode, Knative is not installed. In this mode, if you deploy an `InferenceService`, Kserve uses **Kubernetes’ Horizontal Pod Autoscaler (HPA)** for autoscaling instead of **Knative Pod Autoscaler (KPA)**. For more information about Kserve's autoscaler, you can refer [`this`](https://kserve.github.io/website/master/modelserving/v1beta1/torchserve/#knative-autoscaler)


-=== "Old Schema"
+=== "New Schema"

     ```yaml
     apiVersion: "serving.kserve.io/v1beta1"
@@ -519,11 +519,13 @@ When using Kserve with the `RawDeployment` mode, Knative is not installed. In th
         serving.kserve.io/targetUtilizationPercentage: "80"
     spec:
       predictor:
-        sklearn:
+        model:
+          modelFormat:
+            name: sklearn
           storageUri: "gs://kfserving-examples/models/sklearn/1.0/model"
     ```

-=== "New Schema"
+=== "Old Schema"

     ```yaml
     apiVersion: "serving.kserve.io/v1beta1"
@@ -537,17 +539,15 @@ When using Kserve with the `RawDeployment` mode, Knative is not installed. In th
         serving.kserve.io/targetUtilizationPercentage: "80"
     spec:
       predictor:
-        model:
-          modelFormat:
-            name: sklearn
+        sklearn:
           storageUri: "gs://kfserving-examples/models/sklearn/1.0/model"
     ```

 ### Disable HPA in Raw Deployment

 If you want to control the scaling of the deployment created by KServe inference service with an external tool like [`KEDA`](https://keda.sh/). You can disable KServe's creation of the **HPA** by replacing **external** value with autoscaler class annotaion that should be disable the creation of HPA

-=== "Old Schema"
+=== "New Schema"

     ```yaml
     apiVersion: "serving.kserve.io/v1beta1"
@@ -559,11 +559,13 @@ If you want to control the scaling of the deployment created by KServe inference
       name: "sklearn-iris"
     spec:
       predictor:
-        sklearn:
+        model:
+          modelFormat:
+            name: sklearn
           storageUri: "gs://kfserving-examples/models/sklearn/1.0/model"
     ```

-=== "New Schema"
+=== "Old Schema"

     ```yaml
     apiVersion: "serving.kserve.io/v1beta1"
@@ -575,8 +577,6 @@ If you want to control the scaling of the deployment created by KServe inference
       name: "sklearn-iris"
     spec:
       predictor:
-        model:
-          modelFormat:
-            name: sklearn
+        sklearn:
           storageUri: "gs://kfserving-examples/models/sklearn/1.0/model"
     ```
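
For orientation, the new-schema manifest from the "Disable HPA in Raw Deployment" tab might look like the sketch below once assembled. The hunks above do not show the `kind` line or the `metadata.annotations` block, so the `kind` value and both annotations here (`serving.kserve.io/deploymentMode` and `serving.kserve.io/autoscalerClass`) are assumptions drawn from the surrounding prose and general KServe usage, not lines taken from this diff.

```yaml
apiVersion: "serving.kserve.io/v1beta1"
kind: "InferenceService"            # assumed; not visible in the hunks
metadata:
  annotations:
    # Assumed annotations; the diff elides this block.
    serving.kserve.io/deploymentMode: RawDeployment
    serving.kserve.io/autoscalerClass: external   # asks KServe not to create an HPA
  name: "sklearn-iris"
spec:
  predictor:
    model:                          # new schema: generic `model` with a `modelFormat`
      modelFormat:
        name: sklearn
      storageUri: "gs://kfserving-examples/models/sklearn/1.0/model"
```

The old schema differs only in the predictor: `sklearn:` replaces the `model:` / `modelFormat:` / `name: sklearn` lines, with `storageUri` nested directly under it.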
