Docs: added migration guide #1558

capri-xiyue · 2025-09-09T21:41:59Z

What type of PR is this?
/kind documentation

What this PR does / why we need it:
Added a migration guide for the InferencePool v1 API.

Which issue(s) this PR fixes:

Fixes #

Does this PR introduce a user-facing change?:

None

k8s-ci-robot · 2025-09-09T21:42:06Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: capri-xiyue
Once this PR has been reviewed and has the lgtm label, please assign nirrozenbaum for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

netlify · 2025-09-09T21:42:07Z

✅ Deploy Preview for gateway-api-inference-extension ready!

| Name | Link |
|---|---|
| 🔨 Latest commit | `e934b58` |
| 🔍 Latest deploy log | https://app.netlify.com/projects/gateway-api-inference-extension/deploys/68c0a3231e50b80009cfaf72 |
| 😎 Deploy Preview | https://deploy-preview-1558--gateway-api-inference-extension.netlify.app |

To edit notification comments on pull requests, go to your Netlify project configuration.

Signed-off-by: Xiyue Yu <[email protected]>

capri-xiyue · 2025-09-09T21:59:18Z

/assign @kfswain

kfswain · 2025-09-09T23:50:50Z

site-src/guides/ga-migration.md

+If you are not using Helm, you will need to manually delete all resources associated with your `v1alpha2` deployment. The key is to remove the `HTTPRoute`'s reference to the old `InferencePool` and then delete the `v1alpha2` resources themselves.
+
+1.  **Update or Delete the `HTTPRoute`**: Modify the `HTTPRoute` to remove the `backendRef` that points to the `v1alpha2` `InferencePool`.
+2.  **Delete the `InferencePool` and associated resources**: You must delete the `v1alpha2` `InferencePool`, any `InferenceModel` resources that point to it, and the corresponding Endpoint Picker (EPP) Deployment and Service.


Suggested change

-2. **Delete the `InferencePool` and associated resources**: You must delete the `v1alpha2` `InferencePool`, any `InferenceModel` resources that point to it, and the corresponding Endpoint Picker (EPP) Deployment and Service.
+2. **Delete the `InferencePool` and associated resources**: You must delete the `v1alpha2` `InferencePool`, any `InferenceModel` (or `InferenceObjective`) resources that point to it, and the corresponding Endpoint Picker (EPP) Deployment and Service.

kfswain · 2025-09-09T23:51:59Z

site-src/guides/ga-migration.md

+2.  **Delete the `InferencePool` and associated resources**: You must delete the `v1alpha2` `InferencePool`, any `InferenceModel` resources that point to it, and the corresponding Endpoint Picker (EPP) Deployment and Service.
+3.  **Delete the `v1alpha2` CRDs**: Once all `v1alpha2` custom resources are deleted, you can remove the CRD definitions from your cluster.
+    ```bash
+    kubectl delete -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/releases/download/v0.3.0/manifests.yaml


Consider making the version portion of the path configurable
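
One way to act on this suggestion, sketched under assumptions: keep the release tag in a shell variable and build the manifest URL from it. The names `RELEASE_VERSION`, `MANIFEST_URL`, and `delete_v1alpha2_crds` are hypothetical and do not come from the guide; `v0.3.0` is simply the tag in the quoted command.

```shell
# Hypothetical variable: set RELEASE_VERSION to the release you originally installed.
# Defaults to v0.3.0, the tag used in the guide's example command.
RELEASE_VERSION="${RELEASE_VERSION:-v0.3.0}"
MANIFEST_URL="https://github.com/kubernetes-sigs/gateway-api-inference-extension/releases/download/${RELEASE_VERSION}/manifests.yaml"

# Hypothetical helper: deletes the CRDs defined in that release's manifest.
# It is only defined here; call it explicitly once all v1alpha2 resources are gone.
delete_v1alpha2_crds() {
  kubectl delete -f "${MANIFEST_URL}"
}

echo "CRD manifest: ${MANIFEST_URL}"
```

A reader on a different release would export `RELEASE_VERSION` before running the snippet and then call `delete_v1alpha2_crds`.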

kfswain · 2025-09-09T23:57:53Z

site-src/guides/ga-migration.md

+Curl the endpoint to make sure you are getting a successful response with a **200** response code.
+
+```bash
+IP=$(kubectl get gateway/inference-gateway -o jsonpath='{.status.addresses[0].value}')


I think leaving the GW name as inference-gateway is fine in this case, but I would make mention that you need to put your GW name here
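
A possible shape for that note, as a sketch only: keep `inference-gateway` as the default but read the Gateway name from a variable so readers see where their own name goes. `GATEWAY_NAME` and `gateway_ip` are hypothetical names introduced for illustration.

```shell
# Hypothetical variable: replace with the name of your own Gateway.
# inference-gateway is the name used in the guide's example.
GATEWAY_NAME="${GATEWAY_NAME:-inference-gateway}"

# Hypothetical helper: prints the first address reported in the Gateway's status.
# It is only defined here; it needs kubectl access to the cluster when called.
gateway_ip() {
  kubectl get "gateway/${GATEWAY_NAME}" -o jsonpath='{.status.addresses[0].value}'
}

echo "Gateway to query: ${GATEWAY_NAME}"
```

With this in place, the guide's assignment becomes `IP=$(gateway_ip)`.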

kfswain · 2025-09-10T00:00:04Z

site-src/guides/ga-migration.md

+After cleaning up the old resources, you can proceed with a fresh installation of the `v1` Inference Gateway. This involves installing the new `v1` CRDs, creating a new `v1` `InferencePool` and corresponding `InferenceObjective` resources, and creating a new `HTTPRoute` that directs traffic to your new `v1` `InferencePool`.
+
+
+### 3. Verify the Deployment


This should probably include mention of the fact that you need to deploy a new EPP image that is compatible with the v1 API
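
To make the EPP point concrete, a reader could check which image the EPP Deployment currently runs before and after the upgrade. This is a sketch under assumptions: `my-epp` is a hypothetical placeholder for the Deployment name, `EPP_DEPLOYMENT` and `epp_images` are illustrative names, and which image versions are compatible with the `v1` API is not stated in this thread.

```shell
# Hypothetical name: replace my-epp with the name of your own EPP Deployment.
EPP_DEPLOYMENT="${EPP_DEPLOYMENT:-my-epp}"

# Hypothetical helper: prints the container image(s) the EPP Deployment runs.
# It is only defined here; it needs kubectl access to the cluster when called.
epp_images() {
  kubectl get "deployment/${EPP_DEPLOYMENT}" -o jsonpath='{.spec.template.spec.containers[*].image}'
}

echo "EPP Deployment to inspect: ${EPP_DEPLOYMENT}"
```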

kfswain · 2025-09-10T00:13:36Z

Added comments. The split seems fine, but we need to mention that deploying a new EPP is likely needed.

But this is a great addition, thank you!

added migration guide

9556ab7

k8s-ci-robot added kind/documentation Categorizes issue or PR as related to documentation. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Sep 9, 2025

k8s-ci-robot requested review from danehans and robscott September 9, 2025 21:42

k8s-ci-robot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Sep 9, 2025

capri-xiyue added 4 commits September 9, 2025 14:44

updated index

f9c4d3e

changed typo

cb8e669

updated the index

027f85a

Signed-off-by: Xiyue Yu <[email protected]>

updated docs

e934b58

k8s-ci-robot assigned kfswain Sep 9, 2025

kfswain reviewed Sep 10, 2025
