🐛 Fix OpenStackServer reconciliation stuck when cluster is unpaused #2833

bnallapeta · 2025-11-13T11:43:21Z

What this PR does / why we need it:
When a cluster is paused (e.g., during a pivot operation), OpenStackServer resources stop reconciling. However, when the cluster is unpaused, they don't resume because the controller doesn't watch for cluster pause/unpause events.

This PR adds a watch on Cluster resources so OpenStackServers are re-queued when their parent cluster transitions from paused to unpaused state.
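To make the intended behaviour concrete, here is a minimal, self-contained sketch of the re-queue logic. It is a simplified model, not the controller-runtime code from this PR: the `Cluster` and `Server` types and the `unpaused` and `serversToRequeue` helpers are hypothetical stand-ins for the real API types and the watch's mapping function.

```go
package main

import "fmt"

// Cluster is a hypothetical, stripped-down stand-in for the Cluster resource.
type Cluster struct {
	Name   string
	Paused bool
}

// Server is a hypothetical stand-in for an OpenStackServer and its owning cluster.
type Server struct {
	Name        string
	ClusterName string
}

// unpaused reports whether an update moved the cluster from paused to unpaused.
// This is the only transition that should trigger a re-queue in this sketch.
func unpaused(oldC, newC Cluster) bool {
	return oldC.Paused && !newC.Paused
}

// serversToRequeue returns the names of all servers that belong to the cluster.
func serversToRequeue(c Cluster, servers []Server) []string {
	var names []string
	for _, s := range servers {
		if s.ClusterName == c.Name {
			names = append(names, s.Name)
		}
	}
	return names
}

func main() {
	servers := []Server{
		{Name: "bastion", ClusterName: "prod"},
		{Name: "worker-0", ClusterName: "prod"},
		{Name: "worker-0", ClusterName: "dev"},
	}
	oldC := Cluster{Name: "prod", Paused: true}
	newC := Cluster{Name: "prod", Paused: false}

	if unpaused(oldC, newC) {
		fmt.Println("re-queue:", serversToRequeue(newC, servers))
	}
}
```

In the real controller the equivalent of `unpaused` would presumably live in an update predicate and `serversToRequeue` in the map function attached to the Cluster watch.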

Which issue(s) this PR fixes (optional, in fixes #<issue number>(, fixes #<issue_number>, ...) format, will close the issue(s) when PR gets merged):
Fixes #2824

TODOs:

- squashed commits
- if necessary:
  - includes documentation
  - adds unit tests

/hold

netlify · 2025-11-13T11:44:04Z

✅ Deploy Preview for kubernetes-sigs-cluster-api-openstack ready!

Name	Link
🔨 Latest commit	`828af4f`
🔍 Latest deploy log	https://app.netlify.com/projects/kubernetes-sigs-cluster-api-openstack/deploys/69267f2f3e792900087e4539
😎 Deploy Preview	https://deploy-preview-2833--kubernetes-sigs-cluster-api-openstack.netlify.app

To edit notification comments on pull requests, go to your Netlify project configuration.

EmilienM · 2025-11-19T14:58:23Z

/approve

k8s-ci-robot · 2025-11-19T14:58:32Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: EmilienM

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [EmilienM]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

bnallapeta · 2025-11-20T01:36:56Z

/ok-to-test

lentzi90 · 2025-11-21T06:33:06Z

controllers/openstackserver_controller.go

+		// Don't handle deleted clusters
+		if !c.DeletionTimestamp.IsZero() {
+			log.V(4).Info("Cluster has a deletion timestamp, skipping mapping.")
+			return nil
+		}


Don't we need to clean up if it is deleting? I am thinking of a situation where the user pauses the cluster, then deletes it (forgetting to unpause first), and then unpauses it.

bnallapeta · 2025-11-26

Hmm, to address this edge case, I've moved the pause check after the deletion timestamp check (similar to the OpenStackCluster controller). The reconciliation flow is now:

1. If the server is being deleted, proceed with deletion (regardless of pause state).
2. If it is not being deleted and the cluster is paused, skip reconciliation.
3. Otherwise, proceed normally.

This ensures deletion always proceeds even if the cluster was paused when deletion started, then later unpaused.
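The ordering described in this reply can be sketched as a small decision function. This is only an illustration of the check order, assuming two boolean inputs; the `Action` type and `decide` function are hypothetical names and not part of the controller.

```go
package main

import "fmt"

// Action names the branch the reconciler would take (hypothetical type).
type Action string

const (
	ActionDelete Action = "reconcileDelete"
	ActionSkip   Action = "skip"
	ActionNormal Action = "reconcileNormal"
)

// decide applies the checks in the order described above:
// the server's deletion timestamp first, then the cluster's pause state.
func decide(serverDeleting, clusterPaused bool) Action {
	if serverDeleting {
		// Deletion proceeds regardless of the pause state.
		return ActionDelete
	}
	if clusterPaused {
		return ActionSkip
	}
	return ActionNormal
}

func main() {
	for _, deleting := range []bool{true, false} {
		for _, paused := range []bool{true, false} {
			fmt.Printf("deleting=%v paused=%v -> %s\n", deleting, paused, decide(deleting, paused))
		}
	}
}
```

Note that with this ordering a server that is being deleted is processed even while the cluster is paused.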

bnallapeta · 2025-12-04

@lentzi90 ptal ^^

Signed-off-by: Bharath Nallapeta <[email protected]>

lentzi90 · 2025-12-11T09:00:45Z

I am a bit worried that this will break clusterctl move. We will probably need to handle all these cases:

- Cluster paused and has deletion timestamp: do nothing <- This is now broken I think (we delete even when paused)
- Cluster unpaused and has deletion timestamp: reconcileDelete <- This was missing in first version I think
- Cluster paused and no deletion timestamp: do nothing ✔️
- Cluster unpaused and no deletion timestamp: reconcileNormal ✔️
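The four cases above can be written down as a table and checked against a candidate ordering. The sketch below assumes one possible ordering that satisfies all four rows (pause check first, then deletion timestamp); the `state` type and `next` function are hypothetical, and this is not necessarily what the PR ends up implementing.

```go
package main

import "fmt"

// state is a hypothetical key for the combinations listed above.
type state struct {
	paused   bool
	deleting bool
}

// next checks the pause state first, so a paused cluster is never touched,
// and only then looks at the deletion timestamp.
func next(s state) string {
	if s.paused {
		return "do nothing"
	}
	if s.deleting {
		return "reconcileDelete"
	}
	return "reconcileNormal"
}

func main() {
	// Expected outcomes, copied from the four cases in the comment above.
	cases := []struct {
		s    state
		want string
	}{
		{state{paused: true, deleting: true}, "do nothing"},
		{state{paused: false, deleting: true}, "reconcileDelete"},
		{state{paused: true, deleting: false}, "do nothing"},
		{state{paused: false, deleting: false}, "reconcileNormal"},
	}
	for _, c := range cases {
		got := next(c.s)
		fmt.Printf("paused=%v deleting=%v -> %s (matches: %v)\n",
			c.s.paused, c.s.deleting, got, got == c.want)
	}
}
```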

The full test is testing clusterctl move, so let's check
/test pull-cluster-api-provider-openstack-e2e-full-test

k8s-ci-robot · 2025-12-11T10:01:07Z

@bnallapeta: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
pull-cluster-api-provider-openstack-e2e-full-test	`828af4f`	link	false	`/test pull-cluster-api-provider-openstack-e2e-full-test`

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

github-project-automation bot added this to CAPO Roadmap Nov 13, 2025

k8s-ci-robot added the do-not-merge/hold label (Indicates that a PR should not merge because someone has issued a /hold command.) Nov 13, 2025

github-project-automation bot moved this to Inbox in CAPO Roadmap Nov 13, 2025

k8s-ci-robot requested review from EmilienM and lentzi90 November 13, 2025 11:43

k8s-ci-robot added the cncf-cla: yes (Indicates the PR's author has signed the CNCF CLA.) and size/L (Denotes a PR that changes 100-499 lines, ignoring generated files.) labels Nov 13, 2025

k8s-ci-robot added the approved label (Indicates a PR has been approved by an approver from all required OWNERS files.) Nov 19, 2025

k8s-ci-robot added the ok-to-test label (Indicates a non-member PR verified by an org member that is safe to test.) Nov 20, 2025

lentzi90 reviewed Nov 21, 2025

fix: openstackserver reconciliation when cluster is paused

828af4f

Signed-off-by: Bharath Nallapeta <[email protected]>

bnallapeta force-pushed the pause-2824 branch from dd01cc9 to 828af4f November 26, 2025 04:16
