Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Resync 20240926 #231

Merged
merged 58 commits into from
Oct 1, 2024
Merged

Resync 20240926 #231

merged 58 commits into from
Oct 1, 2024

Conversation

ffromani
Copy link
Member

resync against upstream master

googs1025 and others added 30 commits June 10, 2024 17:07
- Add linux/ppc64le,linux/s390x
- Switch to distroless
- Add support for podman

Signed-off-by: Paul Bastide <[email protected]>
Add support for ppc64le and s390x architectures and distroless builds
chore: fix EventsToRegister method comment
Huang-Wei and others added 14 commits August 18, 2024 10:20
use `/bin/kube-scheduler` and `/bin/controller` as the entrypoints of Dockerfiles
solve struct literal uses unkeyed fields
In order to improve the debuggability of the overreserve cache, we
would like to
1. correlate the cache state being used with
2. the actions the resync loop is doing
3. infer in a easier way the current state of the cache

This change aims to improve points 1 and 2, while also trying to make
3 easier in the future.

We introduce the concept of "generation" which is an opaque
monotonically increasing integer similar in spirit to the
`resourceVersion` kube API field.
Every time the internal state of the cache is updated, which happens
only in the resync loop by design, we increment the generation.

GetCachedNRTCopy will also return the generation of the data
being used, so we have now an uniform way to correlate readers
and writer of the cache, and we gain better visibility of the data
being used.

With verbose enough logging, using the generation is now easier
(albeit admittedly still clunky) to reconstruct the chain of changes
which lead to a given cache state, which was much harder previously.
Similarly, there's now a clear way to learn which cache state was
used to make a given scheduling decision, which was much harder before.

The changes involve mostly logging; to avoid proliferation of return
values, however, a trivial refactoring is done in `GetCachedNRTCopy`.
A beneficial side effect is much improved documentation of the
return values.

Signed-off-by: Francesco Romani <[email protected]>
nrt: log: introduce and use "generation" for cache
* Fixes controller's Dockerfile casing

* Fixes scheduler's Dockerfile casing

When using the Makefile to build the images, a warning arises regarding the incosistent casing of the "FROM" and "as" keywords.
* KEP for PEAKS plugin

Signed-off-by: Krishnasuri Narayanam <[email protected]>

* changes to address the KEP review comments

* fixing toc as review comments

Signed-off-by: Krishnasuri Narayanam <[email protected]>

---------

Signed-off-by: Krishnasuri Narayanam <[email protected]>
Signed-off-by: Francesco Romani <[email protected]>
Copy link

openshift-ci bot commented Sep 26, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ffromani

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Sep 26, 2024
@ffromani
Copy link
Member Author

/hold

@openshift-ci openshift-ci bot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Sep 26, 2024
@ffromani ffromani requested review from Tal-or and shajmakh September 27, 2024 08:19
@ffromani
Copy link
Member Author

key commit relevant for us: 0dae3ec

@ffromani
Copy link
Member Author

ffromani commented Oct 1, 2024

/hold cancel

@openshift-ci openshift-ci bot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Oct 1, 2024
@shajmakh
Copy link
Member

shajmakh commented Oct 1, 2024

/lgtm
Thanks for this

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Oct 1, 2024
@ffromani ffromani merged commit b66c594 into master Oct 1, 2024
7 checks passed
@ffromani ffromani deleted the resync-20240926 branch October 1, 2024 14:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. lgtm Indicates that a PR is ready to be merged.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

10 participants