-
Notifications
You must be signed in to change notification settings - Fork 42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
brupop API server TLS cert is untrusted #486
Comments
I found some additional information: when describing the API server pods, I noticed this event: |
@jackgill Can you share with us how you install brupops? like installing cert-manager first and then install brupop? I'm trying to reproduce this issue. Usually we need cert-manager running on the EKS cluster first and then install brupop after few minus when we confirm cert-manager is running. |
@jackgill I just reproduce the same error But this maybe not same as your situation. |
@gthao313 cert-manager was installed via helm long before brupop was deployed on the cluster. Usually we use terraform to apply the brupop manifests but I've noticed that it creates the resources out of order, which I thought might be causing this problem. So I tried installing brupop with |
Thanks for the info. Can you check if secret |
The secret is there currently. I think that it might not have been there when the pods were first created, so I tried recreating the pods now that the secret exists. However the same error remains. |
We were facing the same problem today. In cert-manager logs:
In my point of view in bottlerocket-update-operator.yaml line 15
needs to be replaced by
|
Update: I am able to reproduce this on version 1.3.0 of brupop, installed via Helm. However, based on @stefan-lipinski's comment above, I deployed the CRD using a manifest file which I had edited to specify The fix may be as simple as updating the certificate name here:
|
I just fixed the same bug. In my case I am using cert-manager and the solution was to set the annotation to the value "brupop-apiserver-certificate" |
We've been investigating this issue and the chain of trust appears correct as-implemented in Brupop today. We shouldn't need to refer to the "top-level" self-signed certificate authority for SSL to function. Even though the change proposed in #595 resolves the issue for folks who are seeing it, it doesn't seem like we actually want to merge that upstream -- there is likely some other issue that this works around. We have a few theories about what that issue could be:
Can you provide us more details on your setup? Specifically:
@v0lumehi since you've chimed in recently, could you possibly provide a bit more information? |
Ah, I think the change mentioned in this comment by @v0lumehi is actually more in-line with how we wish to use the chain of trust:
|
Hi, if you set the annotation to the value mentioned in my comment the problem is solved. |
Unfortunately I can't remember the exact error message, but I think it was a "connection closed" or something like that when trying to call the brupop-api-agent |
We started using Bottlerocket on EKS in the second half of 2023. The initial config was locals {
eks_cluster_version = "1.24"
eks_vpc_cni_version = "v1.11.4-eksbuild.1"
eks_coredns_version = "v1.8.7-eksbuild.3"
eks_kube_proxy_version = "v1.24.7-eksbuild.2"
eks_node_group_version = "1.24"
cluster_autoscaler_version = "v1.24.1"
cert-manager = "v1.8.2"
ingress_image_repository = "registry.k8s.io/ingress-nginx/controller"
ingress_image_tag = "v1.8.2"
load_balancer_image_tag = "v2.4.3"
load_balancer_helm_chart_version = "1.4.4"
cloudwatch_agent_version = "amazon/cloudwatch-agent:1.300026.3b189"
fluent-bit_agent_version = "amazon/aws-for-fluent-bit:2.31.12.20230911"
eks_ami_type = "BOTTLEROCKET_x86_64"
eks_ami_release_version = "1.14.3-764e37e4"
} We were facing this issue from the beginning. The error messages appeared every few seconds in the logs and looked like this:
|
As I mentioned in #478, the brupop API server on one of my EKS clusters apparently has an untrusted TLS cert:
I installed brupop using the 1.1.0 manifest file and it is working fine on several other EKS clusters deployed using the same method.
Image I'm using:
1.1.0
Issue or Feature Request:
Looking at the PKI for brupop I see a self-signed issuer cert, but I'm not clear on how this cert is supposed to be trusted. Any advice on how to troubleshoot this issue would be appreciated.
The text was updated successfully, but these errors were encountered: