Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cannot add nodes to EKS cluster after the k8ssandra-operator installed in this cluster and delete all nodes #1175

Closed
xkxrxm opened this issue Jan 16, 2024 · 4 comments
Labels
bug Something isn't working done Issues in the state 'done'

Comments

@xkxrxm
Copy link

xkxrxm commented Jan 16, 2024

What happened?

We installed the k8ssandra-operator in our EKS cluster(for test), and we delete all nodes after work every day. But when we add nodes to EKS cluster next day, the nodes cannot be ready.

Did you expect to see something different?

Nodes be ready

How to reproduce it (as minimally and precisely as possible):

  1. Install the k8ssandra-operator:
helm install k8ssandra-operator k8ssandra/k8ssandra-operator -n k8ssandra-operator --create-namespace
  1. Delete all nodes and wait for Scheduling until
➜  kubectl get nodes
No resources found
  1. Add nodes back, but nodes cannot be ready
➜  kubectl get nodes
NAME                                               STATUS     ROLES    AGE    VERSION
ip-****.ap-northeast-1.compute.internal   NotReady   <none>   9m7s   v1.28.3-eks-e71965b

Environment

  • K8ssandra Operator version:

    v1.11.0

  • Kubernetes version information:

Client Version: v1.29.0
Kustomize Version: v5.0.4-0.20230601165947-6ce0bf390ce3
Server Version: v1.28.4-eks-8cb36c9
  • Kubernetes cluster kind:

    eks

  • Manifests:

insert manifests relevant to the issue
  • K8ssandra Operator Logs:
insert K8ssandra Operator logs relevant to the issue here

Anything else we need to know?:

After we uninstalled the k8ssandra-operator, We can add nodes normally.

@xkxrxm xkxrxm added the bug Something isn't working label Jan 16, 2024
@adejanovski
Copy link
Contributor

Hi @xkxrxm,

I think this could be related to this issue. The Mutating Webhook with k8ssandra-operator down could prevent the creation of pods that will be necessary to bring the node to a ready state (such as coredns, etc...).

I'll release a patch version of the operator today with the fix and update this ticket so that you can test again.

@xkxrxm
Copy link
Author

xkxrxm commented Jan 16, 2024

Thanks for your quick response.

@xkxrxm
Copy link
Author

xkxrxm commented Jan 16, 2024

@adejanovski Thanks for your quick response again, I tested on version 1.12.0-20240116080917-413f04f3 just now, it run normally

@adejanovski
Copy link
Contributor

Thanks for testing so quickly!

I've released v1.11.0 with the fix.

@adejanovski adejanovski added the done Issues in the state 'done' label Jan 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working done Issues in the state 'done'
Projects
No open projects
Archived in project
Development

No branches or pull requests

2 participants