Ensure removal of security group rules on deleting load balancers #752

JoelSpeed · 2023-11-23T17:30:43Z

What type of PR is this?
/kind bug

What this PR does / why we need it:

This PR updates the load balancer security group update logic for ELBs so that we can actually delete a security group when there is an untagged group present.

In the current flow, we can add rules to the security group if it is untagged, but we cannot remove them since the logic excludes untagged groups from the actualGroups list. When we are deleting the load balancer and deleting the security group, we need to make sure that we remove all rules that refer to the security group else the security group deletion will reach a dependency violation deadlock.

To ensure compatibility with BYO security groups, the code ensures that we only pass the isDeleting parameter as true when the existing logic determines that the load balancer should be removing the security group already. This should mean that we only do the full removal of all references when we are about to delete the security group, and if the security group is being left over, we won't remove any references - I don't think in the BYO security group case we have any way to track what we have added so I can't fix that bug here.

Which issue(s) this PR fixes:

Fixes #566

Special notes for your reviewer:

Does this PR introduce a user-facing change?:

When removing a load balancer, the service controller will now remove all security group rules referencing the load balancer's security group, even when the security group containing the rule is unmanaged.

cartermckinnon · 2023-11-27T22:00:31Z

We're recommending that folks run the separate load balancer controller: https://github.com/kubernetes-sigs/aws-load-balancer-controller

Instead of using the legacy service controller here. Do you see this issue with that controller as well?

JoelSpeed · 2023-11-28T13:10:46Z

As far as I know, this bug only exists in the classic load balancer code path, which has no parallel in the load balancer controller. Until recently, NLBs didn't even support security groups right?

While I appreciate that there's a new way to do things and you can recommend moving people across, the CLB has no equivalent support and as such, still needs to be supported here, and this is a fairly major bug that requires users to manually enter the AWS console/CLI to remove resources to be able to clean up their environment. Else they are leaked.

Note as well that the controllers logic today will happily remove the load balancer and service object, leaking the resources and not even giving the user any indication that resources were leaked, I think this is pretty bad behaviour for a controller like this.

cartermckinnon · 2023-12-01T22:01:41Z

/assign @M00nF1sh @kishorj

can y'all take a look at this?

cartermckinnon · 2023-12-01T22:02:17Z

/retest

CI should be fixed now

JoelSpeed · 2024-01-16T09:33:29Z

Any of the maintainers able to give this bug a review please?

k8s-triage-robot · 2024-04-15T09:49:14Z

The Kubernetes project currently lacks enough contributors to adequately respond to all PRs.

This bot triages PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the PR is closed

You can:

Mark this PR as fresh with /remove-lifecycle stale
Close this PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

JoelSpeed · 2024-05-08T12:46:37Z

Any indication that this might be an acceptable patch? The bug is still present

k8s-triage-robot · 2024-06-07T13:37:32Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all PRs.

This bot triages PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the PR is closed

You can:

Mark this PR as fresh with /remove-lifecycle rotten
Close this PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

k8s-triage-robot · 2024-07-07T14:01:43Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the PR is closed

You can:

Reopen this PR with /reopen
Mark this PR as fresh with /remove-lifecycle rotten
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close

k8s-ci-robot · 2024-07-07T14:01:48Z

@k8s-triage-robot: Closed this PR.

In response to this:

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied

After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied

After 30d of inactivity since lifecycle/rotten was applied, the PR is closed

You can:

Reopen this PR with /reopen

Mark this PR as fresh with /remove-lifecycle rotten

Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

JoelSpeed · 2024-07-08T07:18:03Z

/reopen

Any AWS maintainers able to review this bug?

k8s-ci-robot · 2024-07-08T07:18:09Z

@JoelSpeed: Reopened this PR.

In response to this:

/reopen

Any AWS maintainers able to review this bug?

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

kmala · 2024-07-31T16:34:20Z

/triage accepted

kmala · 2024-07-31T16:36:06Z

/remove-lifecycle rotten

kmala · 2024-08-05T21:47:00Z

changes mostly looks good, can we do a rebase?

JoelSpeed · 2024-08-15T11:29:57Z

The E2E failures here don't look to be related as far as I can tell, @kmala do you agree or do I need to look deeper?

kmala · 2024-08-15T17:58:03Z

The E2E failures here don't look to be related as far as I can tell, @kmala do you agree or do I need to look deeper?

they are happening for other PR's also #1016 (comment) . Looking at the reason for the issue.

kmala · 2024-08-19T23:39:05Z

@JoelSpeed can you rebase such that the e2e works as its been fixed

JoelSpeed · 2024-08-20T08:15:04Z

I think Prow is supposed to handle that for you, but I've rebased anyway

kmala · 2024-08-20T16:18:24Z

/lgtm
/approve

k8s-ci-robot · 2024-08-20T16:18:32Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: kmala

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [kmala]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

jimmidyson · 2024-11-25T12:30:58Z

@JoelSpeed Thanks for this fix! Can we get this backported to release-1.31 and release-1.30 release branches please? 🙏

k8s-ci-robot requested review from andrewsykim and cartermckinnon November 23, 2023 17:30

k8s-ci-robot added size/M Denotes a PR that changes 30-99 lines, ignoring generated files. size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Nov 23, 2023

k8s-ci-robot assigned kishorj and M00nF1sh Dec 1, 2023

JoelSpeed force-pushed the remove-sg-rules-untagged-groups branch from 60fe9aa to 09471b9 Compare December 6, 2023 15:49

k8s-ci-robot added lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. labels Apr 15, 2024

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jun 7, 2024

k8s-ci-robot closed this Jul 7, 2024

k8s-ci-robot reopened this Jul 8, 2024

k8s-ci-robot added triage/accepted Indicates an issue or PR is ready to be actively worked on. and removed needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Jul 31, 2024

k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Jul 31, 2024

JoelSpeed force-pushed the remove-sg-rules-untagged-groups branch from 09471b9 to 4ee1d55 Compare August 15, 2024 10:30

k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Aug 15, 2024

JoelSpeed added 2 commits August 20, 2024 09:08

Ensure removal of security group rules on deleting load balancers

a84ea61

Sorting LB security groups should prefer tagged security group

912f047

JoelSpeed force-pushed the remove-sg-rules-untagged-groups branch from 4ee1d55 to 912f047 Compare August 20, 2024 08:08

k8s-ci-robot assigned kmala Aug 20, 2024

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 20, 2024

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Aug 20, 2024

k8s-ci-robot merged commit d7e05d5 into kubernetes:master Aug 20, 2024
13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ensure removal of security group rules on deleting load balancers #752

Ensure removal of security group rules on deleting load balancers #752

JoelSpeed commented Nov 23, 2023

cartermckinnon commented Nov 27, 2023

JoelSpeed commented Nov 28, 2023

cartermckinnon commented Dec 1, 2023

cartermckinnon commented Dec 1, 2023

JoelSpeed commented Jan 16, 2024

k8s-triage-robot commented Apr 15, 2024

JoelSpeed commented May 8, 2024

k8s-triage-robot commented Jun 7, 2024

k8s-triage-robot commented Jul 7, 2024

k8s-ci-robot commented Jul 7, 2024

JoelSpeed commented Jul 8, 2024

k8s-ci-robot commented Jul 8, 2024

kmala commented Jul 31, 2024

kmala commented Jul 31, 2024

kmala commented Aug 5, 2024

JoelSpeed commented Aug 15, 2024

kmala commented Aug 15, 2024

kmala commented Aug 19, 2024

JoelSpeed commented Aug 20, 2024

kmala commented Aug 20, 2024

k8s-ci-robot commented Aug 20, 2024

jimmidyson commented Nov 25, 2024 •

edited

Loading

Ensure removal of security group rules on deleting load balancers #752

Ensure removal of security group rules on deleting load balancers #752

Conversation

JoelSpeed commented Nov 23, 2023

cartermckinnon commented Nov 27, 2023

JoelSpeed commented Nov 28, 2023

cartermckinnon commented Dec 1, 2023

cartermckinnon commented Dec 1, 2023

JoelSpeed commented Jan 16, 2024

k8s-triage-robot commented Apr 15, 2024

JoelSpeed commented May 8, 2024

k8s-triage-robot commented Jun 7, 2024

k8s-triage-robot commented Jul 7, 2024

k8s-ci-robot commented Jul 7, 2024

JoelSpeed commented Jul 8, 2024

k8s-ci-robot commented Jul 8, 2024

kmala commented Jul 31, 2024

kmala commented Jul 31, 2024

kmala commented Aug 5, 2024

JoelSpeed commented Aug 15, 2024

kmala commented Aug 15, 2024

kmala commented Aug 19, 2024

JoelSpeed commented Aug 20, 2024

kmala commented Aug 20, 2024

k8s-ci-robot commented Aug 20, 2024

jimmidyson commented Nov 25, 2024 • edited Loading

jimmidyson commented Nov 25, 2024 •

edited

Loading