AWS security groups not being destroyed #2445

szczad · 2017-11-27T11:43:16Z

Hi there,

Terraform Version

Terraform v0.11.0

provider.aws v1.3.1

Affected Resource(s)

Please list the resources as a list, for example:

aws_security_group
(probably more)

Terraform Configuration Files

Changed "name" from "all-zabbix" to "all-zabbix-test"

resource "aws_security_group" "all-zabbix" {
  vpc_id                 = "${aws_vpc.infra.id}"
  name                   = "all-zabbix-test"
  description            = "Zabbix"
}

resource "aws_security_group_rule" "all-zabbix-in-tcp-10051" {
  security_group_id = "${aws_security_group.all-zabbix.id}"
  description       = "Zabbix internal communication"
  type              = "ingress"
  protocol          = "tcp"

  from_port = 10051
  to_port   = 10051
  self      = true
}

Debug Output

aws_security_group_rule.all-zabbix-in-tcp-10051: Destroying... (ID: sgrule-1234567890)
aws_security_group_rule.all-zabbix-in-tcp-10051: Destruction complete after 1s
aws_security_group.all-zabbix: Destroying... (ID: sg-12345678)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 10s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 20s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 30s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 40s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 50s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 1m0s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 1m10s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 1m20s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 1m30s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 1m40s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 1m50s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 2m0s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 2m10s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 2m20s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 2m30s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 2m40s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 2m50s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 3m0s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 3m10s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 3m20s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 3m30s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 3m40s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 3m50s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 4m0s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 4m10s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 4m20s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 4m30s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 4m40s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 4m50s elapsed)
aws_security_group.all-zabbix: Still destroying... (ID: sg-12345678, 5m0s elapsed)

Error: Error applying plan:

1 error(s) occurred:

* aws_security_group.all-zabbix (destroy): 1 error(s) occurred:

* aws_security_group.all-zabbix: DependencyViolation: resource sg-12345678 has a dependent object
        status code: 400, request id: abcdefg-dead-beef-dead-abcdefg123456

Terraform does not automatically rollback in the face of errors.
Instead, your Terraform state file has been partially updated with
any resources that successfully completed. Please address the error
above and apply again to incrementally change your infrastructure.

Expected Behavior

remove SG's rules
remove old SG from instances/interfaces
delete old SG
create SG with new values
adds new SG to instances/interfaces

Actual Behavior

remove SG's rules
delete old SG
times out during deletion leaving SG without rules

Steps to Reproduce

Please list the steps required to reproduce the issue, for example:

change security group name to force resource recreation
terraform apply

The text was updated successfully, but these errors were encountered:

mzupan · 2017-11-30T19:38:40Z

is the SG attached to anything like a ELB or instance?

szczad · 2017-12-01T09:01:21Z

Yeah. It's attached to one instance managed by Terraform. If I manually remove association everything goes well, but terraform does not do that on its own.

apparentlymart · 2017-12-19T13:18:51Z

Hi @szczad! Sorry for this limitation.

At the moment Terraform doesn't have any mechanism to deal with these "enforced dependencies" in the underlying service, and indeed it affects a number of resources, such as what we see in #646, #151, #2201.

The problem is that this requires some extra coordination between operations -- creating multiple coordinated steps as you mentioned -- which Terraform's provider model isn't currently able to represent. In certain cases such dependencies could be represented in principle -- for example, in this case where Terraform is managing both the security group and the other resources that belong to it -- while in other cases Terraform can't "see" the dependency at all, because e.g. the EC2 instances are being created implicitly by an aws_autoscaling_group.

We would like to find a way to address this limitation in the long run, for certain situations at least, but at this time we do not have a suitable design figured out to deal with it, and our current Terraform Core development focus is elsewhere. For now it is, as you noted, required to manually break the dependency somehow before making these changes, which is definitely not ideal.

In principle we could improve the behavior here by at least producing a helpful error when this situation arises. Unfortunately the long-running polling behavior you saw here was introduced to work around a different problem: network interfaces tend to live for a few minutes after their associated resources are destroyed, acting as dependencies on the security group that aren't reflected in the API at all. Terraform therefore retries here so that it can wait until VPC has finished deallocating the network interface before proceeding, rather than failing with a hard error in that case.

szczad · 2017-12-19T13:25:35Z

Hmm... That sounds complicated a lot. But thanks for detailed explanation.
It looks like I have to stick to managing our SGs manually. Lucky those who split their configuration into smaller pieces. :-)

FrenchBen · 2018-04-11T00:18:49Z

I often run into this exact issue, which is quite annoying, especially when I'm trying to destroy -force.

Having a dependency violation come up, when I'm trying to get rid of my resources, is not expected behavior.
I'm asking for them to be destroyed (deleted), why are we even evaluating dependencies?

Does this mean that the -force flag is only useful to prevent a user from having any input?

eriksw · 2019-12-30T22:48:06Z

Has 0.12 introduced anything that'd allow for a fix to this? Having to manually alter >80 security groups via the console is not a pleasant experience.

rehevkor5 · 2020-02-01T00:00:17Z

Also related (SG with RDS, SG with VPCE): #9692

linuxman79 · 2020-02-06T14:58:03Z

This is also manifest by removing a security group AND its association with any instances. In this case, the dependency is broken by the change to the instance, but terraform is trying to delete the security group BEFORE modifying the instance. Just another manifestation of the same issue.

pneigel-ca · 2020-02-17T21:23:14Z

I experience this problem with the following resources:

aws_launch_template
aws_autoscaling_group
aws_security_group

I am using:
Terraform v0.12.20
provider.aws v2.49.0

When I attempt to destroy the resources, the security group hangs for several minutes. When investigating in the UI, it appears there is a network interface dependency which prevents it's deletion from the AWS console. If I delete the network interface manually, the terraform deletion succeeds very quickly. If I do not, ultimately the operation fails similarly.

Error: Error deleting security group: DependencyViolation: resource sg-some-resource0 has a dependent object status code: 400, request id: some-id

perelin · 2020-03-18T12:07:28Z

Same issue here. When I try to apply a change to a SG that forces replacement:

Error: Error deleting security group: DependencyViolation: resource sg-0deddff8230759a10 has a dependent object
	status code: 400, request id: 451d4b6f-4d10-4a62-a70e-cfefce3cbd3c

brandon-fryslie · 2020-03-19T16:03:04Z

What are the workarounds for this? We're evaluating Terraform right now to potentially switch from using Ansible for provisioning cloud infrastructure, but this seems like a pretty glaring omission.

Is there really no way to tell Terraform to remove the SG from places it is used before updating it? I can't imagine heavy users of Terraform are continuing to make manual configuration changes. Thanks for your help

davehewy · 2020-03-25T16:51:11Z

To add to this thread. I often find upon a second run of the destroy command the previously endlessly hanging security_group deletion task will immediately delete. Perhaps suggesting Terraform needs to perform some sort of re-calculation at each retry attempt?

johnlabarge · 2020-04-24T18:33:42Z

I'm getting this as well. Are there any workarounds?

weibeld · 2020-04-26T15:10:49Z

This issue occurs when renaming a security group in Terraform and also updating the aws_instance resource to reference the security group by its new name.

Terraform tries to do the following:

Destroy existing security group
Create new security group
Modify EC2 instance (disassociate old SG and associate new SG)

This causes (1) to hang because the AWS API prevents deleting a SG that's still associated with an instance. (2) succeeds, and (3) is never executed.

The correct order of actions should be:

Create new security group
Modify EC2 instance (disassociate old SG and associate new SG)
Destroy existing security group

I didn't find any working solutions, except during the hanging going to the AWS Management Console Actions > Networking > Change Security Groups, disassociating the old security group, and associating the new security group.

gchek · 2020-05-12T09:34:36Z

Hitting the same:
if the SG of an EC2 is added to the main VPC SG, terraform can not destroy.
need to manually remove the main SG rules and remove the EC2 SG to succeed. Painful.

BcTpe4HbIu · 2020-05-29T22:52:02Z

With create_before_destroy = true in SG it works.

resource "aws_security_group" "intercluster" {
  name   = "some sg"
  vpc_id = var.vpc_id

  lifecycle {
    create_before_destroy = true
  }
<snip>

bengiddins · 2020-06-09T01:18:29Z

lifecycle {
create_before_destroy = true
}

Thanks for this! I'm watching a SG try to delete for over 60 minutes now - painstakingly went through discrete steps of creating a new group, assigning the new group, deleting the old group - oh wait, I tested manually adding a Lambda function to the SG and removing it, but now the SG is still attached to those network interfaces and won't delete ಠ_ಠ

aws_security_group.this: Still destroying... [id=sg-0920404d643c46xxx, 1h2m51s elapsed]
aws_security_group.this: Still destroying... [id=sg-0920404d643c46xxx, 1h3m1s elapsed]
aws_security_group.this: Still destroying... [id=sg-0920404d643c46xxx, 1h3m11s elapsed]

gchek · 2020-06-09T10:07:35Z

create_before_destroy = true doesn't help. To destroy my SG, i need to MANUALLY remove the other SG from the rules.
Simple test code available here

Terraform v0.12.26

provider.aws v2.65.0

The issue is the the default SG is detroyed BEFORE the EC2 SG is and in fact it's not true - the default SG is still there even if Terraform says "destroyed" (see 2nd line below)

module.VPCs.aws_default_security_group.default: Destroying... [id=sg-02057153bb443e13d]
module.VPCs.aws_default_security_group.default: Destruction complete after 0s
module.VPCs.aws_security_group.GC-SG-VPC-test: Destroying... [id=sg-02a06934c19a8efaa]

The GC-SG-VPC-test security group is part the the default SG rules !!!

Roxyrob · 2020-07-17T06:55:49Z

Thanks @BcTpe4HbIu

  lifecycle {
    create_before_destroy = true
  }

worked in my case.

saurabh-hirani · 2020-12-07T13:48:55Z

This worked for me:

Add

  lifecycle {
    create_before_destroy = true
  }

to the old security group while renaming it.

Run terraform - it will create the new security group, do the attachments and it will hang at terminating the old security group.
Go to the AWS console and attach the new group, detach the old group from your targets. Ensure that old group attachments are 0.
Terraform will like that and delete the old group thereby not messing up your state file.

gchek · 2020-12-07T14:54:57Z

this is what I am trying to avoid - doing things manually in the console.
if a security group contains a second security group and we do terraform destroy, the second SG rule should be removed, the second SG destroyed and then the first. Seems so logical to me.

gdavison · 2021-03-09T22:27:58Z

Hi everyone,

We know this is a frustrating issue for you all. Unfortunately, the Terraform dependency model doesn't yet support bi-directional dependencies between resources that would allow general modifying other resources as part of deletion or modification. We have an open issue on Terraform core to address this.

There is a general workaround for this dependency case, described at hashicorp/terraform#16065 (comment), though it may not be applicable here.

One additional workaround that may work in some of your cases with aws_security_group is to set the revoke_rules_on_delete parameter on the aws_security_group resource. Note that I haven't tested this, and it may have side effects such as deleting additional rules, which could be re-created by running terraform apply again.

Since this issue requires changes to the core Terraform dependency model, I'm going to close this issue. Once the support is available, we will address this and other issues caused by dependencies across resources. You may be able to find other workarounds or solutions in our forums for the AWS Provider or Terraform.

gchek · 2021-03-10T11:28:46Z

So Terraform gives up on that - woow - i can't believe it

amartopoulos · 2021-03-10T21:17:34Z

Our scenario/workaround: TF couldn't destroy/replace a security group because it was still attached to an ALB. We had to do 3 separate TF runs:

Create new security groups and attach them to ALB
Remove old SGs from ALB
Remove SG resources

This avoids horrid manual console/statefile intervention. Hope this helps someone!

ghost · 2021-04-09T17:10:12Z

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.

If you feel this issue should be reopened, we encourage creating a new issue linking back to this one for added context. Thanks!

apparentlymart added the bug Addresses a defect in current functionality. label Dec 19, 2017

radeksimko added the service/ec2 Issues and PRs that pertain to the ec2 service. label Jan 28, 2018

radeksimko mentioned this issue Dec 10, 2019

Terraform crash, "Mismatch reason: attribute mismatch: subnets" when deleting instance hashicorp/terraform-plugin-sdk#108

Closed

eriksw mentioned this issue Dec 30, 2019

Terraform doesn't handle basic dependencies #10654

Open

Roxyrob mentioned this issue Jul 17, 2020

Destroying Security Groups Takes Forever with Attached SG #265

Closed

satadruroy mentioned this issue Aug 13, 2020

eks aws_auth configmap management may cause race conditions SUSE/cap-terraform#84

Open

gdavison added the upstream-terraform Addresses functionality related to the Terraform core binary. label Mar 9, 2021

gdavison closed this as completed Mar 9, 2021

ghost locked as resolved and limited conversation to collaborators Apr 9, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AWS security groups not being destroyed #2445

AWS security groups not being destroyed #2445

szczad commented Nov 27, 2017

mzupan commented Nov 30, 2017

szczad commented Dec 1, 2017

apparentlymart commented Dec 19, 2017

szczad commented Dec 19, 2017

FrenchBen commented Apr 11, 2018

eriksw commented Dec 30, 2019

rehevkor5 commented Feb 1, 2020 •

edited

Loading

linuxman79 commented Feb 6, 2020

pneigel-ca commented Feb 17, 2020

perelin commented Mar 18, 2020

brandon-fryslie commented Mar 19, 2020

davehewy commented Mar 25, 2020

johnlabarge commented Apr 24, 2020

weibeld commented Apr 26, 2020 •

edited

Loading

gchek commented May 12, 2020

BcTpe4HbIu commented May 29, 2020

bengiddins commented Jun 9, 2020 •

edited

Loading

gchek commented Jun 9, 2020 •

edited

Loading

Roxyrob commented Jul 17, 2020

saurabh-hirani commented Dec 7, 2020

gchek commented Dec 7, 2020

gdavison commented Mar 9, 2021

gchek commented Mar 10, 2021

amartopoulos commented Mar 10, 2021

ghost commented Apr 9, 2021

AWS security groups not being destroyed #2445

AWS security groups not being destroyed #2445

Comments

szczad commented Nov 27, 2017

Terraform Version

Affected Resource(s)

Terraform Configuration Files

Debug Output

Expected Behavior

Actual Behavior

Steps to Reproduce

mzupan commented Nov 30, 2017

szczad commented Dec 1, 2017

apparentlymart commented Dec 19, 2017

szczad commented Dec 19, 2017

FrenchBen commented Apr 11, 2018

eriksw commented Dec 30, 2019

rehevkor5 commented Feb 1, 2020 • edited Loading

linuxman79 commented Feb 6, 2020

pneigel-ca commented Feb 17, 2020

perelin commented Mar 18, 2020

brandon-fryslie commented Mar 19, 2020

davehewy commented Mar 25, 2020

johnlabarge commented Apr 24, 2020

weibeld commented Apr 26, 2020 • edited Loading

gchek commented May 12, 2020

BcTpe4HbIu commented May 29, 2020

bengiddins commented Jun 9, 2020 • edited Loading

gchek commented Jun 9, 2020 • edited Loading

Roxyrob commented Jul 17, 2020

saurabh-hirani commented Dec 7, 2020

gchek commented Dec 7, 2020

gdavison commented Mar 9, 2021

gchek commented Mar 10, 2021

amartopoulos commented Mar 10, 2021

ghost commented Apr 9, 2021

rehevkor5 commented Feb 1, 2020 •

edited

Loading

weibeld commented Apr 26, 2020 •

edited

Loading

bengiddins commented Jun 9, 2020 •

edited

Loading

gchek commented Jun 9, 2020 •

edited

Loading