
Module does not refresh data sources unless applied with a target #26109

Closed
favoretti opened this issue Sep 3, 2020 · 13 comments
Labels
explained: a Terraform Core team member has described the root cause of this issue in code
working as designed: confirmed as reported and closed because the behavior is intended

Comments

@favoretti

favoretti commented Sep 3, 2020

Terraform version

0.13.2

Given the following module piece:

resource "azurerm_key_vault" "this" {
  name                            = var.kv_name == "" ? module.labels.id_for_az_key_vault : var.kv_name
  resource_group_name             = var.resource_group_name
  location                        = module.labels.location
  sku_name                        = "premium"
  tenant_id                       = data.azurerm_subscription.this.tenant_id
  soft_delete_enabled             = true
  enabled_for_deployment          = true
  enabled_for_disk_encryption     = true
  enabled_for_template_deployment = false
  tags                            = module.labels.tags
}

resource "azurerm_key_vault_secret" "this" {
  for_each     = var.secrets
  name         = each.key
  value        = each.value
  key_vault_id = azurerm_key_vault.this.id
  content_type = "text/plain"

  depends_on = [
    azurerm_key_vault_access_policy.admins,
  ]
}

data "azurerm_key_vault_secret" "this" {
  for_each     = toset(var.get_secrets)
  name         = each.key
  key_vault_id = azurerm_key_vault.this.id

  depends_on = [
    azurerm_key_vault_access_policy.admins,
    azurerm_key_vault_access_policy.readers,
    azurerm_key_vault_access_policy.listers,
  ]
}

locals {
  data_secrets = { for s in var.get_secrets :
    s => data.azurerm_key_vault_secret.this[s].value
  }
}

Used in a plan like:

module "kv_fdx_external" {
  source = "git::git@ssh.dev.azure.com:v3/fdx-strat-pgm/common/terraform-azurerm-keyvault?ref=master"
  name   = "master"
  resource_group_name = azurerm_resource_group.this["primary"].name
  labels_context      = module.eastus2_cosmosdb_prediction_labels.context
  tenant_id           = local.tenant_id

  readers = {}

  admins = {
    (local.ad.groups.azure_fxs_strat_lumina_analyst_npe.name)   = local.ad.groups.azure_fxs_strat_lumina_analyst_npe.id
    (local.ad.groups.azure_fxs_strat_lumina_developer_npe.name) = local.ad.groups.azure_fxs_strat_lumina_developer_npe.id,
    (local.ad.groups.azure_fxs_strat_pgms_mgmt.name)            = local.ad.groups.azure_fxs_strat_pgms_mgmt.id
    (local.ad.groups.azure_fxs_strat_lumina_systems_team.name)  = local.ad.groups.azure_fxs_strat_lumina_systems_team.id
    terraform                                                   = local.terraform_service_principal_id
  }

  get_secrets = [
    "accuweather-locations-uri",
    "accuweather-key",
    "accuweather-forecasts-uri",
    "accuweather-conditions-uri",
 ...
  ]
}

The data sources for the get_secrets list are only refreshed once.
If I add more values to it, say clientsecret-portal-dev, and try to use one later as:

      clientsecret                                           = module.kv_fdx_external.secrets["clientsecret-portal-dev"]

This fails, saying that module.kv_fdx_external.secrets contains only the initial number of elements.

Use case: allow manually added secrets to be used in Terraform code.

What am I doing wrong?

@remche

remche commented Sep 4, 2020

I think I face the same issue.

When changing the instance count from 2 to 4, the null_data_source length is not updated and the zipmap function fails.

data "null_data_source" "nodes" {
  count = var.nodes_count
  inputs = {
    id          = openstack_compute_instance_v2.instance[count.index].id
    internal_ip = openstack_compute_instance_v2.instance[count.index].access_ip_v4
    floating_ip = var.assign_floating_ip ? openstack_networking_floatingip_v2.floating_ip[count.index].address : ""
  }
}

output "nodes" {
  value = zipmap(openstack_compute_instance_v2.instance[*].name, data.null_data_source.nodes[*].outputs)
}
Error: Error in function call

  on modules/node/output.tf line 15, in output "nodes":
  15:   value = zipmap(openstack_compute_instance_v2.instance[*].name, data.null_data_source.nodes[*].outputs)
    |----------------
    | data.null_data_source.nodes is tuple with 2 elements
    | openstack_compute_instance_v2.instance is tuple with 4 elements

Call to function "zipmap" failed: number of keys (4) does not match number of
values (2).

Note that using an interpolated value in the data source triggers a read (but requires confirmation to continue).
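One way to sidestep the stale data source entirely (my own suggestion, not something confirmed in this thread) would be to drop null_data_source and build the map with a for expression over the managed resources, so there is no separate read step that can lag behind the resource count:

```hcl
# Hypothetical sketch: build the nodes map directly from the
# openstack_compute_instance_v2 resources, so the output always has
# exactly as many entries as there are instances.
output "nodes" {
  value = {
    for idx, instance in openstack_compute_instance_v2.instance :
    instance.name => {
      id          = instance.id
      internal_ip = instance.access_ip_v4
      floating_ip = var.assign_floating_ip ? openstack_networking_floatingip_v2.floating_ip[idx].address : ""
    }
  }
}
```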

remche added a commit to remche/terraform-openstack-rke that referenced this issue Sep 4, 2020
@gbataille
Contributor

Same thing here:

I have an existing data source using for_each:

data "aws_iam_policy_document" "deploy_to_sub_account" {
  for_each = aws_organizations_account.sub_accounts

  policy_id = "DeployToSubAccount-${each.value.id}"

  statement {
    sid       = "AssumeRoleDeployment"
    effect    = "Allow"
    actions   = ["sts:AssumeRole"]
    resources = ["arn:aws:iam::${each.value.id}:role/role_deployment"]
  }
}

resource "aws_iam_policy" "deploy_to_sub_account" {
  for_each = aws_organizations_account.sub_accounts

  name        = "deploy_to_sub_account-${each.value.name}"
  path        = "/"
  description = "Allow to take the role_deployment on sub-account ${each.value.id}-${each.value.name}"
  policy      = data.aws_iam_policy_document.deploy_to_sub_account[each.key].json
}

As you can see, the aws_iam_policy resource uses data.aws_iam_policy_document, and both use the same for_each iterator. Yet when I try to terraform apply, it says:

Error: Invalid index

  on sub_account.deployment.tf line 20, in resource "aws_iam_policy" "deploy_to_sub_account":
  20:   policy      = data.aws_iam_policy_document.deploy_to_sub_account[each.key].json
    |----------------
    | data.aws_iam_policy_document.deploy_to_sub_account is object with 3 attributes
    | each.key is "discovery-research-production"

The given key does not identify an element in this collection value.

The 3 attributes mentioned are the ones already in the state. I'm actually trying to add one (I have added one element to the for_each iterator).
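A way to avoid the data source altogether here (a sketch of my own, not something suggested in the thread) would be to render the policy with jsonencode(), since aws_iam_policy_document only transforms configuration into JSON and holds no remote state:

```hcl
# Hypothetical alternative: inline the policy JSON so the
# aws_iam_policy resource has no data-source dependency that can
# go stale when the for_each set grows.
resource "aws_iam_policy" "deploy_to_sub_account" {
  for_each = aws_organizations_account.sub_accounts

  name        = "deploy_to_sub_account-${each.value.name}"
  path        = "/"
  description = "Allow to take the role_deployment on sub-account ${each.value.id}-${each.value.name}"
  policy = jsonencode({
    Version = "2012-10-17"
    Id      = "DeployToSubAccount-${each.value.id}"
    Statement = [{
      Sid      = "AssumeRoleDeployment"
      Effect   = "Allow"
      Action   = "sts:AssumeRole"
      Resource = "arn:aws:iam::${each.value.id}:role/role_deployment"
    }]
  })
}
```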

@gbataille
Contributor

I think I found an ugly workaround. The problem is in the dependency resolution (resources that need the new data element).

So the solution is to do a targeted apply to create just the new data element (and its upstream dependencies).

In my case

terraform apply -target 'data.aws_iam_policy_document.deploy_to_sub_account["discovery-research-production"]'

After that, I can apply the rest

@favoretti
Author

@gbataille Yeah, that's precisely what I stated in the subject of the issue - if you apply it with a target - it will work after that. Although I do not think this is the right way to go...

@gbataille
Contributor

@favoretti 👍

I had not understood it like that. Agreed that's not the way to go, but it unblocked me

@alisdair
Member

@favoretti Thanks for reporting this issue!

We are working on changes related to this, and while I'm not sure that this use case will be fixed, I'd like to be able to understand this issue. I've tried to reproduce it with local-only providers, and I haven't been able to do so. Can you help?

Here's my configuration:

variable "get_secrets" {
  type    = set(string)
  default = ["foo"]
}

data "null_data_source" "secrets" {
  for_each = var.get_secrets
  inputs = {
    secret = sha256(each.value)
  }
}

output "foo" {
  value = data.null_data_source.secrets["foo"].outputs.secret
}

This works as expected:

$ terraform-0.13.2 apply -auto-approve
data.null_data_source.secrets["foo"]: Refreshing state...

Apply complete! Resources: 0 added, 0 changed, 0 destroyed.

Outputs:

foo = 2c26b46b68ffc68ff99b453c1d30413413422d706483bfa0f98a5e886266e7ae

If I then add a new output to the configuration like so:

output "bar" {
  value = data.null_data_source.secrets["bar"].outputs.secret
}

And run an apply including "bar" in the set of secret IDs, it also works:

$ terraform-0.13.2 apply -auto-approve -var 'get_secrets=["foo", "bar"]'
data.null_data_source.secrets["bar"]: Refreshing state...
data.null_data_source.secrets["foo"]: Refreshing state... [id=static]

Apply complete! Resources: 0 added, 0 changed, 0 destroyed.

Outputs:

bar = fcde2b2edba56bf408601fb721fe9b5c338d10ee429ea04fae5511b68fbf8fb9
foo = 2c26b46b68ffc68ff99b453c1d30413413422d706483bfa0f98a5e886266e7ae

Can you change this simple example to be more like yours, so that it reproduces the issue you're seeing?

Failing that, can you provide a more minimal example of the configuration you're using which exhibits the bug?

@alisdair added labels "waiting for reproduction" (unable to reproduce issue without further information) and "waiting-response" (an issue/pull request is waiting for a response from the community), and removed label "new" (new issue not yet triaged) on Sep 17, 2020
@favoretti
Author

I'll try to create a dummy scenario to reproduce it in the coming day or two unless someone beats me to it.

@jbardin
Member

jbardin commented Oct 6, 2020

Hi @favoretti

I have a feeling that this is being caused by the fact that (up until 0.14) data sources were read during refresh whenever possible, so they would only reflect the existing state before any resources were applied. Using target in your case was causing something in the configuration to be shown as unknown, and forcing the data source to be read again during apply.

The azurerm_key_vault_secret.this resource is not referenced from data.azurerm_key_vault_secret.this, so terraform has no way to know that it should be refreshed after the resource is created, updated or replaced. There should be no reason however to use a data source and a managed resource representing the same thing within the same configuration (especially within the same module). If there are attributes not exposed by the resource but exist in the data source of the same type, that should be an issue raised with the provider to make the missing attributes available.

The next 0.14 release should greatly improve the ability to evaluate data sources accurately during plan, possibly allowing this to work if a reference were added from data.azurerm_key_vault_secret.this to azurerm_key_vault_secret.this, assuming there is a problem with the resources that needs to be worked around with this arrangement. As it stands, Terraform appears to be working as intended here, and we can close the issue. If there are additional problems, feel free to file a new issue, or ask in the community forum where there are more people ready to help.
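If the data source really must coexist with the managed resource, the dependency described above could be expressed explicitly. This is my own sketch of that idea, assuming the 0.14 planning improvements mentioned:

```hcl
# Hypothetical sketch: an explicit depends_on from the data source to
# the managed resource lets Terraform defer the read until after the
# secrets are created or updated. The resource and data source may
# share the local name "this" because they live in separate namespaces.
data "azurerm_key_vault_secret" "this" {
  for_each     = toset(var.get_secrets)
  name         = each.key
  key_vault_id = azurerm_key_vault.this.id

  depends_on = [
    azurerm_key_vault_secret.this,
    azurerm_key_vault_access_policy.admins,
  ]
}
```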

Thanks!

@jbardin jbardin closed this as completed Oct 6, 2020
@jbardin added labels "explained" (a Terraform Core team member has described the root cause of this issue in code) and "working as designed" (confirmed as reported and closed because the behavior is intended), and removed labels "bug", "waiting for reproduction", and "waiting-response" on Oct 6, 2020
@Nuru

Nuru commented Oct 15, 2020

@jbardin I am running into the same problem @gbataille had: #26109 (comment)

Are you sure you want to write this off as "works as designed"? Or do you want to open a new ticket about it, because it is different from the OP's issue? @gbataille and I are having a problem where the data source is a pure data source, not referencing an existing resource, and it is referenced from the resource that needs it, so it should trigger an update.

@jbardin
Member

jbardin commented Oct 15, 2020

@Nuru

It could be debated that the concept of data sources being read during refresh was a bug in the initial design, but that is one of the reasons we have removed the refresh phase altogether in 0.14. Having an independent data source and managed resource that represent the same actual resource has never been fully supported, because the data source always needed to be read, and hence the configuration fully evaluated, before anything was planned or applied.

This is mostly speculation here, because the issue was left without a clear reproduction, and no debugging information. If you have a way to reproduce your issue, please feel free to open a new one. It would be great if you could also try this on the latest 0.14 release as well, since that will help eliminate many common known issues.

Thanks!

@Nuru

Nuru commented Oct 16, 2020

@jbardin I experienced this in a full-blown complicated project, but have not been able to reproduce it in a small test case yet. If you have an idea why that might be, please let me know. My case was exactly like @gbataille reported in #26109 (comment) except I was going from 2 to 4 instances instead of from 3 to 4. Going from 1 to 2 in the successful test case, I see

Refreshing Terraform state in-memory prior to plan...
The refreshed state will be used to calculate this plan, but will not be
persisted to local or remote state storage.

data.aws_iam_policy_document.acct["a"]: Refreshing state... [id=1964335212]
data.aws_iam_policy_document.acct["c"]: Refreshing state...
aws_iam_policy.acct["a"]: Refreshing state... [id=arn:aws:iam::<redacted>:policy/acct-a]

In the complicated real case, the lines corresponding to

data.aws_iam_policy_document.acct["c"]: Refreshing state...

are missing from the plan output.

@jbardin
Member

jbardin commented Oct 16, 2020

@Nuru, I don't doubt you are experiencing an issue, and can definitely sympathize with the complexity of troubleshooting these. Unfortunately the information provided here doesn't narrow down the possibilities in any significant way. The vast majority of questions with this type of behavior are configuration issues, often combined with a misunderstanding of details in how data sources work. If a data source is not being read when expected, there is a dependency of some sort deferring that read. The logs may provide more info, but we usually need that paired with the full configuration to see what is actually being fed into the data source configuration to know for sure.

Hopefully the improvements in 0.14 will alleviate a lot of this confusion, because data sources will always be evaluated with the planning information available, which was not previously possible.

@ghost

ghost commented Nov 6, 2020

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.

If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@ghost ghost locked as resolved and limited conversation to collaborators Nov 6, 2020