
Error: Post https://xxxx.eks.amazonaws.com/api/v1/namespaces/kube-system/configmaps: dial tcp xxx:443: i/o timeout #621

Closed
max-rocket-internet opened this issue Dec 10, 2019 · 23 comments

Comments

@max-rocket-internet
Contributor

On every example on first run:

Error: Post https://C39B1571479D46ED13A85BC0105510E2.gr7.us-west-2.eks.amazonaws.com/api/v1/namespaces/kube-system/configmaps: dial tcp 44.229.104.205:443: i/o timeout

  on ../../aws_auth.tf line 45, in resource "kubernetes_config_map" "aws_auth":
  45: resource "kubernetes_config_map" "aws_auth" {

This is because there's a small window after creation where the cluster endpoint has been returned but is not yet reachable.

If you rerun terraform apply the configmap is created fine.

Quite annoying 😅
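
For context, setups like the examples wire the kubernetes provider to the cluster roughly like this (a sketch; the exact example code may differ). The provider starts talking to the endpoint as soon as EKS reports the cluster as created, even if the API server isn't answering yet:

data "aws_eks_cluster" "cluster" {
  name = module.eks.cluster_id
}

data "aws_eks_cluster_auth" "cluster" {
  name = module.eks.cluster_id
}

provider "kubernetes" {
  # the endpoint is returned as soon as the cluster reports "created",
  # which can be before it is actually reachable
  host                   = data.aws_eks_cluster.cluster.endpoint
  cluster_ca_certificate = base64decode(data.aws_eks_cluster.cluster.certificate_authority[0].data)
  token                  = data.aws_eks_cluster_auth.cluster.token
  load_config_file       = false
}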

@dpiddockcmp
Contributor

Is this an EKS bug that's worth reporting to AWS container team, or an interplay between the aws and kubernetes providers?

@xposix

xposix commented Dec 10, 2019

I keep running into the same issue. IMO, the kubernetes provider should wait / retry a few times before coming back with an error.

@max-rocket-internet
Contributor Author

Is this an EKS bug that's worth reporting to AWS container team, or an interplay between the aws and kubernetes providers?

I think a socket error should be retried by the provider.

In the meantime, could we add a workaround somehow? Like a null resource that sleeps for 10 seconds after cluster creation?
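
Something like this, roughly (a sketch; the resource names here are illustrative, not what the module actually uses internally):

resource "null_resource" "wait_for_cluster" {
  # crude workaround: give the freshly created EKS endpoint a few seconds to start answering
  depends_on = [aws_eks_cluster.this]

  provisioner "local-exec" {
    command = "sleep 10"
  }
}

Anything that needs the API, like the aws-auth configmap, would then need a depends_on on that null_resource.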

@ivanich
Contributor

ivanich commented Dec 13, 2019

I'm using a kubectl hack for this, but it requires kubectl and a lot of depends_on, and in general it's ugly:

  provisioner "local-exec" {
    command = "until kubectl --kubeconfig ${path.root}/kubeconfig_${var.cluster_id} -n kube-system get pods >/dev/null 2>&1;do echo 'Waiting for EKS API';sleep 5;done"
  }
}

In the meantime I'm going to fork the EKS module with the kubernetes provider commits reverted, because it's not usable for our use cases.

@max-rocket-internet
Contributor Author

In the meantime I'm going to fork the EKS module with the kubernetes provider commits reverted, because it's not usable for our use cases.

You can just use release v7.0.1, no?
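
i.e. pin the module version in your config, roughly like this (the other arguments are placeholders for whatever you already pass):

module "eks" {
  source  = "terraform-aws-modules/eks/aws"
  version = "7.0.1"

  cluster_name = "my-cluster"
  subnets      = var.subnets
  vpc_id       = var.vpc_id
}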

@ivanich
Contributor

ivanich commented Dec 13, 2019

You can just use release v7.0.1, no?

7.0.1 has the faulty kubernetes provider commit, so I'm staying with 7.0.0 for now. If this stays an issue for a while, a fork is the way to go; at least we can keep the new features provided by upstream.

@max-rocket-internet
Contributor Author

7.0.1 has the faulty kubernetes provider commit

No it doesn't 🙂 I made it from the commit just before that one: #630

https://github.com/terraform-aws-modules/terraform-aws-eks/releases/tag/v7.0.1

@ivanich
Contributor

ivanich commented Dec 13, 2019

No it doesn't 🙂 I made it from the commit just before that one: #630

Nice, thanks for the tip, I hadn't checked the commits since the kubernetes provider merge. 👍

@dpiddockcmp
Contributor

dpiddockcmp commented Dec 15, 2019

This is definitely a "feature" of the AWS EKS API. It returns creation complete before the cluster is actually ready to be used 😞

This requires all tools that create clusters to have a retry on their first access.

We need to go back to having a retry-loop hack, which kind of defeats the point of moving to the kubernetes provider 😞

I was comparing with GCP's kubernetes offering. Their service only returns ready once the cluster is actually usable. You can easily chain GKE creation with the kubernetes terraform provider.
Test code here

Raised an issue in the containers roadmap here. See what they say.

@barryib
Member

barryib commented Dec 26, 2019

FYI, I opened a PR in the AWS provider to wait for the Kubernetes endpoint: hashicorp/terraform-provider-aws#11426. I don't know if it's the right quick win before aws/containers-roadmap#654 gets solved. Feedback is welcome.

@dpiddockcmp
Contributor

dpiddockcmp commented Dec 29, 2019

Hit another issue with this damn provider when fiddling about with node_groups:
EKS Node Groups modify, and if necessary, create the aws-auth ConfigMap. This causes the kubernetes provider to always fail if you create the cluster and node groups from scratch in one run. The k8s provider needs resources to not exist or to be imported. There's no force create option.

Can we go back to kubectl and forget this whole exercise?

Edit: Raised an issue with the k8s provider. See what they say. hashicorp/terraform-provider-kubernetes#719

@max-rocket-internet
Contributor Author

Can we go back to kubectl and forget this whole exercise?

I'm leaning towards this also.

@max-rocket-internet
Contributor Author

What should we do? Revert the k8s provider change? Or implement something like in this PR? #639

@dpiddockcmp
Contributor

Hit another issue with this damn provider when fiddling about with node_groups:
EKS Node Groups modify, and if necessary, create the aws-auth ConfigMap. This causes the kubernetes provider to always fail if you create the cluster and node groups from scratch in one run. The k8s provider needs resources to not exist or to be imported. There's no force create option.

Solved this problem with a null_data_source to enforce the ordering. So the main issue remaining is the endpoint not always being available after first creation. #639 looks like it might be the solution but that PR needs a little cleaning up.

@max-rocket-internet
Contributor Author

Solved this problem with a null_data_source to enforce the ordering

Great!

@jamescross91

Hit another issue with this damn provider when fiddling about with node_groups:
EKS Node Groups modify, and if necessary, create the aws-auth ConfigMap. This causes the kubernetes provider to always fail if you create the cluster and node groups from scratch in one run. The k8s provider needs resources to not exist or to be imported. There's no force create option.

Solved this problem with a null_data_source to enforce the ordering. So the main issue remaining is the endpoint not always being available after first creation. #639 looks like it might be the solution but that PR needs a little cleaning up.

Just ran into this on master - can you give an example of how we can fix it using a null_data_source?

@dpiddockcmp
Contributor

Just ran into this on master - can you give an example of how we can fix it using a null_data_source?

Basically:

  • Have a data null_data_source nds with inputs:
    • cluster_name. Doesn't matter where this comes from
    • kubernetes_config_map.aws_auth.id
  • Use data.null_data_source.nds.outputs["cluster_name"] as your cluster_name variable in the node groups

You'd have to fork the module to do it; it can't be done from outside the module. And it only solves the race condition between node groups and the cluster's first creation.
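
A rough sketch of the above, assuming a forked module where the node group sits alongside the configmap (the var.* values here are placeholders):

data "null_data_source" "nds" {
  inputs = {
    cluster_name = var.cluster_name
    # referencing the configmap's id forces it to be created first
    aws_auth     = kubernetes_config_map.aws_auth.id
  }
}

resource "aws_eks_node_group" "workers" {
  # reading the name through the data source gives the node group an
  # implicit dependency on the aws-auth configmap
  cluster_name    = data.null_data_source.nds.outputs["cluster_name"]
  node_group_name = "example"
  node_role_arn   = var.node_role_arn
  subnet_ids      = var.subnet_ids

  scaling_config {
    desired_size = 1
    max_size     = 1
    min_size     = 1
  }
}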

@barryib barryib mentioned this issue Jan 4, 2020
@erabug

erabug commented Jan 6, 2020

Today I ran into both this issue and the Error: configmaps "my-config" already exists issue @dpiddockcmp described. I switched to the branch from #639 and it solved the problem 🎉

@max-rocket-internet
Contributor Author

Resolved in #639

@ngortheone

@max-rocket-internet I am not familiar with the release cadence. When is this fix going to be available for regular users?

@barryib
Member

barryib commented Jan 14, 2020

@ngortheone it's already available in v8.0.0

@github-actions

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues. If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Nov 28, 2022