-
Notifications
You must be signed in to change notification settings - Fork 561
[kubernetes] k8s cluster starts working after nodes are restarted #123
Comments
How much time elapsed between the SP creation and the reboot? Also, I know it's a weird question, but what timezone did you execute the SP creation command in? |
@colemickens .. Let me answer the second question first. I executed sp creation commands, on my laptop in UTC+2 TZ! For the first question, it's not entirely clear to me. I created the SP using the new python |
Rebooted remaining nodes, and they came up in |
@kim0 How you get api server restarted? I've ssh-ed to master, tried to do same as you, but:
|
@olostan Your SP is very likely misconfigured. Please check the troubleshooting steps here: https://github.com/Azure/acs-engine/blob/master/docs/kubernetes.md#troubleshooting |
@colemickens wow... thnx. That helps - seems I really have
Sorry if it is not correct place (however could be useful for others if they have same problem, but is there any clue how to add those permissions? Actually I have created cluster exactly one-to-one as on this video: https://www.youtube.com/watch?v=nhY9XdzNbbY with |
@olostan If you used Can you please detail the exact command you ran (possibly looking through your shell history) and also paste the full output from (We might end up moving this over to https://github.com/Azure/azure-cli ...) |
Created Azure/azure-cli#1620 |
auth rights to resource groups should be able to be added through the Azure Portal. |
Please re-open if you encounter this issue again, since the latest az should fix this. |
I created a sp through
az ad sp create-for-rbac --role contributor --scopes /subscriptions/xxx-yyy-zzz
, then I deployed a k8s cluster through the portal UI. After the boxes were up, I ssh'ed into master node and:but
az login
was working fine with my sp account! Confused, I tried restarting the k8s api server usingdocker restart foo
, and suddenly the k8s api server was responding. Albeit all nodes were not ready!I rebooted agent-1 from the web portal UI .. a minute later
I didn't yet reboot the rest of nodes in case anyone wants to take a look. If I were to guess, It seems k8s cluster was up before AAD had fully replicated the sp account? and surprisingly, k8s does not auto-retry, but somehow gets stuck!
The text was updated successfully, but these errors were encountered: