Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue in enabling kubeflow in ubuntu 18.04 azure vm #2024

Closed
thirubeyond opened this issue Feb 17, 2021 · 4 comments
Closed

Issue in enabling kubeflow in ubuntu 18.04 azure vm #2024

thirubeyond opened this issue Feb 17, 2021 · 4 comments
Assignees
Labels

Comments

@thirubeyond
Copy link

On enabling kubeflow in azure vm, I get error

ERROR cannot load ssh client keys: open /var/snap/microk8s/2052/juju/share/juju/ssh: permission denied

OS: 18.04.5 LTS (Bionic Beaver)
microk8s.kubectl config view --raw > $HOME/.kube/config
export KUBECONFIG=$HOME/.kube/config
microk8s enable dns dashboard storage worked fine
snap list
microk8s v1.20.2 2052 latest/edge canonical✓ classic
inspection-report-20210217_122634.tar.gz

Tried:

-output of microk8s.kubectl get all --all-namespaces give all pods are running fine

  • output of microk8s inspect gave a warning
    WARNING: Port 16443 seems to be in use by another application.

I have been trying several times still hitting the same issue.

@knkski
Copy link
Contributor

knkski commented Feb 18, 2021

@thirubeyond: that's odd that you'd encounter this permissions issue if you're not running any command with sudo . Can you try running these commands to see if they fix the permissions issue?

sudo chgrp -R microk8s /var/snap/microk8s/current/juju
sudo chmod -R 775 /var/snap/microk8s/current/juju

If that doesn't work, can you try including the --debug flag and posting the output? e.g.

microk8s enable kubeflow --debug

@thirubeyond
Copy link
Author

thirubeyond commented Feb 19, 2021

@knkski : I had run the above chgrp/chmod commands last around, did it again now in fresh azure vm.

Experiment 1: enable kubeflow with debug (had set KUBECONFIG but not kept the kubeconfig file), got output Log1 (attached)
Experiment 2: generated kubeconfig file and enabled kubeflow with debug, got output Log 2 (attached)

Log2.pdf
Log1.pdf

After seeing log2 I ran below command:
sudo chgrp -R microk8s /tmp
sudo chmod -R 775 /tmp

Again I enabled kubeflow,
Now it proceeded and reached a stage showing " waiting for operator pod " for very long time (30mins) after which I have to stop

Attaching inspect log at 30 mins:
inspection-report-20210219_120910.tar.gz

Please advise.

@knkski
Copy link
Contributor

knkski commented Feb 22, 2021

@thirubeyond: You shouldn't have to set KUBECONFIG or modify the kubeconfig file. What happens if you leave both of those unset?

I see in the inspection log messages such as The connection to the server 127.0.0.1:16443 was refused - did you specify the right host or port?, which looks to me like a deeper MicroK8s issue, instead of a Kubeflow-specific error.

@stale
Copy link

stale bot commented Jan 19, 2022

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the inactive label Jan 19, 2022
@stale stale bot closed this as completed Feb 19, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants