Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

po/tf-job-operator logs said user cannot get endpoints in the namespace "default" #109

Closed
wydwww opened this issue Jan 10, 2018 · 3 comments

Comments

@wydwww
Copy link
Contributor

wydwww commented Jan 10, 2018

After ks apply default -c kubeflow-core, the jupyterhub was up but when I checked kubectl logs tf-job-operator-77b689c87d-g7w2z, it said

E0110 11:10:03.455096       1 election.go:226] error retrieving resource lock default/tf-operator: endpoints "tf-operator" is forbidden: User "system:serviceaccount:default:tf-job-operator" cannot get endpoints in the namespace "default"

It's pretty same with kubeflow/training-operator#173 but it's not deployed by helm, so how to fix this?

Thanks

@foxish
Copy link
Contributor

foxish commented Jan 10, 2018

This is an RBAC issue - you'd need to create a rolebinding that grants the ability to view the endpoints resource to default:tf-job-operator. https://kubernetes.io/docs/admin/authorization/rbac/ is a good starting point on RBAC.

@wydwww
Copy link
Contributor Author

wydwww commented Jan 11, 2018

@foxish Thanks! It's easy to fix

kubectl create clusterrolebinding tf-job-rule --clusterrole=cluster-admin --serviceaccount=default:tf-job-operator

Do I need to add this to https://github.com/google/kubeflow#rbac-clusters?

@jlewi
Copy link
Contributor

jlewi commented Jan 11, 2018

The ksonnet configs should create the necessary roles. So if that's not happening its a bug and we should figure out why and fix it.

@jlewi jlewi closed this as completed Mar 26, 2018
yanniszark pushed a commit to arrikto/kubeflow that referenced this issue Nov 1, 2019
* add notebook crd validation

* updated the test

* fixed crd validation

* run make test
elenzio9 pushed a commit to arrikto/kubeflow that referenced this issue Oct 31, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants