-
Notifications
You must be signed in to change notification settings - Fork 2.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Endpoint not coming up when Kfctl apply on GCP with IAP #3628
Comments
Issue-Label Bot is automatically applying the label Links: app homepage, dashboard and code for this bot. |
All backends seem healthy from the log. |
I just noticed in one of my running kubeflow cluster (endpoint accessible), the iap-enabler pod is also crash-looping. |
@lluunn were you able to access the kubeflow endpoint ? |
@johnugeorge did you try following the IAP troubleshooting guide? It might need some updating for ISTIO but it's a good place to start. |
I am seeing the same thing, on a GKE cluster created via
This is in a GCP project where I have previously created a GKE cluster with kfctl (using version 0.5.1) which no longer exists, with IAP / cloud endpoints, in case that makes any difference. Kubeflow in the cluster and IAP works just fine though - but I am left with a crash-looping iap-enabler pod. I think I know what the problem is - the ports:
- name: status-port
nodePort: 30777
port: 15020
protocol: TCP
targetPort: 15020
- name: http2
nodePort: 31380
port: 80
protocol: TCP
targetPort: 80
- name: https
nodePort: 31390
port: 443
protocol: TCP
targetPort: 443
- name: tcp
nodePort: 31400
port: 31400
protocol: TCP
targetPort: 31400
- name: https-kiali
nodePort: 32007
port: 15029
protocol: TCP
targetPort: 15029
- name: https-prometheus
nodePort: 30355
port: 15030
protocol: TCP
targetPort: 15030
- name: https-grafana
nodePort: 31764
port: 15031
protocol: TCP
targetPort: 15031
- name: https-tracing
nodePort: 30439
port: 15032
protocol: TCP
targetPort: 15032
- name: tls
nodePort: 32472
port: 15443
protocol: TCP
targetPort: 15443 The setup_backend.sh script that the iap-enabled pod is running tries to list a backend-service with gcloud that the nodePort value from the first port in the list - for me that value is kubeflow/kubeflow/gcp/setup_backend.sh Lines 70 to 72 in 6f790f8
and I don't have a backend service whose name contains that port:
|
I think a jsonpath in the kubectl command of |
The script selects the nodePort based on [the name of the port earlier in the script](https://github.com/kubeflow/kubeflow/blob/e6944f35149f7c362d32fb9d57fc9f3842a46347/kubeflow/gcp/setup_backend.sh#L26), but the query in the `checkBackend` function assumes that the port number is the first in the Service. I believe this fixes kubeflow#3628
This should fix kubeflow/kubeflow#3628. See also kubeflow/kubeflow#3691 (which modified the wrong file).
This should fix kubeflow/kubeflow#3628. See also kubeflow/kubeflow#3691 (which modified the wrong file).
This should fix kubeflow/kubeflow#3628. See also kubeflow/kubeflow#3691 (which modified the wrong file).
I tried KF installation with latest Kfctl(built using
make build-kfctl
) but I am not able to view the cloud IAP endpoint.I see that
iap-enabler
pod inistio-system
namespace is continuously crashing. Logs are given below./cc @lluunn
/cc @kunmingg
/cc @jlewi
The text was updated successfully, but these errors were encountered: