Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

E2E kfctl go test should verify that Kubeflow is deployed. #2795

Merged
merged 4 commits into from Mar 27, 2019

Conversation

jlewi
Copy link
Contributor

@jlewi jlewi commented Mar 24, 2019


This change is Reviewable

…oyed.

* Related to kubeflow#2610 E2E test for go binary.

* Create kf_is_ready_test.py to verify that Kubeflow is deployed successfully.
* We don't reuse wait_for_kubeflow.py because that is configured to wait
  on JupyterHub.

  * We could have modified wait_for_kubeflow.py but I took this as an
    opportunity to convert the test to use pytest.

* We don't run the component tests because those are currently configured
  to depend on wait_for_kubeflow.py. So we will modify them in a follow
  on PR.
@jlewi jlewi changed the title [WIP] Add the component tests to the kfctl go E2E test. Add the component tests to the kfctl go E2E test. Mar 26, 2019
@jlewi
Copy link
Contributor Author

jlewi commented Mar 26, 2019

Test passed before rebasing
https://prow.k8s.io/view/gcs/kubernetes-jenkins/pr-logs/pull/kubeflow_kubeflow/2795/kubeflow-presubmit/6687/

So this should be ready to go.

@jlewi
Copy link
Contributor Author

jlewi commented Mar 26, 2019

/assign @gabrielwen
/assign @kkasravi

@jlewi jlewi changed the title Add the component tests to the kfctl go E2E test. E2E kfctl go test should verify that Kubeflow is deployed. Mar 26, 2019
@jlewi
Copy link
Contributor Author

jlewi commented Mar 26, 2019

Most recent failure was due to running out of CPU quota.

@jlewi
Copy link
Contributor Author

jlewi commented Mar 26, 2019

/test all

// Increase the log level so that info level log statements show up.
"--log-cli-level=info",
// Test timeout in seconds.
"--timeout=500",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

if it's waiting for the deployment to be ready, 500 seconds seems to be too constraint?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Intersting. It doesn't seem to work like that so I guess I can remove it.
If we look at the runs the tests are definitely running longer than 5 minutes.
http://testing-argo.kubeflow.org/workflows/kubeflow-test-infra/kubeflow-postsubmit-kfctl-go-4485a65-1297-efc5?tab=workflow

# Verify that components are actually deployed.
# TODO(jlewi): We need to parameterize this list based on whether
# we are using IAP or basic auth.
deployment_names = [
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

maybe read config files from bootstrap/config/... ?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it might be better to be explicit. This way its clear what the expectations are (e.g. which servicecs could be running).

If we parse the config and get a list of components dynamically then its not clear if we are setting expectations correctly. Also, if we do it that way and someone accidentally deletes some component then our E2E test wouldn't catch that.

@jlewi
Copy link
Contributor Author

jlewi commented Mar 27, 2019

/test all

@gabrielwen
Copy link
Contributor

/lgtm

@jlewi
Copy link
Contributor Author

jlewi commented Mar 27, 2019

/approve

Fingers crossed tests pass.

@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: jlewi

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@jlewi
Copy link
Contributor Author

jlewi commented Mar 27, 2019

/test all

Latest failure was on delete; looks like a transitory issue calling Deployment manager delete.

Error: couldn't delete KfApp: coordinator Delete failed for gcp: Gcp.Delete is failed for kubeflow-ci-deployment/k
fctl-d64a: Deleting kfctl-d64a error: Get https://www.googleapis.com/deploymentmanager/v2/projects/kubeflow-ci-deployment/global/operations/operation-1553
648831540-585090eb5ac11-86b3ff73-d1b65ebc?alt=json&prettyPrint=false: read tcp 10.36.54.88:58554->74.125.141.95:443: read: connection reset by peer

@jlewi
Copy link
Contributor Author

jlewi commented Mar 27, 2019

/test all

@k8s-ci-robot k8s-ci-robot removed the lgtm label Mar 27, 2019
@gabrielwen
Copy link
Contributor

/lgtm

@jlewi
Copy link
Contributor Author

jlewi commented Mar 27, 2019

/test all

@k8s-ci-robot k8s-ci-robot merged commit 6eddd95 into kubeflow:master Mar 27, 2019
lluunn pushed a commit to lluunn/kubeflow that referenced this pull request Mar 28, 2019
…2795)

* Extend the E2E test for the go binary to verify that Kubeflow is deployed.

* Related to kubeflow#2610 E2E test for go binary.

* Create kf_is_ready_test.py to verify that Kubeflow is deployed successfully.
* We don't reuse wait_for_kubeflow.py because that is configured to wait
  on JupyterHub.

  * We could have modified wait_for_kubeflow.py but I took this as an
    opportunity to convert the test to use pytest.

* We don't run the component tests because those are currently configured
  to depend on wait_for_kubeflow.py. So we will modify them in a follow
  on PR.

* kfctl_delete is now passing so enable it.

* Address comments.

* Disable katib stduy job test because it is flaky (Related to kubeflow#2825).
saffaalvi pushed a commit to StatCan/kubeflow that referenced this pull request Feb 11, 2021
…2795)

* Extend the E2E test for the go binary to verify that Kubeflow is deployed.

* Related to kubeflow#2610 E2E test for go binary.

* Create kf_is_ready_test.py to verify that Kubeflow is deployed successfully.
* We don't reuse wait_for_kubeflow.py because that is configured to wait
  on JupyterHub.

  * We could have modified wait_for_kubeflow.py but I took this as an
    opportunity to convert the test to use pytest.

* We don't run the component tests because those are currently configured
  to depend on wait_for_kubeflow.py. So we will modify them in a follow
  on PR.

* kfctl_delete is now passing so enable it.

* Address comments.

* Disable katib stduy job test because it is flaky (Related to kubeflow#2825).
saffaalvi pushed a commit to StatCan/kubeflow that referenced this pull request Feb 12, 2021
…2795)

* Extend the E2E test for the go binary to verify that Kubeflow is deployed.

* Related to kubeflow#2610 E2E test for go binary.

* Create kf_is_ready_test.py to verify that Kubeflow is deployed successfully.
* We don't reuse wait_for_kubeflow.py because that is configured to wait
  on JupyterHub.

  * We could have modified wait_for_kubeflow.py but I took this as an
    opportunity to convert the test to use pytest.

* We don't run the component tests because those are currently configured
  to depend on wait_for_kubeflow.py. So we will modify them in a follow
  on PR.

* kfctl_delete is now passing so enable it.

* Address comments.

* Disable katib stduy job test because it is flaky (Related to kubeflow#2825).
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants