Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

don't enable node-problem-local detector on dns jobs #31312

Merged
merged 1 commit into from
Nov 24, 2023

Conversation

aojea
Copy link
Member

@aojea aojea commented Nov 24, 2023

Since NPD changed to be a daemonset and the workers are small, sometimes there is no room on the nodes for the pods to be scheduled and the job times out

These jobs test DNS, so no need to install node problem detector.

Change-Id: Ia772b00182ecb4cf40e0932f91df7e8410cbea32
@k8s-ci-robot k8s-ci-robot added cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. approved Indicates a PR has been approved by an approver from all required OWNERS files. labels Nov 24, 2023
@k8s-ci-robot k8s-ci-robot added area/config Issues or PRs related to code in /config area/jobs sig/network Categorizes an issue or PR as relevant to SIG Network. sig/testing Categorizes an issue or PR as relevant to SIG Testing. labels Nov 24, 2023
@aojea
Copy link
Member Author

aojea commented Nov 24, 2023

/assign @upodroid @ameukam

@ameukam
Copy link
Member

ameukam commented Nov 24, 2023

/lgtm

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Nov 24, 2023
Copy link
Member

@upodroid upodroid left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There is an open PR in k/k to have the master/control-plane use e2-standard-2 and the nodes to use e2-standard-2. double the cores for control plane instance

kubernetes/kubernetes#118626

/lgtm

@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: aojea, upodroid

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@aojea
Copy link
Member Author

aojea commented Nov 24, 2023

is this related?

k8s.io/test-infra/prow/config/secret: TestAddWithParser/1 expand_less 2s
{Failed === RUN TestAddWithParser/1 === PAUSE TestAddWithParser/1 === CONT TestAddWithParser/1 time="2023-XX-24TXX:X2:00Z" level=warning msg="Error loading secret file." error="stat /tmp/X742X67904: no such file or directory" secret-path=/tmp/X742X67904 time="2023-XX-24TXX:X2:00Z" level=warning msg="Error loading secret file." error="stat /tmp/3086003526: no such file or directory" secret-path=/tmp/3086003526 agent_test.go:169: expected value 2 from generator, got 1 time="X0X3-11-X4T11:1X:01Z" level=error msg="Error loading secret." error="strconv.Atoi: parsing "not-a-number": invalid syntax" secret-path=/tmp/TestAddWithParser0360X481105/001/secret --- FAIL: TestAddWithParser/1 (2.00s)

https://prow.k8s.io/view/gs/kubernetes-jenkins/pr-logs/pull/test-infra/31312/pull-test-infra-unit-test/1728007651314372608

@aojea
Copy link
Member Author

aojea commented Nov 24, 2023

/test pull-test-infra-unit-test

@k8s-ci-robot k8s-ci-robot merged commit 223b877 into kubernetes:master Nov 24, 2023
7 checks passed
@k8s-ci-robot
Copy link
Contributor

@aojea: Updated the job-config configmap in namespace default at cluster test-infra-trusted using the following files:

  • key sig-network-misc.yaml using file config/jobs/kubernetes/sig-network/sig-network-misc.yaml

In response to this:

Since NPD changed to be a daemonset and the workers are small, sometimes there is no room on the nodes for the pods to be scheduled and the job times out

These jobs test DNS, so no need to install node problem detector.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@aojea
Copy link
Member Author

aojea commented Nov 25, 2023

It does not seem to make effect https://testgrid.k8s.io/sig-network-gce#gci-gce-coredns-nodecache , it keeps installing the node problem detector

@upodroid
Copy link
Member

upodroid commented Nov 25, 2023

It does not seem to make effect https://testgrid.k8s.io/sig-network-gce#gci-gce-coredns-nodecache , it keeps installing the node problem detector

You used the wrong variable

https://github.com/kubernetes/kubernetes/blob/d61cbac69aae97db1839bd2e0e86d68f26b353a7/cluster/gce/config-default.sh#L290

export ENABLE_NODE_PROBLEM_DETECTOR="${KUBE_ENABLE_NODE_PROBLEM_DETECTOR:-daemonset}"

@aojea
Copy link
Member Author

aojea commented Nov 25, 2023

It does not seem to make effect https://testgrid.k8s.io/sig-network-gce#gci-gce-coredns-nodecache , it keeps installing the node problem detector

You used the wrong variable

https://github.com/kubernetes/kubernetes/blob/d61cbac69aae97db1839bd2e0e86d68f26b353a7/cluster/gce/config-default.sh#L290

export ENABLE_NODE_PROBLEM_DETECTOR="${KUBE_ENABLE_NODE_PROBLEM_DETECTOR:-daemonset}"

:facepalm

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. area/config Issues or PRs related to code in /config area/jobs cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. sig/network Categorizes an issue or PR as relevant to SIG Network. sig/testing Categorizes an issue or PR as relevant to SIG Testing. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants