Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Increase api latency threshold for cluster-scoped list calls #52732

Merged
merged 1 commit into from
Sep 21, 2017

Conversation

shyamjvs
Copy link
Member

Recent change from @smarterclayton (#52237) added scope to apiserver metrics. As a result, our current threshold for list calls is no longer sufficient for all-namespace calls which are now being measured separately from namespaced lists. For e.g (from our last 5k run):

WARNING Top latency metric: {Resource:pods Subresource: Verb:LIST Scope:cluster Latency:{Perc50:4.498374s Perc90:7.548079s Perc99:8.169389s Perc100:0s} Count:1400}

cc @kubernetes/sig-scalability-misc @kubernetes/sig-api-machinery-misc @wojtek-t

@shyamjvs shyamjvs added the release-note-none Denotes a PR that doesn't merit a release note. label Sep 19, 2017
@shyamjvs shyamjvs added this to the v1.8 milestone Sep 19, 2017
@k8s-ci-robot k8s-ci-robot added sig/scalability Categorizes an issue or PR as relevant to SIG Scalability. size/S Denotes a PR that changes 10-29 lines, ignoring generated files. sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Sep 19, 2017
@apelisse apelisse added this to Backlog in 1.8 Failing tests Sep 19, 2017
@apelisse apelisse removed this from Backlog in 1.8 Failing tests Sep 19, 2017
@dims
Copy link
Member

dims commented Sep 19, 2017

/test all

@dims
Copy link
Member

dims commented Sep 20, 2017

/retest

@gmarek
Copy link
Contributor

gmarek commented Sep 20, 2017

/lgtm

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 20, 2017
@dims
Copy link
Member

dims commented Sep 20, 2017

/retest

@dims
Copy link
Member

dims commented Sep 20, 2017

@liggitt @smarterclayton - Can one of you PTAL?

@dims
Copy link
Member

dims commented Sep 20, 2017

/test pull-kubernetes-e2e-gce-gpu

// as list response sizes are bigger in general for big clusters. We also use a higher threshold
// for list calls with cluster scope (all namespaces).
apiListCallLatencyThreshold time.Duration = 5 * time.Second
apiClusterScopeListCallThreshold time.Duration = 10 * time.Second
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do you literally mean cluster scoped (i.e., node objects) or cross-namespace lists? If the latter, please choose a less confusing name.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I meant the former (which includes both non-namespaced and all-namespaced calls) - changed the comment to make it clearer.

if !isListCall ||
!isBigCluster ||
(!isClusterScopedCall && latency > apiListCallLatencyThreshold) ||
(latency > apiClusterScopeListCallThreshold) {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a really confusing logic statement. I suggest one if/switch to set the threshold, and then this if should read if latency > threshold.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Changed it. LG?

@k8s-github-robot k8s-github-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 21, 2017
@smarterclayton
Copy link
Contributor

Yay, more precise metrics! :)

@spiffxp
Copy link
Member

spiffxp commented Sep 21, 2017

/approve no-issue
/lgtm
per /lgtm from an approver above

@shyamjvs @gmarek a reminder that if the PR links to an issue (not PR) in the description, you don't need to /approve no-issue

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 21, 2017
@k8s-github-robot
Copy link

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: gmarek, shyamjvs, spiffxp

Associated issue requirement bypassed by: spiffxp

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these OWNERS Files:

You can indicate your approval by writing /approve in a comment
You can cancel your approval by writing /approve cancel in a comment

@k8s-github-robot k8s-github-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Sep 21, 2017
@k8s-github-robot
Copy link

/test all [submit-queue is verifying that this PR is safe to merge]

@smarterclayton
Copy link
Contributor

/test pull-kubernetes-kubemark-e2e-gce-big

@k8s-github-robot
Copy link

Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here..

@k8s-github-robot k8s-github-robot merged commit 5424861 into kubernetes:master Sep 21, 2017
@shyamjvs shyamjvs deleted the fix-metrics-perf-tests branch September 21, 2017 18:34
@mbohlool
Copy link
Contributor

cc @jpbetz

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. release-note-none Denotes a PR that doesn't merit a release note. sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. sig/scalability Categorizes an issue or PR as relevant to SIG Scalability. size/S Denotes a PR that changes 10-29 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

9 participants