Add --verbose option to test/e2e/ #24471

jayunit100 · 2016-04-19T14:54:37Z

Problem
A few of the e2e tests are very verbose at 10+ node scale. This makes it hard to read and interpret the results of a test run, especially when there are multiple failures. The two things I've seen in the past:

hanging at large scale.
(failures) dumping all data for 100s of nodes to stdout.

Proposed Solution (4/18/2016)

When developing e2es at small (2-4) node scale, the info is very useful. We should keep logging lots of detail, but minimize the output by default unless the user specifies --verbose.

UPDATED SOLUTION (4/21/2016)

The simplest solution wound up simply supporting (2) below. (1) can be done later if we really need to.

So we should:

Introspect the cluster and find out how many nodes there are.
Add a Debugf which runs by default in small clusters.
Make dump call Debugf.
Drastically reduce the use of Logf across the e2e suite.
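
The steps above could be sketched roughly as follows. This is only an illustration of the gating logic in Python, not the e2e framework's Go code: make_debugf, dump_node_info, and the SMALL_CLUSTER_MAX_NODES threshold are hypothetical names, and the threshold of 4 is an assumption taken from the "small (2-4) node scale" mentioned earlier.

```python
import sys

# Assumption: "small" means the 2-4 node scale mentioned in the proposal.
SMALL_CLUSTER_MAX_NODES = 4


def make_debugf(node_count, verbose=False, stream=sys.stdout):
    """Return a debugf that prints only in small clusters or when verbose is set."""
    enabled = verbose or node_count <= SMALL_CLUSTER_MAX_NODES

    def debugf(fmt, *args):
        # Silently drop the message when debug output is disabled.
        if enabled:
            stream.write("DEBUG: " + (fmt % args if args else fmt) + "\n")

    return debugf


def dump_node_info(node_names, debugf):
    """Hypothetical dump helper: routes per-node detail through debugf."""
    for name in node_names:
        debugf("dumping info for node %s", name)
```

With this shape, a dump on a 100-node cluster prints nothing unless verbose is requested, while the same call on a 3-node cluster prints one line per node.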

cc @kubernetes/sig-testing @kubernetes/sig-scalability


jayunit100 · 2016-04-20T20:23:37Z

This script counts the log lines for each step in the E2Es...

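import fileinput

currStep = "N/A"
currCount = 0
print("starting")
def status():
        # Print the current step and the number of log lines seen since it began.
        print(currStep, " ", currCount)
for line in fileinput.input():
        if "STEP" in line:
                # A new step begins: report the previous one and reset the counter.
                status()
                currStep = line
                currCount = 0
        else:
                currCount = currCount + 1
# Report the final step, which has no following STEP line to trigger status().
status()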

Running this on the test output from the CI, I get these culprits for scale-out overlogging:

STEP: Waiting for a default service account to be provisioned in namespace
  85
STEP: Waiting for a default service account to be provisioned in namespace
  85
STEP: Waiting for a default service account to be provisioned in namespace
  85
STEP: creating replication controller cleanup60-7b6834b4-0720-11e6-a521-42010af0000d in namespace e2e-tests-kubelet-hb0om
  59
STEP: deleting replication controller cleanup60-7b6834b4-0720-11e6-a521-42010af0000d in namespace e2e-tests-kubelet-hb0om
  103
STEP: Waiting for a default service account to be provisioned in namespace
  125
STEP: creating replication controller svc-latency-rc in namespace e2e-tests-svc-latency-cs82v
  407
STEP: Waiting for a default service account to be provisioned in namespace
  85
STEP: Waiting for a default service account to be provisioned in namespace
  125
STEP: creating replication controller proxy-service-rd0m4 in namespace e2e-tests-proxy-51h6k
  692

We could actually run this at the end of Jenkins jobs if we want to punish overlogging programmatically, but for now I'll just audit these tests and make them less verbose.
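
As a rough sketch of what "punishing overlogging programmatically" might look like, the counting script can be turned into a check that returns a non-zero status when any step exceeds a line budget. The function names and the MAX_LINES_PER_STEP budget of 100 are hypothetical and not part of any existing Jenkins job.

```python
# Hypothetical budget: maximum log lines allowed after a single STEP line.
MAX_LINES_PER_STEP = 100


def count_lines_per_step(lines):
    """Return (step, count) pairs in order of appearance."""
    results = []
    step, count = "N/A", 0
    for line in lines:
        if "STEP" in line:
            # Close out the previous step before starting the new one.
            results.append((step, count))
            step, count = line.strip(), 0
        else:
            count += 1
    # Close out the final step.
    results.append((step, count))
    return results


def find_overloggers(lines, limit=MAX_LINES_PER_STEP):
    """Return only the steps whose line count exceeds the limit."""
    return [(s, c) for s, c in count_lines_per_step(lines) if c > limit]


def check(lines, limit=MAX_LINES_PER_STEP):
    """Print offenders and return an exit status (1 if any step is over budget)."""
    offenders = find_overloggers(lines, limit)
    for step, count in offenders:
        print(f"{count:6d}  {step}")
    return 1 if offenders else 0
```

A job could then call sys.exit(check(sys.stdin)) on the captured test output, so that a step such as the 692-line proxy-service one above fails the check.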

jayunit100 · 2016-04-20T20:26:33Z

fyi @timothysc "scale killers" overlogging.

timothysc · 2016-04-20T20:57:16Z

Then fix them ;-)

jayunit100 · 2016-04-20T21:56:05Z

On it yup

jayunit100 · 2016-04-21T18:16:19Z

Culprit: config.DefaultReporterConfig.Verbose, which is currently set to true.

This has the advantage of giving us spec progress.
The disadvantage is that in ginkgo that also means the Logs get streamed out.

So, I think the simplest solution is to have the debug logs go into an output file, which has both INFO as well as DEBUG.

jayunit100 · 2016-04-21T19:15:41Z

Dug some more. I have a patch that will do this the easy way:

Require -v 2 to see the ugly granular logs, which go through the Debugf pathway.
User can decide and no need for extra directories/files.

Thus, this separates ginkgo progress logs from debug e2e logs (which almost always really should be verbose) and from the other logs (which can be turned on/off via the glog -v flag).
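
The separation described here might look something like the sketch below. It is a Python illustration only, not the patch itself: the E2ELogger class and its method names are hypothetical, and the levels for each channel (progress always shown, other logs at -v 1 and above, Debugf at -v 2 and above) are assumptions drawn from the "-v 2" requirement mentioned above.

```python
import sys


class E2ELogger:
    """Hypothetical logger with three channels gated by a verbosity level."""

    def __init__(self, verbosity=0, stream=sys.stdout):
        self.verbosity = verbosity
        self.stream = stream

    def _emit(self, fmt, args):
        self.stream.write((fmt % args if args else fmt) + "\n")

    def progress(self, fmt, *args):
        # Spec progress is always shown, regardless of verbosity.
        self._emit(fmt, args)

    def logf(self, fmt, *args):
        # Assumption: ordinary logs appear at verbosity 1 and above.
        if self.verbosity >= 1:
            self._emit(fmt, args)

    def debugf(self, fmt, *args):
        # Granular debug logs require verbosity 2, as in "-v 2".
        if self.verbosity >= 2:
            self._emit(fmt, args)
```

The point of the design is that the user decides what to see with one flag, with no extra directories or files needed.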

@timothysc

Automatic merge from submit-queue

Logging soak

Implements #24427
Needs #24471 so that it doesn't clog test outputs for scale
Builds on the utils function added in support of #22869

cc @timothysc @kubernetes/sig-testing

0xmichalis · 2017-06-20T23:21:39Z

/sig testing

fejta-bot · 2017-12-29T06:18:41Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with an /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

spiffxp · 2018-01-07T01:29:19Z

/remove-area test-infra
/area test

fejta-bot · 2018-02-10T09:35:10Z

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten
/remove-lifecycle stale

fejta-bot · 2018-03-12T10:21:31Z

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

mikedanese added the area/test-infra label Apr 20, 2016

jayunit100 mentioned this issue Apr 20, 2016

Logging soak #24536

Merged

jayunit100 mentioned this issue Apr 21, 2016

Log control: Enable a DebugF implementation to separate *ginkgo progress logs* from *debug e2e logs* #24615

Closed

jayunit100 mentioned this issue May 11, 2016

Add a flag to disable dumpig logs after e2e test failure #25477

Merged

k8s-github-robot assigned jayunit100 Jun 8, 2016

k8s-github-robot added the needs-sig label May 31, 2017

k8s-ci-robot added the sig/testing label Jun 20, 2017

0xmichalis removed the needs-sig label Jun 20, 2017

k8s-ci-robot added the lifecycle/stale label Dec 29, 2017

k8s-ci-robot added area/test and removed area/test-infra labels Jan 7, 2018

k8s-ci-robot added lifecycle/rotten and removed lifecycle/stale labels Feb 10, 2018

k8s-ci-robot closed this as completed Mar 12, 2018
