Detail how to debug CI failures as a contributor #3143

mariantalla · 2019-01-23T16:24:56Z

This PR adds more detail on what contributors can do to debug CI failures and flakes on their contributions.

Tasks:

What to do if failure is not related to the changes of the PR
- Triaging failure
- Opening an issue
- Notifying SIG

Is it worth, as part of this PR, including an overview of testgrid or other tooling?

mariantalla · 2019-01-24T13:08:52Z

@tpepper @nikhita @guineveresaenger @parispittman

I've added some documentation for "What do I do when my PR fails CI tests?". How does this align with what you wanted to include as part of #1537?

@spiffxp @BenTheElder: A lot of this content comes from the testing.md file that kubernetes/sig-release#428 aims to remove. It is mostly material that I thought was useful to contributors in general (as opposed to specific CI-related roles), but I would love to hear your feedback.

BenTheElder · 2019-01-24T20:48:47Z

thanks @mariantalla -- I'm pretty swamped at the moment, mind pinging me sometime next week to revisit? I would love to help add some details 😅

neolit123

thanks for the writeup @mariantalla 👍
i've added some minor comments.

neolit123 · 2019-01-27T20:20:31Z

contributors/devel/testing.md

+  - If yes, comment on it and link your PR, the failed run that affected you and any other information you think might be relevant
+  - If no, open a new issue and notify the appropriate SIG (see: SIG test escalation)
+
+#### SIG test escalation


i would title this:

Escalating failures to a SIG

Thanks, changed it!

neolit123 · 2019-01-27T20:21:36Z

contributors/devel/testing.md

+
+#### SIG test escalation
+- Figure out corresponding sig from test name/description
+- Mention the sig's github handle on the issue, optionally cc the SIG's chair(s) (locate them under kubernetes/community/sig-<name>)


some capitalizations:

sig -> SIG
github -> GitHub
cc -> CC

Thanks, I capitalized sig and github except where (I thought) it made sense to keep them lower-case (e.g. in URLs).

I changed cc to code format (`cc`) rather than capitalizing it, on the logic that it's sort of a command within a GitHub conversation... no strong opinions though. What do you think?

sounds good 👍
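
For the "figure out corresponding SIG from test name/description" step discussed in this thread, here is a minimal Python sketch. It assumes the test name carries a bracketed tag such as `[sig-node]`, which is common for e2e tests but not guaranteed for every job. The function names and the sample test names are hypothetical; only the `kubernetes/community/sig-<name>` location comes from the text under review.

```python
import re
from typing import Optional

# Assumption: the owning SIG appears in the test name as a bracketed tag,
# e.g. "[sig-node] Pods should ...". Names without a tag need manual triage.
SIG_TAG = re.compile(r"\[(sig-[a-z0-9-]+)\]", re.IGNORECASE)


def sig_from_test_name(test_name: str) -> Optional[str]:
    """Return the first SIG tag found in a test name (lower-cased), or None."""
    match = SIG_TAG.search(test_name)
    return match.group(1).lower() if match else None


def community_dir(sig: str) -> str:
    """Directory under kubernetes/community where the SIG's chairs are listed."""
    return f"kubernetes/community/{sig}"


if __name__ == "__main__":
    # Hypothetical failing test name, for illustration only.
    name = "[sig-node] Pods should be submitted and removed"
    sig = sig_from_test_name(name)
    if sig is None:
        print("No SIG tag found; triage manually from the test description")
    else:
        print(f"Escalate to {sig}; chairs are listed under {community_dir(sig)}")
```

If no tag is present, the sketch returns `None` rather than guessing, since the description or the job owner may be the only hint.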

neolit123 · 2019-01-27T20:24:48Z

contributors/devel/testing.md

@@ -225,3 +225,40 @@ version and the watch cache test is skipped.
 ## End-to-End tests

 Please refer to [End-to-End Testing in Kubernetes](e2e-tests.md).
+
+## Running your contribution in the Kubernetes CI
+Once you open a PR, prow will run pre-submit tests in CI.


possibly mention /ok-to-test and /retest and/or link to some of the PR / test related bot commands?

Added, thanks!

neolit123 · 2019-01-27T20:28:51Z

contributors/devel/testing.md

+### Troubleshooting a failure
+Click on `Details` and look at the [`gubernator`](gubernator.k8s.io/) output for the test.
+
+#### Troubleshooting failures/flakes that are not caused by your change


possibly unify the two sections?

we can explain that if the gubernator output seems unrelated to their change, contributors can first try calling /retest themselves. if the test continues to fail, they should contact a reviewer/maintainer.

usually what happens is that contributors first ping in the PR and, if nobody responds, they go asking on Slack.

Good point, we unified them under a single title and structured them a bit more.
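
The unified flow suggested above (retest first, then ping on the PR, then ask on Slack) can be sketched as a small decision function. This is only an illustration of the discussion, not text from the PR: the names `NextStep` and `next_step` and the `max_retests` threshold are assumptions.

```python
from enum import Enum


class NextStep(Enum):
    FIX_PR = "fix the change in your PR"
    RETEST = "comment /retest on the PR"
    PING_REVIEWER = "ping a reviewer/maintainer on the PR"
    ASK_ON_SLACK = "ask on Slack"


def next_step(related_to_change: bool, retests_tried: int,
              pinged_on_pr: bool, max_retests: int = 1) -> NextStep:
    """Pick the next action for a failed pre-submit run.

    max_retests is an assumed threshold; the thread does not say how many
    retests to try before escalating.
    """
    if related_to_change:
        return NextStep.FIX_PR
    if retests_tried < max_retests:
        return NextStep.RETEST
    if not pinged_on_pr:
        return NextStep.PING_REVIEWER
    return NextStep.ASK_ON_SLACK


if __name__ == "__main__":
    # An unrelated failure that survived one /retest and has not been raised yet.
    print(next_step(related_to_change=False, retests_tried=1, pinged_on_pr=False).value)
```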

Opt for active voicing for better readability.

Signed-off-by: Maria Ntalla <mntalla@pivotal.io>

mariantalla · 2019-02-20T12:34:19Z

/cc @spiffxp @fejta @stevekuznetsov @timothysc @tpepper

(sig-testing-leads and #1537 creator)

spiffxp

A few comments

spiffxp · 2019-02-20T14:55:10Z