T14397 baseline test plan by gctucker · Pull Request #55 · kernelci/kernelci-core

gctucker · 2019-05-02T11:25:53Z

This is an initial PR to start adding the new baseline test plan. See previous thread on the mailing list for some background:
https://groups.io/g/kernelci/topic/29693068

Initially this would be to run bootrr on a few platforms with a dedicated script to check the device drivers are initialised properly on them. Then it would be necessary to have a way to automatically determine the expected devices in order to scale better with the number of platforms tested on kernelci.org, probably by reading the device tree or some other info which the kernel itself uses to load drivers and probe devices.

Another part of this test plan would be to look for kernel errors rather than plainly relying on LAVA's attempt to reach a login prompt.

This PR relies on the bootrr scripts to be added to a buildroot image, so it depends on the following ones to be resolved first:

Fix google,kevin script in bootrr Google kevin fixes linux-msm/bootrr#24
Fix make install in bootrr T14637 fix install directory bootrr#1
Add bootrr package to buildroot T14637 add bootrr buildroot#3
Build a baseline buildroot rootfs with bootrr in Jenkins and use it on staging

gctucker · 2019-05-02T11:37:37Z

I've run this with my test tree and got this report:

gtucker/kernelci-local baseline: 122 tests, 0 regressions (kernelci-local-snapshot-037-1-gce3292b9c9eb)

Test results summary
--------------------

  Test:    baseline
  Tree:    gtucker
  Branch:  kernelci-local
  Kernel:  kernelci-local-snapshot-037-1-gce3292b9c9eb
  URL:     https://gitlab.collabora.com/gtucker/linux.git
  Commit:  ce3292b9c9eb51981c8a8d86ea74d5b871a6bc1f


1  | rk3288-veyron-jaq      | arm   |  61 total:  61 PASS   0 FAIL   0 SKIP
2  | rk3288-veyron-jaq      | arm   |  61 total:  61 PASS   0 FAIL   0 SKIP

There are 2 lines because they were with kernels built with gcc and clang. The rk3399-gru-kevin results are missing due to some infrastructure error (console was down), next round should have them in principle.

Some sample jobs in LAVA:
https://lava.collabora.co.uk/scheduler/job/1594445 (rk3288-veyron-jaq)
https://lava.collabora.co.uk/scheduler/job/1595670 (rk3399-gru-kevin)

test-configs.yaml

khilman · 2019-05-09T18:42:36Z

This change (specifically the last patch) is also using a modified buildroot ramdisk. What's missing in our buildroot images, and shouldn't we just update them instead?

gctucker · 2019-05-09T18:52:06Z

The bootrr scripts are not in the buildroot images. I'm waiting for some changes to get merged into bootrr itself, then I have some patches in buildroot to add a bootrr package using that. We can then decide to either have that enabled directly in the regular buildroot builds or have a variant for the "baseline" test plan (i.e. bootrr + other fast system-wide checks).

Ultimately I think the "baseline" test plan should replace our current "boot" tests (see thread on the mailing list). We can keep 2 separate rootfs images until that happens. I've found that the bootrr tests add about 10s to the LAVA job run times with buildroot, on rk3399-gru-kevin and rk3288-veyron-jaq, but using Debian (Stretch) would have a much bigger impact.

khilman · 2019-05-09T21:25:35Z

OK, thanks for the clarification. I think getting merging boot and baseline (or getting rid of boot) is the right strategy.

So, I'm fine with this change, but I'd prefer to keep everything that's staging only in the last patch. As I pointed out above, it's not quite a clean separation.

gctucker · 2019-05-10T07:06:30Z

@khilman So if I understand correctly what you mean, you prefer to have the bootrr scripts and any other potential tool needed for the baseline test plan to be added to the base image rather than have a separate baseline one?

I was initially planning to add a config fragment in buildroot to add bootrr and may other small things if necessary and keep building 2 flavours for each architecture, with and without this fragment. That's why I added the buildroot_baseline_ramdisk and that was not intended to be only for staging, only the temporary image URL was. But I'm OK with directly adding bootrr to the base image and simplify things a bit.

In any case I'll update this PR once other things it depends up have been resolved, I'll add a checklist to keep track of them.

gctucker · 2019-05-10T11:09:45Z

There's a small issue with how test results from LAVA callbacks are stored in the database: the test suite results are converted into a test group containing some results, but it doesn't take into account the case where there are several test suites in a single job or if the test suite doesn't have the same name as the test plan. In this case, the plan is "baseline" and the test suite is "bootrr", so the results are stored as "bootrr" rather than "baseline". This needs to be fixed so the plan name is used instead, and if there are several test suites in the job (or if subsequent jobs send more results for the same test plan) then the results for each test suite should be kept within a separate group.

gctucker · 2019-06-03T13:16:23Z

The backend has been fixed to handle LAVA jobs with test suites that have a different name than the test plan, or multiple test suites.

@khilman I think it's best to keep building 2 flavours of buildroot during the transition phase between boot and baseline: the one used for current boot tests, and a "baseline" one with extra scripts installed for bootrr. Once the "baseline" is running in production, we can decide to merge the changes and have only one buildroot file system or drop current boot tests once baseline is running on all the devices. If there's any issue with deploying "baseline" tests, current boot tests won't be affected.

khilman · 2019-06-03T18:40:18Z

@gctucker: OK with me to have both.

gctucker · 2019-06-05T08:32:35Z

Tested OK on staging with buildroot file systems from here:
http://staging-storage.kernelci.org/images/rootfs/buildroot/kci-2019.02-2-gefc755ba4a/

Sample boot and baseline runs:
https://lava.collabora.co.uk/scheduler/job/1637437 (boot on x86 Minnowboard Turbot)
https://lava.collabora.co.uk/scheduler/job/1637426 (baseline on arm64 rk3399-gru-kevin)

gctucker · 2019-06-05T08:33:34Z

Email report for the baseline test plan showing no failures:

kernelci/staging.kernelci.org baseline: 217 tests, 0 regressions (v5.2-rc3-25-g4bf231fa7f04)

Test results summary
--------------------

  Test:    baseline
  Tree:    kernelci
  Branch:  staging.kernelci.org
  Kernel:  v5.2-rc3-25-g4bf231fa7f04
  URL:     https://github.com/kernelci/linux.git
  Commit:  4bf231fa7f042454850eba9d9d8e44060c8a26bb


1  | rk3399-gru-kevin       | clang-8  | arm64 |  78 total:  78 PASS   0 FAIL   0 SKIP
2  | rk3288-veyron-jaq      | gcc-8    | arm   |  61 total:  61 PASS   0 FAIL   0 SKIP
3  | rk3399-gru-kevin       | gcc-8    | arm64 |  78 total:  78 PASS   0 FAIL   0 SKIP

gctucker · 2019-06-05T08:35:18Z

@khilman The "base" version of the file system failed for risc-v with this error:

18:37:21 Makefile:609: arch/riscv/Makefile: No such file or directory
18:37:21 make[1]: *** No rule to make target 'arch/riscv/Makefile'.  Stop.

However the "baseline" seems to have worked, so maybe that was a glitch in the Jenkins job. All the other combinations worked.

gctucker · 2019-06-05T08:38:02Z

For the record, this is how the Jenkins job is configured to run the build script for staging.kernelci.org, with PARAM_ARCH and PARAM_VARIANT set as Matrix axis values:

#!/bin/bash                                                                                                                                                                           

rm -rf kernelci-core
git clone --depth 1 --branch staging.kernelci.org https://github.com/kernelci/kernelci-core

export PATH=$PATH:$PWD/kernelci-core:$PWD/kernelci-core/jenkins
buildroot-builder.sh $PARAM_ARCH $PARAM_VARIANT

Values for PARAM_ARCH:

arc
armel
armeb
arm64
arm64be
mipsel
riscv
x86

Values for PARAM_VARIANT:

base baseline

gctucker · 2019-06-05T12:08:38Z

@khilman The "base" version of the file system failed for risc-v with this error:
18:37:21 Makefile:609: arch/riscv/Makefile: No such file or directory
18:37:21 make[1]: *** No rule to make target 'arch/riscv/Makefile'.  Stop.
However the "baseline" seems to have worked, so maybe that was a glitch in the Jenkins job. All the other combinations worked.

I ran a build again this morning and it failed in the same way, for some reason the "base" fragment fails with risc-v but not the "baseline" one o.O

khilman · 2019-06-05T19:27:59Z

jenkins/buildroot-builder.sh

+variant=${2:-base}
+
+# Build
+./configs/frags/build ${arch} ${variant}


The arg order is important here, we want options from $arch to override options from $variant (especially when variant == base) so this should be:
./configs/frags/build ${variant} ${arch}

gctucker · 2019-06-05T20:53:45Z

OK thanks @khilman, the latest update of this PR worked. I've now changed the staging Jenkins config to this:

#!/bin/bash                                                                                                                                                                           

rm -rf kernelci-core
git clone --depth 1 --branch staging.kernelci.org https://github.com/kernelci/kernelci-core
export PATH=$PWD/kernelci-core:$PATH
./kernelci-core/jenkins/buildroot-builder.sh $ARCH $FRAG

Add arch and frag arguments on the command line rather than relying on the environment variables. Default to "base" for the fragment as this is how KernelCI's buildroot build script works. Also use the fragment name in the path when uploading the rootfs binaries to the storage server. Signed-off-by: Guillaume Tucker <guillaume.tucker@collabora.com>

Add "set -e" to abort the script when any comand fails, typically if the build fails to avoid silently uploading any previous rootfs. Signed-off-by: Guillaume Tucker <guillaume.tucker@collabora.com>

Add a "baseline" test plan intended to ultimately replace "boot" tests. This starts by using bootrr to verify that all the expected device drivers have been intialised correctly, to strike a good compromise between runtime and test coverage. Signed-off-by: Guillaume Tucker <guillaume.tucker@collabora.com>

Add a definition for the buildroot_baseline_ramdisk rootfs as well as the baseline test plan and enable it on rk3399-gru-kevin and rk3288-veyron-jaq. These devices have a dedicated script in bootrr to look for specific device drivers and run a good series of checks. Signed-off-by: Guillaume Tucker <guillaume.tucker@collabora.com>

Enable the baseline test plan to run in lab-collabora. Signed-off-by: Guillaume Tucker <guillaume.tucker@collabora.com>

Tweak timeouts and add baseline reports for gtucker's tree. Signed-off-by: Guillaume Tucker <guillaume.tucker@collabora.com>

Enable sending baseline email reports to the regular kernelci-build-reports mailing list for mainline and next. Signed-off-by: Guillaume Tucker <guillaume.tucker@collabora.com>

gctucker · 2019-06-07T06:29:23Z

All the builds passed, I also added a step to delete the output directory before starting each buildroot build to avoid hard to track issues. The binaries can be found here:
http://staging-storage.kernelci.org/images/rootfs/buildroot/kci-2019.02-2-gefc755ba4a/

Email reports for boot and baseline using these file systems:

kernelci/staging.kernelci.org boot: 89 boots: 1 failed, 87 passed with 1 untried/unknown (v5.2-rc3-38-gfb9efe9dfd51)

Full Boot Summary: https://staging.kernelci.org/boot/all/job/kernelci/branch/staging.kernelci.org/kernel/v5.2-rc3-38-gfb9efe9dfd51/
Full Build Summary: https://staging.kernelci.org/build/kernelci/branch/staging.kernelci.org/kernel/v5.2-rc3-38-gfb9efe9dfd51/

Tree: kernelci
Branch: staging.kernelci.org
Git Describe: v5.2-rc3-38-gfb9efe9dfd51
Git Commit: fb9efe9dfd51e94129ef0342785a8bf3c907c4b2
Git URL: https://github.com/kernelci/linux.git
Tested: 34 unique boards, 15 SoC families, 3 builds out of 9

Boot Regressions Detected:

arm:

    multi_v7_defconfig:
        clang-8:
          bcm2836-rpi-2-b:
              lab-collabora-dev: failing since 10 days (last pass: v4.14-11011-ga39fa8b3e663 - first fail: v5.2-rc2-1-gf7e5586b71bd)

Boot Failure Detected:

arm:
    multi_v7_defconfig:
        clang-8:
            bcm2836-rpi-2-b: 1 failed lab

---
For more info write to <info@kernelci.org>



kernelci/staging.kernelci.org baseline: 278 tests, 0 regressions (v5.2-rc3-38-gfb9efe9dfd51)

Test results summary
--------------------
  Test:    baseline
  Tree:    kernelci
  Branch:  staging.kernelci.org
  Kernel:  v5.2-rc3-38-gfb9efe9dfd51
  URL:     https://github.com/kernelci/linux.git
  Commit:  fb9efe9dfd51e94129ef0342785a8bf3c907c4b2


1  | rk3399-gru-kevin       | gcc-8    | arm64 |  78 total:  78 PASS   0 FAIL   0 SKIP
2  | rk3288-veyron-jaq      | gcc-8    | arm   |  61 total:  61 PASS   0 FAIL   0 SKIP
3  | rk3399-gru-kevin       | clang-8  | arm64 |  78 total:  78 PASS   0 FAIL   0 SKIP
4  | rk3288-veyron-jaq      | clang-8  | arm   |  61 total:  61 PASS   0 FAIL   0 SKIP

gctucker · 2019-06-07T06:30:15Z

Updated Jenkins job script, for the record:

#!/bin/bash                                                                                                                                                                           

set -e

rm -rf kernelci-core
git clone --depth 1 --branch staging.kernelci.org https://github.com/kernelci/kernelci-core
export PATH=$PWD/kernelci-core:$PATH
rm -rf output
./kernelci-core/jenkins/buildroot-builder.sh $ARCH $FRAG

gctucker force-pushed the T14397-baseline-test-plan branch 5 times, most recently from 6fb632f to 0c334e8 Compare May 6, 2019 08:45

khilman approved these changes May 6, 2019

View reviewed changes

gctucker force-pushed the T14397-baseline-test-plan branch from 0c334e8 to 3ed50c0 Compare May 9, 2019 14:56

gctucker requested a review from khilman May 9, 2019 16:50

khilman reviewed May 9, 2019

View reviewed changes

test-configs.yaml Show resolved Hide resolved

gctucker force-pushed the T14397-baseline-test-plan branch from 3ed50c0 to 7c94c60 Compare May 10, 2019 07:01

gctucker force-pushed the T14397-baseline-test-plan branch from 7c94c60 to 53306f4 Compare May 10, 2019 07:49

gctucker mentioned this pull request May 10, 2019

Add IGT test plan for gru-kevin #58

Merged

gctucker force-pushed the T14397-baseline-test-plan branch from 53306f4 to f47f3df Compare May 10, 2019 16:19

gctucker force-pushed the T14397-baseline-test-plan branch 3 times, most recently from 785467e to d896732 Compare May 29, 2019 06:50

gctucker force-pushed the T14397-baseline-test-plan branch 2 times, most recently from 8a32dc9 to 1626085 Compare June 3, 2019 13:10

gctucker changed the title ~~RFC T14397 baseline test plan~~ WIP T14397 baseline test plan Jun 3, 2019

gctucker force-pushed the T14397-baseline-test-plan branch from 1626085 to f8adb58 Compare June 3, 2019 18:17

gctucker force-pushed the T14397-baseline-test-plan branch 2 times, most recently from 3004a63 to 86d43dd Compare June 4, 2019 09:37

gctucker mentioned this pull request Jun 4, 2019

jenkins/buildroot-builder: add fragment #83

Closed

gctucker force-pushed the T14397-baseline-test-plan branch from 86d43dd to 4a1936b Compare June 4, 2019 15:58

gctucker changed the title ~~WIP T14397 baseline test plan~~ T14397 baseline test plan Jun 5, 2019

khilman reviewed Jun 5, 2019

View reviewed changes

gctucker force-pushed the T14397-baseline-test-plan branch from 4a1936b to 6772571 Compare June 5, 2019 19:52

gctucker force-pushed the T14397-baseline-test-plan branch 2 times, most recently from f5f8068 to 8ff2501 Compare June 6, 2019 17:52

gctucker added 7 commits June 6, 2019 22:27

buildroot-builder.sh: abort when build fails

035c182

Add "set -e" to abort the script when any comand fails, typically if the build fails to avoid silently uploading any previous rootfs. Signed-off-by: Guillaume Tucker <guillaume.tucker@collabora.com>

lava-boot-v2.sh: run baseline in lab-collabora

e621f65

Enable the baseline test plan to run in lab-collabora. Signed-off-by: Guillaume Tucker <guillaume.tucker@collabora.com>

kernel-arch-complete.sh: update email reports for gtucker's tree

75bb6ac

Tweak timeouts and add baseline reports for gtucker's tree. Signed-off-by: Guillaume Tucker <guillaume.tucker@collabora.com>

kernel-arch-complete.sh: send baseline reports for mainline and next

b05e507

Enable sending baseline email reports to the regular kernelci-build-reports mailing list for mainline and next. Signed-off-by: Guillaume Tucker <guillaume.tucker@collabora.com>

gctucker force-pushed the T14397-baseline-test-plan branch from 8ff2501 to b05e507 Compare June 6, 2019 21:28

gctucker merged commit cbc35e3 into kernelci:master Jun 7, 2019

gctucker deleted the T14397-baseline-test-plan branch June 7, 2019 06:31

Conversation

gctucker commented May 2, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gctucker commented May 2, 2019

Uh oh!

Uh oh!

khilman commented May 9, 2019

Uh oh!

gctucker commented May 9, 2019

Uh oh!

khilman commented May 9, 2019

Uh oh!

gctucker commented May 10, 2019

Uh oh!

gctucker commented May 10, 2019

Uh oh!

gctucker commented Jun 3, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

khilman commented Jun 3, 2019 via email • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gctucker commented Jun 5, 2019

Uh oh!

gctucker commented Jun 5, 2019

Uh oh!

gctucker commented Jun 5, 2019

Uh oh!

gctucker commented Jun 5, 2019

Uh oh!

gctucker commented Jun 5, 2019

Uh oh!

khilman Jun 5, 2019

Choose a reason for hiding this comment

Uh oh!

gctucker commented Jun 5, 2019

Uh oh!

gctucker commented Jun 7, 2019

Uh oh!

gctucker commented Jun 7, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

gctucker commented May 2, 2019 •

edited

Loading

gctucker commented Jun 3, 2019 •

edited

Loading

khilman commented Jun 3, 2019 via email •

edited

Loading