Run make check in Jenkins #125

elliottslaughter · 2022-01-28T21:58:58Z

I noticed that make check is not being run in Jenkins. I'm adding it because I suspect there are failures in the test suite that aren't being caught at the moment.

ahbarnett · 2022-02-03T21:52:38Z

@janden @garrettwrong Dear Joakim & Garrett, does either of you have a few minutes to look into what's happening here? Since we're starting to get a bunch of users, it would be great to have CI. (Of course, various devs like Melody, Johannes, and occasionally me are regularly checking it locally.)
It fails at the Python import stage, i.e. it's an environment issue, not a code issue:

    from pycuda._driver import *  # noqa
E   SystemError: initialization of _driver raised unreported exception

I know nothing about the jenkins GPU setup... Thanks, Alex
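
A minimal sketch of how an import-stage failure like the one above could be narrowed down, assuming only the Python standard library. The helper name `probe_import` is hypothetical and not part of cufinufft or pycuda; it simply reports which module fails to import and with what exception, without aborting on the first failure.

```python
import importlib


def probe_import(module_name):
    """Try to import a module and return (ok, detail).

    Hypothetical helper for illustration only. Any exception raised
    during import, including a SystemError from a compiled extension,
    is caught and reported instead of propagating.
    """
    try:
        module = importlib.import_module(module_name)
    except Exception as exc:
        return False, f"{type(exc).__name__}: {exc}"
    return True, getattr(module, "__version__", "unknown version")


# Probe the packages involved in the failing import, in dependency order.
for name in ("numpy", "pycuda.driver"):
    ok, detail = probe_import(name)
    print(f"{name}: {'ok' if ok else 'FAILED'} ({detail})")
```

Run inside the CI environment, this would show whether the failure is specific to the pycuda extension module or already present in one of its dependencies.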

janden · 2022-02-04T12:33:34Z

So it looks like there are two issues here:
– CI is failing currently due to NumPy version mismatches. This should be handled by #129.
– We are currently only running the Python tests, not the C++ tests.

Regarding the issue of the C++ tests, I think that our idea was that the Python tests could cover most of the code, so any bugs would trip them. That being said, I don't know that that is achieved, so it might make sense to include them. @ahbarnett, what do you think?

If we do want to include them, we have to see how that might best be achieved. It could be here in the Jenkinsfile or it could be in the Dockerfile where the library is being built (and the other calls to make are performed).

ahbarnett · 2022-02-04T21:15:39Z

Actually I have a question about this for @janden: does the Jenkins test (now using python3 -m pip install -e) actually compile libcufinufft.so locally, or is it precompiled and downloaded from somewhere else?

If the .so is locally compiled, then I think the Python tests are good enough for CI. The tests in make check are more extensive but less supportable, so let's keep them for devs only.

But @elliottslaughter, do you suspect particular make check items of not working on the Jenkins system?
Maybe now, since commit 46a8e8a fixes the environment, you can fetch upstream into your branch anyway.

elliottslaughter · 2022-02-05T06:51:37Z

I think I was confused about the make check because I expected an issue like #116 (comment) to break the build, but it doesn't. (I suspect it may be a GCC vs Clang issue, so the default CUDA setup with GCC doesn't catch it.)

Anyway, this is your project so feel free to do as you see fit, but my own two cents would be that the more tests, the better. Even if the Python and C++ tests overlap, make check is at least testing a different language interface. The CI is the perfect place to put this since you don't need to remember to run it, and it helps sanity check that everything stays working.

I rebased and the run passes, but maybe I messed up the Jenkins syntax because I can't actually see the make check running anywhere in the job output.

ahbarnett · 2022-02-08T22:49:08Z

Thanks Elliott for finding the MAX_NF bug, and we will aim to add make check to CI as you suggest. Best, Alex

ahbarnett

Hi Elliott,
Thanks for upstreaming this PR. But I can't just pull it in, since I don't see the Jenkins run producing any output from make check, so I don't believe it's being run in CI, right? However, I can now be persuaded that having it in CI is a good thing. Maybe @janden, as author of the Jenkinsfile, could help get this PR in? Thanks all, Alex

elliottslaughter · 2022-02-09T06:40:59Z

Is it possible that you're running a centrally-controlled Jenkinsfile, rather than the one in the repo? I tried another version of my change, and it literally isn't taking the script I'm putting in. I'm starting to suspect it's not actually running the Jenkinsfile out of the repository at all but using a centrally managed version, or something along those lines.

ahbarnett · 2022-02-09T23:00:52Z

I'm pinging @janden on this, since he set it up. Best, Alex

janden · 2022-02-10T10:31:22Z

> Actually I have a question about this for @janden: does the Jenkins test (now using python3 -m pip install -e) actually compile libcufinufft.so locally, or is it precompiled and downloaded from somewhere else?

It's compiled as part of the Dockerfile each time the CI is run.

> Anyway, this is your project so feel free to do as you see fit, but my own two cents would be that the more tests, the better. Even if the Python and C++ tests overlap, make check is at least testing a different language interface. The CI is the perfect place to put this since you don't need to remember to run it, and it helps sanity check that everything stays working.

I agree.

> Is it possible that you're running a centrally-controlled Jenkinsfile, rather than the one in the repo?

I'm not sure. It could be that it runs the one in the master branch (for safety reasons). That being said, I'm fairly certain that the Dockerfile is being updated from the branch, so you may have better luck putting in the make check line there. I also think it makes more sense to put it there since that's where the other calls to make are.

elliottslaughter · 2022-02-10T19:48:31Z

Good news: running make check in the Dockerfile does indeed cause it to run.

Bad news: it is now hitting:

../bin/spread1d_test: error while loading shared libraries: libcuda.so.1: cannot open shared object file: No such file or directory

I'm not really familiar with how these containers are set up, so I'm not sure where this library would be located?
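
A minimal sketch for checking whether the dynamic loader can find a shared library in the current environment, using only `ctypes` from the standard library. The helper name `can_load` is hypothetical. It does not say where the library lives; it only reports whether a process started in this environment would be able to open it, which is the condition `spread1d_test` is failing on.

```python
import ctypes


def can_load(library_name):
    """Return True if the dynamic loader can open the named shared library.

    Hypothetical helper for illustration. ctypes.CDLL raises OSError when
    the library cannot be found or opened, which is the same situation
    reported by the failing test binary.
    """
    try:
        ctypes.CDLL(library_name)
    except OSError:
        return False
    return True


# libcuda.so.1 is the library named in the error message above.
print("libcuda.so.1 loadable:", can_load("libcuda.so.1"))
```

Running this both during the image build and inside the running container would show whether the library is missing in only one of the two phases.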

janden · 2022-02-11T08:41:03Z

> I'm not really familiar with how these containers are set up, so I'm not sure where this library would be located?

This line (cufinufft/ci/docker/cuda10.1/Dockerfile-x86_64, line 72 in bb1453d):

    ENV LD_LIBRARY_PATH /io/lib:${LD_LIBRARY_PATH}

sets the necessary environment variable. Try invoking make check after it.

elliottslaughter · 2022-02-11T16:27:24Z

@janden I believe the most recent change I made matches what you suggested, but the Jenkins build still fails.

janden · 2022-02-14T11:27:29Z

Looks like the LD_LIBRARY_PATH is still incorrect. Let me try something.

janden · 2022-02-14T13:51:42Z

So I think I know what's happening. The way the docker image is set up, it doesn't actually have access to the GPU during the build phase. At this point, all that's happening is that the library is being compiled, so it doesn't actually need the GPU. Later, when we're testing, we do use the GPU, which is why Jenkins runs the docker image with the --gpus 1 flag set (otherwise we'd see the same crash during pytest).

It may be possible to give the docker image access to the GPU during the build phase, but this seems complicated. I therefore suggest you move the make check invocation back into the Jenkinsfile. Then we get back to the original problem that Jenkins won't run it until it's in master, but we'll just have to merge it in and see what happens.
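
To make the build-time versus run-time distinction concrete, here is a rough sketch of the kind of command that runs tests inside an already built image with a GPU attached. The `--gpus 1` flag is the one mentioned above; the image tag `cufinufft-ci:latest` and the helper name `gpu_test_command` are assumptions for illustration, and the actual Jenkinsfile may invoke Docker quite differently.

```python
import shlex


def gpu_test_command(image, test_cmd=("make", "check"), gpus="1"):
    """Build a `docker run` argument list that executes tests with GPU access.

    Hypothetical helper: the GPU is requested at run time via --gpus,
    which is why tests placed in the Dockerfile build step cannot see it.
    """
    return ["docker", "run", "--rm", "--gpus", gpus, image, *test_cmd]


# The image tag is a placeholder, not the name used by the real CI.
cmd = gpu_test_command("cufinufft-ci:latest")
print(shlex.join(cmd))
```

The point of the sketch is only that GPU-dependent steps belong to the run phase, which matches the suggestion to move make check back into the Jenkinsfile.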

elliottslaughter · 2022-02-14T22:21:15Z

Ok, I have reset this PR to its original form, adding the make check line to Jenkinsfile.

However, I really think you should check the Jenkins configuration first. As you can see at the link below, one of the options to configure the contents of a Jenkinsfile is through the admin UI. If this is how the instance is currently set up, it literally won't matter what is in the master branch (or anything else, aside from the UI).

https://www.jenkins.io/doc/book/pipeline/getting-started/#through-the-classic-ui

You'll want to double check that you've followed these instructions to make sure you're pulling the Jenkinsfile from the repository itself:

https://www.jenkins.io/doc/book/pipeline/getting-started/#defining-a-pipeline-in-scm

I don't see anything in there that would suggest that it only pays attention to certain branches (which is why I'm suggesting double checking that it's not configured via the UI).

janden · 2022-02-15T05:43:53Z

I don't have admin access to Jenkins, so there is no good way for me to verify or modify these settings.

It looks like Jenkins doesn't trust the Jenkinsfile in this PR (see https://jenkins.flatironinstitute.org/job/cufinufft/indexing/events), but it does trust the one from master. It might trust it if the PR comes from this repository. Will give it a try. Otherwise it won't hurt to just merge it.

janden · 2022-02-15T06:06:51Z

Ok. Looks like my hypothesis was correct.

I suggest we close this PR, undraft #134, and continue there. Since it's a branch on the main repository, Jenkins trusts the Jenkinsfile, so we can make any changes we want there and it should show up in the CI.

elliottslaughter · 2022-02-15T17:07:41Z

Looks like make check ran in master. Thanks all!

elliottslaughter force-pushed the eds/jenkins-make-check branch from f51a402 to 4ced4b5 Compare January 28, 2022 22:30

ahbarnett mentioned this pull request Feb 3, 2022

Add perlmutter build script #127

Merged

elliottslaughter force-pushed the eds/jenkins-make-check branch from 4ced4b5 to 0e4e5ad Compare February 5, 2022 06:00

ahbarnett requested changes Feb 8, 2022

View reviewed changes

elliottslaughter force-pushed the eds/jenkins-make-check branch from 0e4e5ad to 4e8f863 Compare February 9, 2022 06:23

elliottslaughter force-pushed the eds/jenkins-make-check branch from 4e8f863 to a090333 Compare February 10, 2022 19:30

elliottslaughter force-pushed the eds/jenkins-make-check branch from a090333 to 70c058e Compare February 11, 2022 16:09

Run make check in Jenkins.

36fdbd6

elliottslaughter force-pushed the eds/jenkins-make-check branch from 70c058e to 36fdbd6 Compare February 14, 2022 22:13

janden mentioned this pull request Feb 15, 2022

Run make check in Jenkins. #134

Merged

ahbarnett merged commit 3ae9871 into flatironinstitute:master Feb 15, 2022

elliottslaughter deleted the eds/jenkins-make-check branch February 15, 2022 17:07

janden mentioned this pull request Feb 15, 2022

Pass/fail checks for tests #135

Closed
