Skip to content
This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

Add Jenkins setup for running nightly tests on examples #13270

Closed
wants to merge 7 commits into from

Conversation

kalyc
Copy link
Contributor

@kalyc kalyc commented Nov 14, 2018

Description

Add Jenkins setup for running nightly tests on examples

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

  • The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage:
  • Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
  • Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
  • Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
  • Code is well-documented:
  • For user-facing API changes, API doc string has been updated.
  • For new C++ functions in header files, their functionalities and arguments are documented.
  • For new examples, README.md is added to explain the what the example does, the source of the dataset, expected performance on test set and reference to the original paper if applicable
  • Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
  • To the my best knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

  • Add Jenkins setup for running nightly tests on examples for python2 & python3 on GPU

Comments

@kalyc
Copy link
Contributor Author

kalyc commented Nov 14, 2018

@marcoabreu @ankkhedia

@kalyc
Copy link
Contributor Author

kalyc commented Nov 14, 2018

@mxnet-label-bot add [pr-awaiting-review]

@marcoabreu marcoabreu added the pr-awaiting-review PR is waiting for code review label Nov 14, 2018
Copy link
Member

@roywei roywei left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for adding the example test!
How do we test this?

export MXNET_STORAGE_FALLBACK_LOG_VERBOSE=0
export MXNET_TEST_KERNEL=python3
cd /work/mxnet/tests/examples
nosetests-3.4 --with-xunit --xunit-file nosetests_tutorials.xml test_examples.py --nologcapture
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does this PR depend on #13271 to be merged?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

as mentioned here - #13271 (comment), please combine the PRs

@@ -1125,6 +1125,30 @@ nightly_tutorial_test_ubuntu_python2_gpu() {
nosetests-3.4 --with-xunit --xunit-file nosetests_tutorials.xml test_tutorials.py --nologcapture
}

nightly_example_test_ubuntu_python3_gpu() {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please review everything in here. It seems like this has just been copy and pasted

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes we need a similar setup - as we want to run the examples on GPU.
We want to know how to test the jenkins file locally so the code can be tested

ci/docker/runtime_functions.sh Outdated Show resolved Hide resolved

}

nightly_example_test_ubuntu_python2_gpu() {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we really need to test examples on python2? From what I understand only newly fixed examples are going to get added to the tests here and these will be python 3 compliant. And python 2 is getting EOLed in a year's time.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We haven't moved off of python2 yet - the other nightly tests also run for python2

ci/docker/runtime_functions.sh Outdated Show resolved Hide resolved
@marcoabreu
Copy link
Contributor

@jlcontreras

@kalyc
Copy link
Contributor Author

kalyc commented Nov 15, 2018

Addressed comments and removed extraneous commands from jenkins setup

@stu1130
Copy link
Contributor

stu1130 commented Nov 26, 2018

@marcoabreu @anirudhacharya @ankkhedia looks like your comments were addressed?

@marcoabreu
Copy link
Contributor

Did we test this yet?

@kalyc
Copy link
Contributor Author

kalyc commented Nov 27, 2018

@marcoabreu we have cut a TT at your end for assistance in helping with testing this setup. Could you help us out with it?

@marcoabreu
Copy link
Contributor

marcoabreu commented Nov 27, 2018 via email

@kalyc
Copy link
Contributor Author

kalyc commented Nov 29, 2018

@Chancebair could you help with testing this PR?

@nswamy
Copy link
Member

nswamy commented Dec 8, 2018

@kalyc could you please follow up and take it to completion.

@kalyc kalyc force-pushed the jenkins_setup_testing_examples branch from 76c24eb to a86bbdc Compare December 18, 2018 19:45
@kalyc kalyc force-pushed the jenkins_setup_testing_examples branch from a86bbdc to 1fcc220 Compare December 18, 2018 23:30
@Roshrini
Copy link
Member

Roshrini commented Jan 2, 2019

@kalyc Thanks for working on this! Is this PR good to go? Did it pass tests on jenkins setup?

@kalyc
Copy link
Contributor Author

kalyc commented Jan 2, 2019

@Roshrini looks like there is a build failure with this setup - http://jenkins.mxnet-ci-dev.amazon-ml.com/job/test-kalyc-NightlyTestsForBinaries/job/jenkins_setup_testing_examples/

Requesting @ankkhedia @roywei to take a look. Please let me know if this PR needs to be closed accordingly.

@kalyc
Copy link
Contributor Author

kalyc commented Jan 2, 2019

Error log -

======================================================================
FAIL: test_examples.test_cifar_gpu

Traceback (most recent call last):
File "/usr/local/lib/python3.5/dist-packages/nose/case.py", line 198, in runTest
self.test(*self.arg)
File "/work/mxnet/tests/examples/test_examples.py", line 79, in test_cifar_gpu
['python', os.path.join(example_dir, 'train_cifar10.py'), '--num-epochs', '5', 'gpus', '0'])
AssertionError


Ran 2 tests in 6024.068s

@kalyc
Copy link
Contributor Author

kalyc commented Jan 8, 2019

As discussed offline @ankkhedia and @roywei will take this PR forward. Closing this PR for now.

@kalyc kalyc closed this Jan 8, 2019
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
pr-awaiting-review PR is waiting for code review
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

8 participants