Refactor of test suite and addition of explicit validation data sets #32

matthewcarbone · 2018-07-27T00:45:59Z

Major overhaul of the testing suite:

Move the tests out of the single test_script.py and into the test/ directory.
Explicit unit tests on Scan.
Needs more work: need to explicitly test every function individually.

Also, add the explicit validation data set option

Add the option to Scan to specify x_val and y_val.
Overrides the val_split option.
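To make the precedence concrete, here is a minimal sketch of how an explicit validation set could override the split fraction. This is not the talos implementation: `resolve_validation` and its arguments are hypothetical names chosen for illustration, and plain lists stand in for arrays.

```python
def resolve_validation(x, y, val_split=0.3, x_val=None, y_val=None):
    """Return (x_train, y_train, x_val, y_val).

    Hypothetical helper, not part of talos. If both x_val and y_val are
    given they are used as-is and val_split is ignored; otherwise the
    tail of the data is held out according to val_split.
    """
    if (x_val is None) != (y_val is None):
        raise ValueError("x_val and y_val must be provided together")

    if x_val is not None:
        # Explicit validation data takes precedence over val_split.
        return x, y, x_val, y_val

    if not 0.0 < val_split < 1.0:
        raise ValueError("val_split must be strictly between 0 and 1")

    n_val = int(len(x) * val_split)
    cut = len(x) - n_val
    return x[:cut], y[:cut], x[cut:], y[cut:]
```

With this shape, a caller who passes x_val and y_val keeps all of x and y for training, and the val_split value has no effect.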

Let me know what you think @mikkokotila. Hope I didn't do too much at once here...

Add docstrings to validation_split() and random_shuffle(). Also, add the capability to seed the random shuffle generator so that results are reproducible, at least in terms of the random shuffling of the train/cross-validation data.
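As an illustration of the seeding idea, a seeded shuffle could look roughly like the sketch below. It uses only the standard library and plain lists; `seeded_shuffle` is a hypothetical name and does not reproduce the actual random_shuffle() in this PR.

```python
import random


def seeded_shuffle(x, y, seed=None):
    """Shuffle x and y with the same permutation.

    Hypothetical helper for illustration. An integer seed makes the
    permutation repeatable; seed=None gives a different order per call.
    """
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")

    # A local generator keeps the global random state untouched.
    rng = random.Random(seed)
    order = list(range(len(x)))
    rng.shuffle(order)

    # Apply one permutation to both sequences so pairs stay aligned.
    return [x[i] for i in order], [y[i] for i in order]
```

Two calls with the same seed return the same ordering, so a train/validation split taken after the shuffle is identical from run to run.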

To let users augment their training data but not their validation data, implement an option to specify the validation dataset explicitly. (Note that nobody should ever augment first, then randomize, then split: augmented copies of training samples end up in the validation set and bias the validation score.) Also make the necessary changes to validation_split to account for this, and make a few quality-of-life edits to the testing suite.
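The ordering concern can be sketched as follows: split first, then augment only the training portion. Everything here is an assumption for illustration, including the name `split_then_augment` and the `augment` callback (a function that returns a list of variants for one sample); the data is assumed to be shuffled already.

```python
def split_then_augment(x, y, augment, val_split=0.3):
    """Hold out validation data first, then augment the training part.

    Hypothetical helper, not part of talos. `augment` maps one sample
    to a list of augmented variants that share the sample's label.
    """
    cut = len(x) - int(len(x) * val_split)
    x_train, y_train = list(x[:cut]), list(y[:cut])
    x_val, y_val = list(x[cut:]), list(y[cut:])

    # Variants are generated from training samples only, so nothing
    # derived from a validation sample can reach the training set.
    x_aug, y_aug = [], []
    for xi, yi in zip(x_train, y_train):
        for variant in augment(xi):
            x_aug.append(variant)
            y_aug.append(yi)

    return x_train + x_aug, y_train + y_aug, x_val, y_val
```

The held-out lists returned here are the kind of data one would then pass as x_val and y_val.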

Clean up completely. Make the following changes:
- Move all tests into test/ directory
- test_script.py still calls all the same unit checks as before
- Move test models (iris and cervical cancer) to talos/model/examples.py

Notably, bugfix an error in Scan where the attributes x_val and y_val were not being defined.

Appears that fmeasure was changed to fmeasure_acc in a previous commit.

pep8speaks · 2018-07-27T00:46:02Z

Hello @x94carbone! Thanks for submitting the PR.

In the file setup.py, following are the PEP8 issues :

Line 94:1: E124 closing bracket does not match visual indentation

In the file test/core_tests/__init__.py, following are the PEP8 issues :

Line 1:1: W391 blank line at end of file

coveralls · 2018-07-27T00:50:26Z

Pull Request Test Coverage Report for Build 103

43 of 44 (97.73%) changed or added relevant lines in 3 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.4%) to 88.571%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
talos/utils/validation_split.py	11	12	91.67%

Totals
Change from base Build 97:	0.4%
Covered Lines:	682
Relevant Lines:	770

💛 - Coveralls

mikkokotila · 2018-07-27T11:28:54Z

This is amazing! Thanks a lot for great work 👍

matthewcarbone · 2018-07-27T12:58:59Z

My pleasure! Got another one coming in a sec.

matthewcarbone and others added 18 commits July 23, 2018 00:05

Add docstrings, implement seed generator

35562ae

Add docstrings to validation_split() and random_shuffle(). Also, add the capability to seed the random shuffle generator so that results are reproducible, at least in terms of the random shuffling of the train/cross-validation data.

Merge branch 'dev' into implement-splits

9c4342e

Create ISSUE_TEMPLATE.md

86611e2

Update ISSUE_TEMPLATE.md

bb6803c

Update ISSUE_TEMPLATE.md

1f097e0

Merge branch 'implement-splits' into dev

5ebdaed

Minor linting

64a5982

Bugfix

895bc7c

Create CONTRIBUTE.md

93d0e8f

Completely refactor the testing suite

99761db

Clean up completely. Make the following changes:
- Move all tests into test/ directory
- test_script.py still calls all the same unit checks as before
- Move test models (iris and cervical cancer) to talos/model/examples.py

Finalize testing of Scan explicit val dataset

9bc1bf6

Notably, bugfix an error in Scan where the attributes x_val and y_val were not being defined.

Merge branch 'unitcheck_val_split' into dev

0a3a83d

Merge branch 'dev' of https://github.com/autonomio/talos into dev

c6ba185

Merge branch 'master' into dev

e466633

Add sklearn to required packages

eb8cf29

Fix import error from metrics re fmeasure_acc

3c50fd8

Appears that fmeasure was changed to fmeasure_acc in a previous commit.

Final checks

8a619a7

matthewcarbone requested a review from mikkokotila July 27, 2018 00:45

matthewcarbone mentioned this pull request Jul 27, 2018

Suggestion: allow custom import of cross-validation / train data sets instead of a split #21

Closed

mikkokotila merged commit 0705a17 into autonomio:dev Jul 27, 2018
