
[FIX] results management and visualisation with missing test data #465

Merged
merged 4 commits into automl:development on Aug 12, 2022

Conversation

@ravinkohli ravinkohli (Contributor) commented Aug 8, 2022

Fixes issue #455

Types of changes

  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)

Note that a Pull Request should only contain one of refactoring, new features or documentation changes.
Please separate these changes and send us individual PRs for each.
For more information on how to create a good pull request, please refer to The anatomy of a perfect pull request.

Checklist:

  • My code follows the code style of this project.
  • My change requires a change to the documentation.
  • I have updated the documentation accordingly.
  • Have you checked to ensure there aren't other open Pull Requests for the same update/change?
  • Have you added an explanation of what your changes do and why you'd like us to include them?
  • Have you written new tests for your core changes, as applicable?
  • Have you successfully run tests with your changes locally?

Description

Currently, results_manager.py and results_visualizer.py assume that test data is always available. However, the API class allows users to run the optimisation without providing test data, which is a common use case, especially in AutoML. This PR makes the necessary changes to allow visualising and storing results when no test data is passed.
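
To illustrate the kind of guard involved, here is a minimal, self-contained sketch of how missing test scores can be tolerated when collecting results (the helper name `extract_scores` and the dictionary layout are assumptions for illustration, not the actual autoPyTorch implementation):

```python
from typing import Any, Dict, List, Optional

# Inference choices whose data may legitimately be absent (illustrative).
OPTIONAL_INFERENCE_CHOICES = ('test',)


def extract_scores(run_values: List[Dict[str, Any]]) -> Dict[str, List[Optional[float]]]:
    """Collect train/test losses per run, tolerating runs without test data."""
    scores: Dict[str, List[Optional[float]]] = {'train': [], 'test': []}
    for value in run_values:
        scores['train'].append(value.get('train_loss'))
        # A missing optional metric becomes None instead of raising KeyError,
        # so sprint_statistics-style summaries keep working.
        scores['test'].append(value.get('test_loss'))

    # If an optional choice has no data at all, keep it as an all-None column
    # so plotting code can simply skip it.
    for choice in OPTIONAL_INFERENCE_CHOICES:
        if all(score is None for score in scores[choice]):
            scores[choice] = [None] * len(run_values)
    return scores


if __name__ == '__main__':
    runs = [{'train_loss': 0.20}, {'train_loss': 0.15}]
    print(extract_scores(runs))  # {'train': [0.2, 0.15], 'test': [None, None]}
```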

Motivation and Context

Fixes #455

How has this been tested?

I have added tests in which the run history does not contain test scores; these tests ensure that the sprint_statistics and plot_perf_over_time functions work with such a run history.
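
For context (not part of this PR's diff), the user-facing call pattern this exercises looks roughly like the following sketch; the dataset, metric, and time limit are placeholders, and the exact `search` arguments should be checked against the autoPyTorch API docs:

```python
from sklearn.datasets import load_breast_cancer
from autoPyTorch.api.tabular_classification import TabularClassificationTask

X, y = load_breast_cancer(return_X_y=True)

api = TabularClassificationTask()
# No X_test / y_test are passed, so the optional 'test' inference choice
# has no data in the run history.
api.search(
    X_train=X,
    y_train=y,
    optimize_metric='accuracy',
    total_walltime_limit=120,
)

# Before this fix, these calls could raise KeyError: 'test_loss' (issue #455);
# afterwards they report only the metrics that are actually available.
print(api.sprint_statistics())
api.plot_perf_over_time(metric_name='accuracy')
```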

@ravinkohli ravinkohli changed the title from "add flexibility to avoid checking for test scores" to "[FIX] results management and visualisation with missing test data" on Aug 8, 2022
codecov bot commented Aug 8, 2022

Codecov Report

Merging #465 (b3d969c) into development (c7220f7) will increase coverage by 20.82%.
The diff coverage is 68.29%.

@@               Coverage Diff                @@
##           development     #465       +/-   ##
================================================
+ Coverage        64.65%   85.48%   +20.82%     
================================================
  Files              231      231               
  Lines            16304    16351       +47     
  Branches          3009     3028       +19     
================================================
+ Hits             10542    13977     +3435     
+ Misses            4714     1535     -3179     
+ Partials          1048      839      -209     
Impacted Files                                            Coverage            Δ
autoPyTorch/constants.py                                  100.00% <ø>         (ø)
autoPyTorch/utils/results_visualizer.py                   95.53% <42.85%>     (+55.53%) ⬆️
autoPyTorch/utils/results_manager.py                      95.30% <73.52%>     (+48.93%) ⬆️
...bone/forecasting_encoder/seq_encoder/TCNEncoder.py     96.46% <0.00%>      (+1.76%) ⬆️
autoPyTorch/ensemble/ensemble_selection.py                96.87% <0.00%>      (+2.08%) ⬆️
...nts/setup/network_backbone/ShapedResNetBackbone.py     100.00% <0.00%>     (+2.08%) ⬆️
autoPyTorch/evaluation/utils.py                           73.61% <0.00%>      (+2.77%) ⬆️
...peline/components/training/trainer/MixUpTrainer.py     97.14% <0.00%>      (+2.85%) ⬆️
...nts/setup/early_preprocessor/EarlyPreprocessing.py     85.71% <0.00%>      (+2.85%) ⬆️
... and 145 more


@theodorju theodorju (Collaborator) left a comment

The fix seems to resolve the issue, and both added test cases assert correct functionality. Some minor comments on my part.

@@ -28,6 +28,9 @@
]


OPTIONAL_INFERENCE_CHOICES = ('test',)

Would it make sense to transfer this to constants.py?
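
If the constant were centralised as suggested, usage would reduce to a shared import; a sketch of the idea (whether `OPTIONAL_INFERENCE_CHOICES` actually ended up in `autoPyTorch/constants.py` should be checked against the merged diff):

```python
# autoPyTorch/constants.py (hypothetical location for the shared constant)
OPTIONAL_INFERENCE_CHOICES = ('test',)

# results_manager.py and results_visualizer.py would then share one definition:
# from autoPyTorch.constants import OPTIONAL_INFERENCE_CHOICES
```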

Comment on lines 437 to 438
Checks if the data is missing for each optional inference choice and
sets the scores for that inference choice to all None.

It would be nice to also add the case of score == metric._worst_possible_result in the docstring:

Checks if the data is missing or if all scores are equal to the worst possible result for each optional inference choice and sets the scores for that inference choice to all None.
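
A sketch of the check that suggested docstring describes (the function below and its names are illustrative, following the reviewer's wording about `metric._worst_possible_result`, not the actual implementation):

```python
from typing import Dict, List, Optional, Tuple


def nullify_unavailable_choices(
    scores: Dict[str, List[Optional[float]]],
    worst_possible_result: float,
    optional_choices: Tuple[str, ...] = ('test',),
) -> Dict[str, List[Optional[float]]]:
    """Set an optional inference choice to all-None when its data is missing
    or every recorded score equals the worst possible result."""
    for choice in optional_choices:
        values = scores.get(choice)
        data_missing = values is None or all(v is None for v in values)
        all_worst = bool(values) and all(v == worst_possible_result for v in values)
        if data_missing or all_worst:
            scores[choice] = [None] * len(scores.get('train', []))
    return scores
```

An all-worst column is effectively equivalent to missing data, which is why the reviewer asks for both cases to be mentioned in the docstring.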

@ravinkohli ravinkohli merged commit d160903 into automl:development Aug 12, 2022
github-actions bot pushed a commit that referenced this pull request Aug 12, 2022
@ravinkohli ravinkohli mentioned this pull request Aug 23, 2022
ravinkohli added a commit that referenced this pull request Aug 23, 2022
* [FIX] Documentation and docker workflow file (#449)

* fixes to documentation and docker

* fix to docker

* Apply suggestions from code review

* add change log for release (#450)

* [FIX] release docs (#452)

* Release 0.2

* Release 0.2.0

* fix docs new line

* [FIX] ADD forecasting init design to pip data files (#459)

* add forecasting_init.json to data files under setup

* avoid undefined reference in scale_value

* checks for time series dataset split (#464)

* checks for time series dataset split

* maint

* Update autoPyTorch/datasets/time_series_dataset.py

Co-authored-by: Ravin Kohli <13005107+ravinkohli@users.noreply.github.com>

Co-authored-by: Ravin Kohli <13005107+ravinkohli@users.noreply.github.com>

* [FIX] Numerical stability scaling for timeseries forecasting tasks (#467)

* resolve rebase conflict

* add checks for scaling factors

* flake8 fix

* resolve conflict

* [FIX] pipeline options in `fit_pipeline` (#466)

* fix update of pipeline config options in fit pipeline

* fix flake and test

* suggestions from review

* [FIX] results management and visualisation with missing test data (#465)

* add flexibility to avoid checking for test scores

* fix flake and test

* fix bug in tests

* suggestions from review

* [ADD] Robustly refit models in final ensemble in parallel (#471)

* add parallel model runner and update running traditional classifiers

* update pipeline config to pipeline options

* working refit function

* fix mypy and flake

* suggestions from review

* fix mypy and flake

* suggestions from review

* finish documentation

* fix tests

* add test for parallel model runner

* fix flake

* fix tests

* fix traditional prediction for refit

* suggestions from review

* add warning for failed processing of results

* remove unnecessary change

* update autopytorch version number

* update autopytorch version number and the example file

* [DOCS] Release notes v0.2.1 (#476)

* Release 0.2.1

* add release docs

* Update docs/releases.rst

Co-authored-by: Difan Deng <33290713+dengdifan@users.noreply.github.com>
Development

Successfully merging this pull request may close these issues.

sprint_statistics() method gives KeyError: 'test_loss' in 0.2 version
2 participants