Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add pipeline.should_skip_featurization flag #3849

Merged
merged 9 commits into from
Nov 22, 2022
Merged

Conversation

eccabay
Copy link
Contributor

@eccabay eccabay commented Nov 17, 2022

Fixes an issue where time series native estimators were double-featurizing during predict when featurization happens before running evalml

@codecov
Copy link

codecov bot commented Nov 17, 2022

Codecov Report

Merging #3849 (238f6f3) into main (1ee1c7f) will increase coverage by 0.1%.
The diff coverage is 100.0%.

@@           Coverage Diff           @@
##            main   #3849     +/-   ##
=======================================
+ Coverage   99.7%   99.7%   +0.1%     
=======================================
  Files        344     344             
  Lines      36185   36190      +5     
=======================================
+ Hits       36048   36053      +5     
  Misses       137     137             
Impacted Files Coverage Δ
evalml/pipelines/time_series_pipeline_base.py 100.0% <100.0%> (ø)
.../integration_tests/test_time_series_integration.py 100.0% <100.0%> (ø)

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

@eccabay eccabay marked this pull request as ready for review November 21, 2022 21:16
},
)
y = pd.Series(range(PERIODS))
if problem_type == ProblemTypes.TIME_SERIES_BINARY:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can you add the parametrization for these problem types?

assert pipeline.should_skip_featurization


@patch(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do you think its cleaner to add the pipeline fitting logic to test_can_run_automl_for_time_series_with_exclude_featurizers and check after search is run? I think either works!

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great idea, done!

@eccabay eccabay enabled auto-merge (squash) November 22, 2022 20:28
@eccabay eccabay merged commit 61a0a0e into main Nov 22, 2022
@eccabay eccabay deleted the skip_featurization branch November 22, 2022 20:49
@chukarsten chukarsten mentioned this pull request Nov 23, 2022
eccabay added a commit that referenced this pull request Nov 28, 2022
chukarsten pushed a commit that referenced this pull request Nov 29, 2022
* Revert "Add pipeline.should_skip_featurization flag (#3849)"

This reverts commit 61a0a0e.

* release notes
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants