Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support Woodwork 0.8.2 #2909

Merged
merged 6 commits into from
Oct 18, 2021
Merged

Support Woodwork 0.8.2 #2909

merged 6 commits into from
Oct 18, 2021

Conversation

freddyaboulton
Copy link
Contributor

@freddyaboulton freddyaboulton commented Oct 14, 2021

Pull Request Description

Fixes #2887, Fixes #2827

Perf tests are here. There is a slight drop in score for SeoulBikeData but I don't think it should hold up the upgrade.


After creating the pull request: in order to pass the release_notes_updated check you will need to update the "Future Release" section of docs/source/release_notes.rst to include this pull request by adding :pr:123.

@codecov
Copy link

codecov bot commented Oct 14, 2021

Codecov Report

Merging #2909 (276a33f) into main (e667959) will increase coverage by 0.1%.
The diff coverage is 100.0%.

Impacted file tree graph

@@           Coverage Diff           @@
##            main   #2909     +/-   ##
=======================================
+ Coverage   99.7%   99.7%   +0.1%     
=======================================
  Files        302     302             
  Lines      28412   28433     +21     
=======================================
+ Hits       28319   28340     +21     
  Misses        93      93             
Impacted Files Coverage Δ
evalml/tests/pipeline_tests/test_pipeline_utils.py 99.5% <ø> (ø)
.../transformers/preprocessing/datetime_featurizer.py 100.0% <100.0%> (ø)
evalml/pipelines/utils.py 99.4% <100.0%> (ø)
.../tests/component_tests/test_datetime_featurizer.py 100.0% <100.0%> (ø)
...mponent_tests/test_delayed_features_transformer.py 100.0% <100.0%> (ø)
evalml/tests/component_tests/test_featuretools.py 100.0% <100.0%> (ø)
...alml/tests/component_tests/test_one_hot_encoder.py 100.0% <100.0%> (ø)

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update e667959...276a33f. Read the comment docs.

Copy link
Contributor

@bchen1116 bchen1116 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@@ -40,7 +43,7 @@ Release Notes
* Deleted scikit-learn ensembler :pr:`2819`
* ``IterativeAlgorithm`` now requires X, y, problem_type as required arguments as well as sampler_name, allowed_model_families, allowed_component_graphs, max_batches, and verbose as optional arguments :pr:`2854`
* Changed method names of ``fit_features`` and ``compute_final_component_features`` to ``fit_and_transform_all_but_final`` and ``transform_all_but_final`` in ``ComponentGraph``, and ``compute_estimator_features`` to ``transform_all_but_final`` in pipeline classes :pr:`2902`

* Set the minimum allowed woodwork version to ``0.8.2``.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: include PR # here

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

100% Can't believe I missed that!

@@ -459,6 +459,9 @@ def test_delay_feature_transformer_woodwork_custom_overrides_returned_by_compone
X.ww.init(logical_types={0: logical_type})
except (ww.exceptions.TypeConversionError, ValueError):
continue
if X.loc[:, 0].isna().all():
# Casting the fourth and fifth dataframes to datetime will produce all NaNs
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

good comment!

Copy link
Contributor

@eccabay eccabay left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

Copy link
Contributor

@chukarsten chukarsten left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me! I'm curious if we should care more about the case of casting the dataframe to Datetime to create all NaN. That doesn't seem to be consistent with the other override types. If you think it's important to follow up on, we can. I don't see any need here to block this, though!

@@ -46,10 +51,10 @@ def _extract_month(col, encode_as_categories=False):
def _extract_day_of_week(col, encode_as_categories=False):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Filed this issue to refactor these two functions as they're the same.

@freddyaboulton freddyaboulton merged commit 53242f9 into main Oct 18, 2021
@freddyaboulton freddyaboulton deleted the 2887-upgrade-to-ww-0.8.2 branch October 18, 2021 18:44
@chukarsten chukarsten mentioned this pull request Oct 27, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Unpin woodwork version / support woodwork>=0.8.2 Error in infer_feature_types for seoul bike dataset
4 participants