Remove reindex in Entity.query_by_values #626

Merged 9 commits on Jun 27, 2019

CJStadler
Contributor

reindex may change the row order of the dataframe, but the dataframe
should keep its existing ordering by time index.

After calculating features, call reindex to order the result by instance_ids.

Reindex may change the row order, and features may assume that the dataframe
is ordered by time index.

After calculating features, reindex to use the order of instance_ids.
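
The ordering issue described above can be sketched with plain pandas. This is a minimal illustration, not the code from this PR: the toy dataframe, the column names `time` and `value`, and the use of an `isin` mask to select rows are all assumptions made for the example.

```python
import pandas as pd

# Toy data (hypothetical): rows are indexed by id and already sorted by time.
df = pd.DataFrame(
    {
        "time": pd.to_datetime(
            ["2019-01-01", "2019-01-02", "2019-01-03", "2019-01-04"]
        ),
        "value": [10, 20, 30, 40],
    },
    index=pd.Index([0, 1, 2, 3], name="id"),
)

# Requested instances, deliberately not in time order.
instance_ids = [3, 0, 2]

# reindex returns rows in the order of the requested labels,
# so the time ordering of the dataframe is lost.
reordered = df.reindex(instance_ids)

# A boolean mask (assumed alternative) keeps the existing row order.
selected = df[df.index.isin(instance_ids)]

# An order-dependent calculation, computed while rows are still in time order.
running_total = selected["value"].cumsum()

# Only after the calculation, reorder the result to match instance_ids.
result = running_total.reindex(instance_ids)
```

Here `reordered` comes back in the order 3, 0, 2, while `selected` keeps the time order 0, 2, 3, so the running total is computed over the intended sequence before the final reindex.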

codecov bot commented Jun 25, 2019

Codecov Report

Merging #626 into master will increase coverage by <.01%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master     #626      +/-   ##
==========================================
+ Coverage   97.42%   97.43%   +<.01%     
==========================================
  Files         118      118              
  Lines        9532     9543      +11     
==========================================
+ Hits         9287     9298      +11     
  Misses        245      245

| Impacted Files | Coverage | Δ |
| --- | --- | --- |
| ...mputational_backend/test_feature_set_calculator.py | 100% <100%> (ø) | ⬆️ |
| featuretools/entityset/entity.py | 96.49% <100%> (-0.02%) | ⬇️ |
| ...s/computational_backends/feature_set_calculator.py | 98.12% <100%> (+0.01%) | ⬆️ |
| ...imitive_tests/test_groupby_transform_primitives.py | 100% <100%> (ø) | ⬆️ |
| featuretools/tests/entityset_tests/test_entity.py | 100% <100%> (ø) | ⬆️ |
| ...s/tests/primitive_tests/test_transform_features.py | 98.37% <100%> (ø) | ⬆️ |
| ...ools/tests/primitive_tests/test_direct_features.py | 100% <100%> (ø) | ⬆️ |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

This is faster and doesn't change the type.

The function can no longer accept lists, so update the tests to pass in
np.ndarray.
@CJStadler CJStadler merged commit c5f9e53 into master Jun 27, 2019
@CJStadler CJStadler deleted the cfm-always-sorted-by-time branch June 27, 2019 14:52
@rwedge rwedge mentioned this pull request Jul 3, 2019
johnnyheineken pushed a commit to johnnyheineken/featuretools that referenced this pull request Jul 7, 2019
Reindex could change the order, violating the invariant that the dataframe
is ordered by time index.

After calculating features reindex to use order of instance_ids.

FeatureSetCalculator.run can no longer accept lists, so this updates the tests
to pass in np.ndarray.
2 participants