-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
WIP: sparse dataset #84
Conversation
sparsity/sparse_frame.py
Outdated
) | ||
if prefixes: | ||
cols = list(map(lambda x: '{}_{}'.format(column, x), cols)) | ||
if column_cat is False: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hm @michcio1234 do you think we should add a check if the column is numerical here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Probably yes 👍
Codecov Report
@@ Coverage Diff @@
## ohe-include-untouched #84 +/- ##
=========================================================
+ Coverage 89.79% 90.07% +0.28%
=========================================================
Files 7 7
Lines 1215 1290 +75
=========================================================
+ Hits 1091 1162 +71
- Misses 124 128 +4
Continue to review full report at Codecov.
|
And change sf_arange fixture to have values from 1 to 10 rather than from 0 to 9, because 0 is treated as missing in sparse frames.
When the same dsf is computed twice, results may be different
If an argument is a generator, we will call next() on it before passing it to a function for each partition.
Previously, if sf was all-zero but not empty (shape was > 0), empty sf was returned anyway.
Superseded by 4 smaller PRs. |
No description provided.