-
Notifications
You must be signed in to change notification settings - Fork 205
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
homogenize object cols to str #558
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
looks good -- can you log a message that the retry is happening + on which columns?
…phistry into fix/hetero_feat
graphistry/feature_utils.py
Outdated
@@ -907,6 +907,7 @@ def process_dirty_dataframes( | |||
try: | |||
X_enc = data_encoder.fit_transform(ndf, y) | |||
except TypeError: | |||
logger.info("obj columns:", object_columns, "are being converted to str") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@dcolinmorgan logger.info("obj columns: %s are being converted to str", object_columns)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- ci failing?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
message was placed before object_columns was defined
object_columns = nndf.select_dtypes(include=['object']).columns | ||
nndf[object_columns] = nndf[object_columns].astype(str) | ||
X_enc = data_encoder.fit_transform(nndf, y) | ||
logger.info("obj columns: %s are being converted to str", object_columns) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can we get a test that triggers & handles this edge case?
can land after, looks good!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
see comments
g.umap() works even after adding this column to the untitled.json data