Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unify column names in dd.read_csv #1740

Merged
merged 1 commit into from Nov 1, 2016

Conversation

mrocklin
Copy link
Member

@mrocklin mrocklin commented Nov 1, 2016

Perhaps we should do this somewhere more globally?
Perhaps we shouldn't do this at all?

cc @jcrist thoughts on how to improve? Should we remove?

@@ -222,6 +222,17 @@ def test_enforce_columns(reader, blocks):
dask.compute(*dfs, get=get_sync)


def test_robust_column_mismatch():
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Slight nitpick - the test file is organized by functionality, might want to move this test down to below https://github.com/dask/dask/pull/1740/files#diff-62d83a5db4152149463203311f3d8b00R269?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah, thanks for the nit. I appreciate such organization myself. Fixed.

@jcrist
Copy link
Member

jcrist commented Nov 1, 2016

Overall this seems fine to me. I can see the use case for normalizing column names across files where they might not match.

Perhaps we should do this somewhere more globally?
Perhaps we shouldn't do this at all?
@mrocklin mrocklin merged commit ac3760b into dask:master Nov 1, 2016
@mrocklin mrocklin deleted the dd.read_csv-columns branch November 1, 2016 18:46
jsignell added a commit that referenced this pull request Aug 5, 2021
Reverts #1740 to make it so that when files have different columns, those column names are not overwritten.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants