Adjust for pandas large string change #10942

Merged
merged 2 commits into from Feb 22, 2024

phofl (Collaborator) commented Feb 20, 2024

  • Closes #xxxx
  • Tests added / passed
  • Passes pre-commit run --all-files

Our testing function did not detect the difference...

pandas now uses large_string instead of string as the backend for StringDtype.

github-actions bot (Contributor) commented Feb 20, 2024

Unit Test Results

See test report for an extended history of previous test failures. This is useful for diagnosing flaky tests.

     15 files  ± 0       15 suites  ±0   3h 18m 30s ⏱️ + 2m 58s
 13 110 tests + 1   12 178 ✅ +1     931 💤 ±0  1 ❌ ±0 
162 325 runs  +12  145 767 ✅ +7  16 557 💤 +5  1 ❌ ±0 

For more details on these failures, see this check.

Results for commit b441aaa. ± Comparison against base commit 67ecd26.

♻️ This comment has been updated with latest results.

jrbourbeau (Member) left a comment

Thanks @phofl -- this LGTM if CI is happy. Just one non-blocking question

pdf.to_parquet(tmpdir + "string.parquet")
df = dd.read_parquet(tmpdir + "string.parquet")
assert_eq(df, pdf)
pd.testing.assert_frame_equal(df.compute(), pdf)
Member

Does this not happen inside assert_eq above?

Collaborator Author

no, but not sure why, I'll open an issue

phofl merged commit e20ca75 into dask:main on Feb 22, 2024
27 of 28 checks passed
phofl deleted the large_string_change branch on February 22, 2024 13:08