Improve to_parquet error for appended divisions overlap#9102

Merged
jcrist merged 2 commits into dask:main from jcrist:improve-parquet-overlap-error on May 19, 2022

@jcrist jcrist commented May 19, 2022

Small tweak to the error message for appended overlapping divisions in
`to_parquet`.

Fixes #8145.

  • Closes #xxxx
  • Tests added / passed
  • Passes pre-commit run --all-files

Two fixes:
- Previously, this check was accidentally quadratic.
- If we know the existing divisions aren't sorted, don't error on
overlap. We don't spend excessive effort checking this, since a false
positive error is easy to fix by passing in `ignore_divisions=True`.
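
The two fixes above can be sketched roughly as follows. This is only an illustration, not the code from this PR: `check_append_overlap` and `is_sorted` are hypothetical helper names, the divisions are plain Python lists, the error text is made up, and treating an equal boundary value as an overlap is an assumption.

```python
from typing import Sequence


def is_sorted(values: Sequence) -> bool:
    """Return True if values are in non-decreasing order (one linear pass)."""
    return all(a <= b for a, b in zip(values, values[1:]))


def check_append_overlap(
    old_divisions: Sequence,
    new_divisions: Sequence,
    ignore_divisions: bool = False,
) -> None:
    """Raise ValueError if the appended divisions overlap the existing ones.

    Hypothetical helper: the names and the message are illustrative only.
    """
    if ignore_divisions or not old_divisions or not new_divisions:
        return
    # If the existing divisions are known to be unsorted, an overlap check
    # is not meaningful, so skip it instead of raising a false positive.
    if not is_sorted(old_divisions):
        return
    # With sorted divisions, comparing the two boundary values is enough.
    # This avoids comparing every old division against every new one,
    # which would be quadratic.
    old_end = old_divisions[-1]
    new_start = new_divisions[0]
    if new_start <= old_end:  # assumption: an equal boundary counts as overlap
        raise ValueError(
            "The divisions of the appended dataframe overlap with the "
            "previously written divisions. To append anyway, pass "
            "ignore_divisions=True.\n"
            f"- End of last written partition: {old_end}\n"
            f"- Start of first new partition: {new_start}"
        )
```

Including both boundary values in the message is one way to make the failure easier to diagnose. A caller who hits a false positive can opt out with `ignore_divisions=True`.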

@ian-r-rose ian-r-rose left a comment

Thanks @jcrist

@jcrist jcrist merged commit 5c6627a into dask:main May 19, 2022
@jcrist jcrist deleted the improve-parquet-overlap-error branch May 19, 2022 20:45

Development

Successfully merging this pull request may close these issues.

to_parquet fails when append=True