Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SEDONA-408] Set a reasonable default size for RasterUDT #1046

Merged
merged 1 commit into from
Oct 4, 2023

Conversation

Kontinuation
Copy link
Member

@Kontinuation Kontinuation commented Oct 4, 2023

Did you read the Contributor Guide?

Is this PR related to a JIRA ticket?

What changes were proposed in this PR?

Set a default size for raster values. This is used by the optimizer to decide whether to broadcast the DataFrame or not.

The default value was 100 bytes before applying this patch, which is the default size of BinaryType. This is almost always too small for RasterUDT and will lead to large raster DataFrames being mistakenly broadcasted. We can override this method and set a better default size for RasterUDT. Maybe 512 KB is a reasonable default value.

How was this patch tested?

Manually tested using medium-sized datasets.

Did this PR include necessary documentation updates?

  • No, this PR does not affect any public API so no need to change the docs.

@jiayuasu jiayuasu added this to the sedona-1.5.0 milestone Oct 4, 2023
@jiayuasu jiayuasu merged commit 556af16 into apache:master Oct 4, 2023
40 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants