Skip to content

Conversation

@cashmand
Copy link
Contributor

What changes were proposed in this pull request?

Previously, if variant_get returned a Variant or a nested type containing Variant, we would just return the slice of the value along with the full metadata. Typically, most of the metadata is relevant to other parts of the original Variant value, and is not needed, so the resulting binary is larger than needed. This can be very expensive if the value is written to disk (e.g. parquet or shuffle file).

Why are the changes needed?

Avoid unnecessarily large Variant values in memory and on disk.

Does this PR introduce any user-facing change?

No, the resulting Variant is logically the same as before, only storage size should change.

How was this patch tested?

Unit tests.

Was this patch authored or co-authored using generative AI tooling?

No.

@github-actions github-actions bot added the SQL label Jun 11, 2024
@cloud-fan
Copy link
Contributor

thanks, merging to master!

@cloud-fan cloud-fan closed this in 42cd961 Jun 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants