[SPARK-55051][CORE] Byte string accepts KiB, MiB, GiB, TiB, PiB #53816

pan3793 · 2026-01-15T17:13:06Z

What changes were proposed in this pull request?

This PR enhances JavaUtils.byteStringAs to support parsing the input string that has suffixes Ki, KiB, Mi, MiB, and so on, which allows users to use, for example, 2GiB, as the value of byte type configurations.

Why are the changes needed?

Strictly speaking, 1KB = 1000B and 1KiB = 1024B, while currently, Spark only accepts 1K or 1KB and interprets it as 1KiB.

I'm not intending to "correct" it, but I think it should at least accept 1Ki or 1KiB as input, which usually gets complain by users who are familiar with K8s, as suffix Mi, GiB are widely used in the K8s ecosystem.

Does this PR introduce any user-facing change?

Yes, users are allowed to use 1Ki, 2MiB, etc. as the value of byte type configurations.

How was this patch tested?

UTs are added.

Was this patch authored or co-authored using generative AI tooling?

No.

github-actions · 2026-01-15T17:13:25Z

JIRA Issue Information

=== Improvement SPARK-55051 ===
Summary: Byte string accepts KiB, MiB, GiB, TiB, PiB
Assignee: None
Status: Open
Affected: ["4.2.0"]

This comment was automatically generated by GitHub Actions

pan3793 · 2026-01-16T03:12:00Z

cc @dongjoon-hyun @LuciferYang

LuciferYang · 2026-01-19T03:31:16Z

Merged into master. Thanks @pan3793 and @peter-toth

[SPARK-55051][CORE] Byte string accepts KiB, MiB, GiB, TiB, PiB

b3a3247

github-actions bot added the CORE label Jan 15, 2026

peter-toth approved these changes Jan 16, 2026

View reviewed changes

LuciferYang approved these changes Jan 19, 2026

View reviewed changes

LuciferYang closed this in f30bf06 Jan 19, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-55051][CORE] Byte string accepts KiB, MiB, GiB, TiB, PiB #53816

[SPARK-55051][CORE] Byte string accepts KiB, MiB, GiB, TiB, PiB #53816

pan3793 commented Jan 15, 2026

Uh oh!

github-actions bot commented Jan 15, 2026

Uh oh!

pan3793 commented Jan 16, 2026

Uh oh!

LuciferYang commented Jan 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[SPARK-55051][CORE] Byte string accepts KiB, MiB, GiB, TiB, PiB #53816

[SPARK-55051][CORE] Byte string accepts KiB, MiB, GiB, TiB, PiB #53816

Conversation

pan3793 commented Jan 15, 2026

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

github-actions bot commented Jan 15, 2026

JIRA Issue Information

Uh oh!

pan3793 commented Jan 16, 2026

Uh oh!

LuciferYang commented Jan 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants