KTOR-8292 Decode URL safe base64 correctly #4721

floscher · 2025-03-03T20:59:50Z

Subsystem
ktor-utils

Motivation
In a unit test for a Ktor project I was encoding and decoding some data with ByteArray.encodeBase64(): String and String.decodeBase64Bytes(): ByteArray and was wondering why the result was not what I was expecting. After decoding and encoding, the resulting data was very slightly different from what I put in.

Turns out that the issue was, that I was using URL-safe base64 (i.e. same as normal base64, but all + characters are replaced by - and all / replaced by _, also no = padding at the end).

After looking at the source code, I found that String.decodeBase64Bytes(): ByteArray treats every character that is not a valid Base64 character as if it were the character /. That's because the decoded character is determined using indexOf(), which returns -1, so the last character in the alphabet is selected.

So "_-!A".decodeBase64Bytes() is decoded to byteArrayOf(-1, -1, -64).

byteArrayOf(-1, -1, -64).encodeBase64() will encode to ///A.

Solution
I thought, this could be easily adapted to work with URL-safe base64 as well. Without impacting encoding and decoding of valid "normal" Base64. So that's what I did with this PR.

For strings containing non-base64 I was assuming this behaviour was currently undefined (and as far as I can see not documented) behaviour, so I hope it is fine to make this change here.

I also added a test case verifying that a string containing the entire Base64 alphabet (both normal and URL safe) is converted correctly, since several characters were not tested in the previous unit tests.

By the way: I'm not sure how far along it currently is, but there is now also a Base64 utility in the stdlib: https://kotlinlang.org/api/core/kotlin-stdlib/kotlin.io.encoding/-base64/ (currently needs opt-in to @ExperimentalEncodingApi). Are there any reasons holding ktor back from just using that (besides it still being marked as experimental)?

bjhham

Nice find!

Could you run ./gradlew formatKotlin to get past the code style check?

We might also want to target this for 3.2.0 in case people are relying on the incorrect behaviour.

floscher · 2025-03-04T07:47:17Z

@bjhham Done, it was just a blank line too much.

bjhham · 2025-03-07T13:10:45Z

Thanks again, I decided we can target this for 3.1.2 - here is a YouTrack ticket for the purposes of tracking https://youtrack.jetbrains.com/issue/KTOR-8292/URL-safe-base64-decoding-problem

…aracters This also checks decoding of URL-safe base64 strings, which currently does not work, but will be changed in a separate commit.

bjhham approved these changes Mar 4, 2025

View reviewed changes

bjhham approved these changes Mar 7, 2025

View reviewed changes

bjhham enabled auto-merge (squash) March 11, 2025 08:26

floscher added 3 commits March 11, 2025 09:27

Add unit test for encoding/decoding a string containing all base64 ch…

7ca305d

…aracters This also checks decoding of URL-safe base64 strings, which currently does not work, but will be changed in a separate commit.

Change base64 decoder, so it can also handle URL-safe base64 strings

e92e029

Fix code formatting using the formatKotlin task

1534afc

bjhham force-pushed the main branch from b043f0c to 1534afc Compare March 11, 2025 08:27

bjhham merged commit 37c6724 into ktorio:main Mar 11, 2025
14 of 16 checks passed

osipxd changed the title ~~Decode URL safe base64 correctly~~ KTOR-8292 Decode URL safe base64 correctly Mar 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

KTOR-8292 Decode URL safe base64 correctly #4721

KTOR-8292 Decode URL safe base64 correctly #4721

Uh oh!

floscher commented Mar 3, 2025 •

edited

Loading

Uh oh!

bjhham left a comment

Uh oh!

floscher commented Mar 4, 2025

Uh oh!

bjhham commented Mar 7, 2025

Uh oh!

Uh oh!

Uh oh!

KTOR-8292 Decode URL safe base64 correctly #4721

KTOR-8292 Decode URL safe base64 correctly #4721

Uh oh!

Conversation

floscher commented Mar 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bjhham left a comment

Choose a reason for hiding this comment

Uh oh!

floscher commented Mar 4, 2025

Uh oh!

bjhham commented Mar 7, 2025

Uh oh!

Uh oh!

Uh oh!

floscher commented Mar 3, 2025 •

edited

Loading