Use more efficient serialization format for long integers in cache files #20151

JukkaL · 2025-10-31T14:54:06Z

A long integer (one that doesn't fit in the 4-byte encoding) will now be encoded like this:

initial header byte
short integer (1-4 bytes) encoding the number of bytes of data and sign
variable-length number of data bytes (absolute value of the integer) -- all bits are used

For example, a 32-bit integer can now always be encoded using at most 6 bytes (+ type tag).

This is optimized for size efficiency, not performance, since large integers are not expected to be a performance bottleneck. Having an efficient format makes it easier to improve performance in the future, however, without changing the encoding.

The header byte has a few unused bits which could be used to slightly improve efficiency, but I decided that it's not worth the extra complexity.

ilevkivskyi

LG, thanks!

JukkaL added 4 commits October 31, 2025 10:38

Update test case

cf4cc24

Use more efficient binary encoding for long integers

6509dfb

Add tests

c23d3d6

Fix error handling

3d3c2ad

JukkaL requested a review from ilevkivskyi October 31, 2025 14:54

Actually execute the new test cases

3bb226d

ilevkivskyi approved these changes Oct 31, 2025

View reviewed changes

JukkaL merged commit 7213139 into master Oct 31, 2025
13 checks passed

JukkaL deleted the serialize-long-int branch October 31, 2025 16:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Use more efficient serialization format for long integers in cache files #20151

Use more efficient serialization format for long integers in cache files #20151

Uh oh!

JukkaL commented Oct 31, 2025

Uh oh!

ilevkivskyi left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Use more efficient serialization format for long integers in cache files #20151

Use more efficient serialization format for long integers in cache files #20151

Uh oh!

Conversation

JukkaL commented Oct 31, 2025

Uh oh!

ilevkivskyi left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants