Skip to content

Commit

Permalink
Add sentence motivating why escaping is unecessary in row format
Browse files Browse the repository at this point in the history
  • Loading branch information
alamb committed Nov 4, 2022
1 parent 69af2ad commit 7f89c31
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions _posts/2022-10-30-multi-column-sorts-in-arrow-rust-part-2.md
Original file line number Diff line number Diff line change
Expand Up @@ -52,6 +52,8 @@ The row format is a variable length byte sequence created by concatenating the e
(Columns)
```

The encoding is carefully designed in such a way that escaping is unnecessary: it is never ambiguous as to whether a byte is part of a sentinel (e.g. null) or a value.

### Unsigned Integers

To encode a non-null unsigned integer, the byte `0x01` is written, followed by the integer’s bytes starting with the most significant, i.e. big endian. A null is encoded as a `0x00` byte, followed by the encoded bytes of the integer’s zero value
Expand Down

0 comments on commit 7f89c31

Please sign in to comment.