Skip to content

Commit

Permalink
Merge pull request #229 from aiven/ivanyu/readme-compression
Browse files Browse the repository at this point in the history
Document compression
  • Loading branch information
jeqo committed May 17, 2023
2 parents 1ff77a4 + d2b08cc commit 94cefb2
Showing 1 changed file with 4 additions and 4 deletions.
8 changes: 4 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -68,15 +68,15 @@ This encoding scheme allows indices to be stored (cached) in memory and accessed

### Compression

TBD
Optional data compression with Zstandard is supported. It can be disabled, enabled, or enabled conditionally based on a heuristic.

#### Compression heuristics
#### Compression heuristic

TBD
For each segment that is to be uploaded, the heuristic checks if the first batch in the segment is compressed. If it is not compressed, the whole segment is considered not compressed and compression is performed.

#### On compression dictionaries

TBD
It's possible to further reduce the size of compressed chunks if, instead of compressing them independently, first pretrain a shared compression dictionary. This is not implemented now and will potentially be addressed in the future: preliminary research didn't demonstrate significant advantage.

### Encryption

Expand Down

0 comments on commit 94cefb2

Please sign in to comment.