Split files into smaller chunks when replicating snapshots #12795

deepthidevaki · 2023-05-17T09:17:19Z

Description

A snapshot consists of multiple files. When replicating a snapshot, we put one file into a single request. This can cause issues because it takes longer to send the request and can hit the configured timeout. In high latency networks this can happen more frequent. We have configured max file sizes for rocksdb. Even then the files can be upto 64MB.

A better way to handle it is to split each files to smaller chunks, and send one small chunk at a time. Smaller packets are faster to send. Besides, this also lowers memory footprint, as we don't have to load the whole file into memory. This might also help to relax the limit on rocksdb sst file size.

relates to https://jira.camunda.com/browse/SUPPORT-16901

megglos · 2023-05-25T12:43:40Z

ZDP-Triage:

relates to Allow configuring request timeout for InstallRequest #12793 in being a more long term solution => reduce size of chunks tranferred to not hit timeouts on slow networks

megglos · 2023-06-09T09:35:25Z

ZDP-Planning:

already mitigated by Allow configuring request timeout for InstallRequest #12793
Improve raft snapshot replication failure handling #11496 is more valuable to do

EuroLew · 2024-06-11T23:04:29Z

Notes:

The implementation of this will come in two parts the sender PR and the receiver PR.

The sender:
A ChunkedFileBasedSnapshotChunkReader will be created an will replace the current FileBasedSnapshotChunkReader however the chunk size designated will be Integer.MAX_VALUE so therefore no functional change should occur each chunk will still contain the full file. In addition the chunkName has been changed, previously this was the file name however since there could be multiple chunks per file the chunkName will now be appended with the file part e.g. file-1, file-2,file-3. As a result a fileName field has been added to the snapshot chunk implementation used instead of relying on string splits to extract (There is a fair amount of changes required to add the fileName filed maybe it would be simpler to rely on string split but this locks in the chunk naming scheme so a specific format so not ideal)

The receiver:

This involves adding support for chunks which only contain parts of files, currently correctness checks occur where file existence is taken as a failure however with chunked file snapshots this no longer holds these checks will be removed or refactored. In addition there is the question of checksums, before each chunk was a file and the file snapshot was provided. Currently the new implementation would involve snapshots of the file chunks. I think this is a fine solution, if we know all the file parts are not corrupted the full file should be correct? Open to thoughts on this. Lastly there will need to be changes to how files are written per chunk as now a chunk could represent not just a new file to create and write but also appending content to a file.

deepthidevaki · 2024-06-12T08:24:46Z

Have you also looked into how to make these changes with out breaking rolling update?

EuroLew · 2024-06-12T11:03:07Z

Have you also looked into how to make these changes with out breaking rolling update?

Yes changing chunkName to be a chunk identifier and adding a fileName field will break rolling update. Instead keep chunkName to be the fileName (not a very apt name this can be changed in a later version). and add a field called chunkIdentifier? Which has format file_name-<file_part> for use during debugging (Do you think this field is even needed?)

## Description This will result in split file snapshot chunks being sent to a broker which supports it. Changes: - Install Response, added a `preferredChunKSize` field which defaults to 0 for old versions. - Changed `FileBasedSnapshotChunkReader.next()` to instead read files chunk by chunk. - Added `chunkSize` field to the `FileBasedSnapshotChunkReader`. - Changed `FileBasedSnapshotChunkReader.nextId()` from a `<file-name>` to `<file name>-<file-part>` format e.g. file-1, file-2. - Added `fileBlockPosition` and `totalFileSize` field to `SnapshotChunk` in order to aid in processing on the receiver side as it is useful to know the current position in the file for next expected chunk checks and total file size for any completion actions when all blocks have been received for example file flushing. (Can be calculated with data.length, position and file size if it is the last chunk for a given file) ## Related issues Sender side of #12795

## Description Add config support for chunk size. closes #12795

## Description Built on the changes in #19361 as new chunk fields are needed. - [ ] Removed file existence checks as they will fail for chunked files. - [ ] Now writes file using the chunk position so order of chunks doesn't matter (for writing files will still fail checksums) - [ ] Now support partially building metadata if file is chunked. - [ ] Now enforces order of chunks using `chunkId` in `PassiveRole` ## Related issues closes #12795

deepthidevaki added kind/toil Categorizes an issue or PR as general maintenance, i.e. cleanup, refactoring, etc. area/performance Marks an issue as performance related area/resilience component/raft labels May 17, 2023

deepthidevaki mentioned this issue May 17, 2023

Allow configuring request timeout for InstallRequest #12793

Closed

megglos added the support Marks an issue as related to a customer support request label May 25, 2023

megglos mentioned this issue Jun 9, 2023

Improve raft snapshot replication failure handling #11496

Closed

romansmirnov added the component/zeebe Related to the Zeebe component/team label Mar 5, 2024

npepinpe assigned EuroLew Jun 3, 2024

This was referenced Jun 17, 2024

transfer snapshot files in chunks #19361

Merged

Enable receiver to parse chunked file snapshot chunks #19490

Merged

EuroLew mentioned this issue Jun 20, 2024

Snapshot chunk size config #19588

Merged

EuroLew added a commit that referenced this issue Jun 20, 2024

Snapshot chunk size config (#19588)

6b85129

## Description Add config support for chunk size. closes #12795

EuroLew added a commit that referenced this issue Jun 24, 2024

feat: Snapshot chunk size config (#19588)

0834636

## Description Add config support for chunk size. closes #12795

EuroLew closed this as completed in #19490 Jun 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Split files into smaller chunks when replicating snapshots #12795

Split files into smaller chunks when replicating snapshots #12795

deepthidevaki commented May 17, 2023 •

edited by megglos

Loading

megglos commented May 25, 2023

megglos commented Jun 9, 2023

EuroLew commented Jun 11, 2024

deepthidevaki commented Jun 12, 2024

EuroLew commented Jun 12, 2024

Split files into smaller chunks when replicating snapshots #12795

Split files into smaller chunks when replicating snapshots #12795

Comments

deepthidevaki commented May 17, 2023 • edited by megglos Loading

megglos commented May 25, 2023

megglos commented Jun 9, 2023

EuroLew commented Jun 11, 2024

deepthidevaki commented Jun 12, 2024

EuroLew commented Jun 12, 2024

deepthidevaki commented May 17, 2023 •

edited by megglos

Loading