
backups: add prefixes to backup keys to improve S3 sharding #88418

Merged
jkartseva merged 48 commits into ClickHouse:master from jkartseva:backup-keys-shard
Nov 14, 2025
Conversation

@jkartseva (Member) commented Oct 13, 2025

Changelog category (leave one):

  • Performance Improvement

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

S3 partitions objects internally based on key-name prefixes and automatically scales to high request rates per partition. This change introduces two new BACKUP settings: data_file_name_generator and data_file_name_prefix_length. When data_file_name_generator=checksum, backup data files are named using a hash of their contents. Example: for a checksum = abcd1234ef567890abcd1234ef567890 and data_file_name_prefix_length = 3, the resulting path will be: abc/d1234ef567890abcd1234ef567890. The resulting key distribution enhances load balancing across S3 partitions and reduces the risk of throttling.
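The checksum-to-key mapping described above can be sketched in a few lines of Python (illustrative only; the actual implementation is in ClickHouse's C++ backup code, and the function name here is made up):

```python
def sharded_key(checksum: str, prefix_length: int = 3) -> str:
    """Split a hex checksum into an S3-sharding prefix and the remainder.

    Mirrors the example in the changelog entry: the first `prefix_length`
    characters become a directory-like prefix, so keys spread evenly
    across S3's internal partitions.
    """
    return f"{checksum[:prefix_length]}/{checksum[prefix_length:]}"

print(sharded_key("abcd1234ef567890abcd1234ef567890"))
# abc/d1234ef567890abcd1234ef567890
```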

Documentation entry for user-facing changes

  • Documentation is written (mandatory for new features)

Details

Addresses https://github.com/ClickHouse/clickhouse-private/issues/35866#issuecomment-3346135504

@clickhouse-gh (Contributor) commented Oct 13, 2025

Workflow [PR], commit [1517d12]

Summary:

job_name               test_name       status   info
BuzzHouse (amd_debug)  Buzzing result  failure  cidb
BuzzHouse (amd_tsan)   Buzzing result  failure  cidb

@clickhouse-gh clickhouse-gh bot added the pr-performance Pull request with some performance improvements label Oct 13, 2025
@jkartseva jkartseva marked this pull request as draft October 13, 2025 05:04
@jkartseva jkartseva marked this pull request as ready for review October 14, 2025 05:55
@vitlibar vitlibar self-assigned this Oct 16, 2025
@nikitamikhaylov (Member)

@jkartseva Let's not expose the key_prefix_template, but automatically shard by 3 symbols. It should be enough.

@nikitamikhaylov (Member)

Also, please check how restores would work with this scheme.

@jkartseva (Member, Author)

Let's not expose the key_prefix_template, but automatically shard by 3 symbols. It should be enough.

I think it should be enabled through a setting – either a query setting or a configuration setting, currently under backups.key_prefix_length.

We'll also need to account for incremental backups. If a prefix length of 3 is enabled by default, a backup chain could mix both data path patterns. I haven't fully tested this yet.
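For a sense of scale (a back-of-the-envelope check, not something stated in the PR): with hexadecimal checksums, a 3-character prefix already yields 16^3 = 4096 distinct prefixes for S3 to partition on, which is why 3 symbols "should be enough":

```python
def prefix_space(prefix_length: int, alphabet_size: int = 16) -> int:
    """Number of distinct key prefixes for hex checksums of a given prefix length."""
    return alphabet_size ** prefix_length

print(prefix_space(3))  # 4096
```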

[
    pytest.param("", id="first_file_name"),
    pytest.param("checksum"),
],
Member

I would prefer to create a separate test*.py file, enable the checksum data file name generation there, and copy some of these tests into it.

Member Author

done

@pytest.fixture(
    params=[[], ["configs/data_file_name_generator_from_checksum.xml"]],
    ids=["data_file_name_from_first_file_name", "data_file_name_from_checksum"],
    scope="module",
Member

Can you explain how this parametrization works?

Member Author

This is a parametrized fixture: pytest will run every test that depends on setup_cluster twice, once for each value in params.
[] adds no extra config file.
["configs/data_file_name_generator_from_checksum.xml"] adds one more config file that makes checksum the default source of data_file_name.

The extra configs are passed as a list, which is concatenated with the main_configs list in the setup_cluster function.
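The same mechanism in a minimal standalone sketch (file names and the combine_configs helper are illustrative, not the actual test suite's code):

```python
import pytest

def combine_configs(main_configs, extra_configs):
    """Concatenate the base config list with per-parametrization extras."""
    return main_configs + extra_configs

# Parametrized fixture: every test that depends on it runs once per entry
# in `params` (here: no extra config vs. one checksum-naming config),
# and `ids` label the two runs in pytest's output.
@pytest.fixture(
    params=[[], ["data_file_name_generator_from_checksum.xml"]],
    ids=["from_first_file_name", "from_checksum"],
    scope="module",
)
def cluster_configs(request):
    return combine_configs(["backups_disk.xml"], request.param)

def test_cluster_configs(cluster_configs):
    # In the real suite this would start a cluster with these configs;
    # here we only check the combined list's shape.
    assert cluster_configs[0] == "backups_disk.xml"
```

With module scope, the fixture is built once per parametrization for the whole module rather than once per test.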

node1.query("SYSTEM SYNC REPLICA ON CLUSTER 'cluster' tbl")

backup_name = new_backup_name()
backup_settings = {"data_file_name_generator": name_gen} if name_gen else None
Member

We could randomly enable this feature in some tests; that way, if something is broken, it will surface as flaky tests.

Member Author

I don't know how to do this in the integration tests. I can think of fault injection in the C++ code, such as randomly adding a data_file_name_generator=checksum setting when a backup is created, but that may not be what we want.
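As a purely illustrative alternative at the test level (not something this PR implements, and the helper name is made up), the setting could be chosen at random per run, with a seed for reproducibility:

```python
import random

def random_backup_settings(seed=None):
    """Illustrative: enable checksum-based naming in roughly half of runs."""
    rng = random.Random(seed)
    if rng.random() < 0.5:
        return {"data_file_name_generator": "checksum"}
    return None  # fall back to the default first_file_name scheme
```

Logging the seed per CI run would keep any given run reproducible while still covering both code paths over time.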

@vitlibar (Member) commented Nov 5, 2025

If I understood correctly, we need this feature mostly for S3; we are not sure whether we need it for Azure or for backups on local disks. So it makes sense to test the feature broadly for S3 and only lightly elsewhere: keep your parametrized test approach for S3, and maybe have just one test for Azure and one for backups on local disks.

@jkartseva jkartseva enabled auto-merge November 14, 2025 07:40
@jkartseva jkartseva added this pull request to the merge queue Nov 14, 2025
Merged via the queue into ClickHouse:master with commit 7196ad8 Nov 14, 2025
248 of 256 checks passed
@jkartseva jkartseva deleted the backup-keys-shard branch November 14, 2025 21:36
@robot-ch-test-poll robot-ch-test-poll added the pr-synced-to-cloud The PR is synced to the cloud repo label Nov 14, 2025

Labels

pr-performance Pull request with some performance improvements pr-synced-to-cloud The PR is synced to the cloud repo

5 participants