Antalya 26.1 Backport of #96191 - Introduce async prefetch and staleness for Iceberg metadata #1575
…ache-preheat-and-staleness-for-iceberg Introduce async prefetch and staleness for Iceberg metadata
The diff has two function calls that are different from the upstream: 1.
AI audit note: This review comment was generated by AI (gpt-5.3-codex).
Audit update for PR #1575 (Antalya 26.1 backport of ClickHouse#96191: Iceberg async metadata prefetch + read staleness):
Confirmed defects
No confirmed defects in reviewed scope.
Coverage summary
arthurpassos
left a comment
LGTM, brownie points for the summary of conflicts
PR #1575 CI Verification Report

CI Results Overview

PR's New Test Validation
All 4 new
Tests validated:
Additionally, all existing Iceberg tests passed across all shards (~470 Iceberg tests in amd_binary, ~470 in amd_asan).

CI Failures

1. BuzzHouse (amd_debug) - Known Flaky Fuzzer
Server crash during random SQL fuzzing ("Lost connection to server"). BuzzHouse has a ~5.3% failure rate in upstream CI (228 failures out of 4,266 runs in the last 30 days). BuzzHouse (arm_asan) passed.
Related to PR: No - Known flaky fuzzer unrelated to Iceberg metadata changes

2. DCO (Developer Certificate of Origin)
Missing
Related to PR: Potentially - Commit signing requirement, but appears to be a branch-wide issue

3. Integration tests (amd_asan, targeted) - CI Framework Bug
Job: Integration tests (amd_asan, targeted)
Error:
Related to PR: No - CI framework bug in Praktika

4. Stateless tests (arm_asan, targeted) - CI Framework Bug
Job: Stateless tests (arm_asan, targeted)
Same
Related to PR: No - CI framework bug in Praktika

5. GrypeScan (-alpine) - CVE in Base Image
CVE-2026-2673 (High) in Alpine base image OpenSSL packages. Same failure on PR #1552.
Related to PR: No - Base image vulnerability

6. Regression Parquet (x86 + aarch64) - Pre-existing Failure
Jobs: RegressionTestsRelease / Parquet, RegressionTestsAarch64 / Parquet
Failed test:
Related to PR: No - Pre-existing Parquet subsystem issue, unrelated to Iceberg

7. Regression Swarms (x86 + aarch64) - Pre-existing Failure

8. Regression S3Export (part) (x86 + aarch64) - Timeout/Cancelled
Related to PR: No - Pre-existing S3 export stress test timeouts

Conclusion
Verdict: Ready to merge - No PR-related failures detected.
Changelog category (leave one):
Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):
iceberg_metadata_async_prefetch_period_ms at the table creation. E.g.:
iceberg_metadata_staleness_ms parameter, which would allow ClickHouse to rely on the cached version of the metadata if it's fresher than the specified staleness. Otherwise, the remote Iceberg catalog will be queried for the latest metadata in order to process the request (how it worked before). With this change, we're able to reduce calls to the Iceberg catalog to zero during request processing, which is expected to bring a visible performance gain. Example:
Similar functionality is available at:
Benchmarks:
(ClickHouse#96191 by @arsenmuk)
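For reviewers who want a mental model of the two knobs described in the changelog entry, here is a minimal Python sketch of the decision logic only. It is not the ClickHouse implementation: the `MetadataCache` class, its `fetch` and `clock` arguments, and the `remote_calls` counter are all hypothetical names made up for illustration. `refresh()` stands in for what a background task would run once per `iceberg_metadata_async_prefetch_period_ms`, and `get()` models the read-side check against `iceberg_metadata_staleness_ms`.

```python
import time
from typing import Callable, Optional


class MetadataCache:
    """Toy model of a staleness-bounded metadata cache (hypothetical, not ClickHouse code)."""

    def __init__(self, fetch: Callable[[], str],
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._fetch = fetch      # hypothetical stand-in for a remote catalog call
        self._clock = clock      # injectable clock (seconds) keeps the demo deterministic
        self._value: Optional[str] = None
        self._fetched_at: Optional[float] = None
        self.remote_calls = 0    # counts how often the "catalog" was contacted

    def refresh(self) -> None:
        """What a background prefetch task would do once per prefetch period."""
        self._value = self._fetch()
        self._fetched_at = self._clock()
        self.remote_calls += 1

    def get(self, staleness_ms: int) -> Optional[str]:
        """Serve the cached entry if it is fresher than staleness_ms, else refetch."""
        if self._fetched_at is not None:
            age_ms = (self._clock() - self._fetched_at) * 1000.0
            if age_ms < staleness_ms:
                return self._value   # fresh enough: no remote call on the read path
        self.refresh()               # too old or empty: fall back to the remote catalog
        return self._value


# Deterministic walk-through with a fake clock.
now = [0.0]
cache = MetadataCache(fetch=lambda: "metadata-v1", clock=lambda: now[0])
cache.refresh()                  # background prefetch at t = 0 s
now[0] = 5.0
cache.get(staleness_ms=10_000)   # entry is 5 s old: served from cache
now[0] = 20.0
cache.get(staleness_ms=10_000)   # entry is 20 s old: remote fetch
```

In this sketch a staleness of 0 always falls through to the remote fetch, which mirrors the "how it worked before" behaviour. When the prefetch period is shorter than the accepted staleness, reads never trigger a remote call, which is the zero-catalog-calls case the entry describes.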