Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Flaky test_replicated_merge_tree_s3_zero_copy/test.py::test_drop_table #57651

Open
vdimir opened this issue Dec 8, 2023 · 2 comments
Open

Flaky test_replicated_merge_tree_s3_zero_copy/test.py::test_drop_table #57651

vdimir opened this issue Dec 8, 2023 · 2 comments
Assignees
Labels
experimental feature Bug in the feature that should not be used in production flaky test flaky test found by CI testing Special issue with list of bugs found by CI

Comments

@vdimir
Copy link
Member

vdimir commented Dec 8, 2023

Fails with

>       node.query_with_retry(
            "system sync replica test_drop_table",
            settings={"receive_timeout": 5},
            sleep_time=5,
            retry_count=10,
        )
...
>       raise Exception("Can't execute query {}".format(sql))
E       Exception: Can't execute query system sync replica test_drop_table
2023.12.07 19:39:32.734844 [ 8 ] {8c001075-0d48-4130-95af-ee4f332eb894} <Trace> default.test_drop_table (28da86af-40f3-402b-b48f-c2089d74531f): Starting up table
2023.12.07 19:39:32.737282 [ 366 ] {} <Debug> default.test_drop_table (28da86af-40f3-402b-b48f-c2089d74531f): Uncovered unexpected parts: 15_0_0_0, 11_0_0_0, 0_0_0_0. Restorable une
xpected parts: . Missing parts: . Covered unexpected parts: . Expected parts: .
2023.12.07 19:39:32.737498 [ 366 ] {} <Error> default.test_drop_table (ReplicatedMergeTreeAttachThread): Initialization failed, table will remain readonly. Error: Code: 231. DB::Exc
eption: The local set of parts of table default.test_drop_table (28da86af-40f3-402b-b48f-c2089d74531f) doesn't look like the set of parts in ZooKeeper: 31.00 rows of 31.00 total row
s in filesystem are suspicious. There are 3 uncovered unexpected parts with 31 rows (0 of them is not just-written with 0 rows), 0 missing parts (with 0 blocks), 0 covered unexpecte
d parts (with 0 rows). (TOO_MANY_UNEXPECTED_DATA_PARTS), Stack trace (when copying this message, always include the lines below):
...
29387:2023.12.07 19:40:18.112336 [ 8 ] {6f6d82e0-d53e-4ada-941f-72daddf4569f} <Error> executeQuery: Code: 242. DB::Exception: Table is in readonly mode (replica path: /test/drop_table/replicas/1). (TABLE_IS_READ_ONLY) (version 23.11.1.2703 (official build)) (from 172.16.11.1:61814) (in query: system sync replica test_drop_table), Stack trace (when copying this message, always include the lines below):

ci db

@Algunenano
Copy link
Member

Looking at the error it seems that something like this happens:

  • Replica1 detaches table.
  • Replica2 drops table. Since there isn't references to the blobs/nodes any more it deletes them (ZK and s3)
  • Replica1 attaches table. The parts that were dropped by replica2 are no longer in ZK. TOO_MANY_UNEXPECTED_DATA_PARTS is thrown, table is kept in read-only, operations fail.

@alexey-milovidov alexey-milovidov added the experimental feature Bug in the feature that should not be used in production label Dec 28, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
experimental feature Bug in the feature that should not be used in production flaky test flaky test found by CI testing Special issue with list of bugs found by CI
Projects
None yet
Development

No branches or pull requests

4 participants