Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Flaky test_system_clusters_actual_information #55168

Closed
qoega opened this issue Sep 29, 2023 · 6 comments · Fixed by #61395
Closed

Flaky test_system_clusters_actual_information #55168

qoega opened this issue Sep 29, 2023 · 6 comments · Fixed by #61395
Assignees
Labels
testing Special issue with list of bugs found by CI

Comments

@qoega
Copy link
Member

qoega commented Sep 29, 2023

https://s3.amazonaws.com/clickhouse-test-reports/0/332b3e644b8df1356da413b5429f154c7b564893/integration_tests__tsan__[4_6].html

>       error = node.query_and_get_error(
            "SELECT count() FROM distributed SETTINGS receive_timeout=1, handshake_timeout_ms=1"
        )

test_system_clusters_actual_information/test.py:43: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
helpers/cluster.py:3423: in query_and_get_error
    return self.client.query_and_get_error(
helpers/client.py:36: in wrap
    return func(self, *args, **kwargs)
helpers/client.py:135: in query_and_get_error
    ).get_error()
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 

self = <helpers.client.CommandRequest object at 0x7ff387776f20>

    def get_error(self):
        self.process.wait(timeout=DEFAULT_QUERY_TIMEOUT)
        self.stdout_file.seek(0)
        self.stderr_file.seek(0)
    
        stdout = self.stdout_file.read().decode("utf-8", errors="replace")
        stderr = self.stderr_file.read().decode("utf-8", errors="replace")
    
        if (
            self.timer is not None
            and not self.process_finished_before_timeout
            and not self.ignore_error
        ):
            raise QueryTimeoutExceedException("Client timed out!")
    
        if self.process.returncode == 0:
>           raise QueryRuntimeException(
                "Client expected to be failed but succeeded! stdout: {}".format(stdout),
                self.process.returncode,
                stderr,
            )
E           helpers.client.QueryRuntimeException: Client expected to be failed but succeeded! stdout: 0
@azat
Copy link
Collaborator

azat commented Nov 22, 2023

https://s3.amazonaws.com/clickhouse-test-reports/56331/cc5179078c3c01bc797732edfabf852befcfaf2f/integration_tests__tsan__[4_6].html

The problem is that pause API does not always work:

Stderr:ERROR: for roottestsystemclustersactualinformation_node_1_1  Cannot pause container 2b598988cb909536c4177c915d2560baaada84d3f15627ef7203c6b56df1c903: OCI runtime pause failed: unable to freeze: unknown

@azat
Copy link
Collaborator

azat commented Nov 24, 2023

The problem is that pause API does not always work:

It could worked around with iptables/PartitionManager I guess

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
testing Special issue with list of bugs found by CI
Projects
None yet
5 participants