Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CI Failure (fail to install) in RandomNodeOperationsTest.test_node_operations #18337

Closed
vbotbuildovich opened this issue May 9, 2024 · 8 comments
Labels
auto-triaged used to know which issues have been opened from a CI job ci-failure ci-rca/infra CI Root Cause Analysis - Infrastructure Issue

Comments

@vbotbuildovich
Copy link
Collaborator

vbotbuildovich commented May 9, 2024

https://buildkite.com/redpanda/vtools/builds/13607
https://buildkite.com/redpanda/vtools/builds/13607

Module: rptest.tests.random_node_operations_test
Class: RandomNodeOperationsTest
Method: test_node_operations
Arguments: {
    "num_to_upgrade": 3,
    "enable_failures": false,
    "with_tiered_storage": false
}
test_id:    RandomNodeOperationsTest.test_node_operations
status:     FAIL
run time:   57.132 seconds

RemoteCommandError({'ssh_config': {'host': 'ip-172-31-11-60', 'hostname': '172.31.11.60', 'user': 'root', 'port': 22, 'password': None, 'identityfile': '/home/ubuntu/.ssh/id_rsa'}, 'hostname': 'ip-172-31-11-60', 'ssh_hostname': '172.31.11.60', 'user': 'root', 'externally_routable_ip': '34.222.249.118', '_logger': <Logger rptest.tests.random_node_operations_test.RandomNodeOperationsTest.test_node_operations.enable_failures=False.num_to_upgrade=3.with_tiered_storage=False-261 (DEBUG)>, 'os': 'linux', '_ssh_client': <paramiko.client.SSHClient object at 0x7f7440fdcaf0>, '_sftp_client': <paramiko.sftp_client.SFTPClient object at 0x7f7440ec7fd0>, '_custom_ssh_exception_checks': None}, 'curl -fsSL https://vectorized-public.s3.us-west-2.amazonaws.com/releases/redpanda/24.1.1/redpanda-24.1.1-amd64.tar.gz --create-dir -o /opt/redpanda_installs/v24.1.1/redpanda.tar.gz && gunzip -c /opt/redpanda_installs/v24.1.1/redpanda.tar.gz | tar -xf - -C /opt/redpanda_installs/v24.1.1 && rm /opt/redpanda_installs/v24.1.1/redpanda.tar.gz', 35, b'')
Traceback (most recent call last):
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/tests/runner_client.py", line 184, in _do_run
    data = self.run_test()
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/tests/runner_client.py", line 276, in run_test
    return self.test_context.function(self.test)
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/mark/_mark.py", line 535, in wrapper
    return functools.partial(f, *args, **kwargs)(*w_args, **w_kwargs)
  File "/home/ubuntu/redpanda/tests/rptest/services/cluster.py", line 103, in wrapped
    r = f(self, *args, **kwargs)
  File "/home/ubuntu/redpanda/tests/rptest/tests/random_node_operations_test.py", line 319, in test_node_operations
    self._start_redpanda(num_to_upgrade,
  File "/home/ubuntu/redpanda/tests/rptest/tests/random_node_operations_test.py", line 147, in _start_redpanda
    installer.install(
  File "/home/ubuntu/redpanda/tests/rptest/services/redpanda_installer.py", line 609, in install
    self._install_unlocked(nodes, install_target)
  File "/home/ubuntu/redpanda/tests/rptest/services/redpanda_installer.py", line 658, in _install_unlocked
    raise e
  File "/home/ubuntu/redpanda/tests/rptest/services/redpanda_installer.py", line 638, in _install_unlocked
    self.wait_for_async_ssh(self._redpanda.logger,
  File "/home/ubuntu/redpanda/tests/rptest/services/redpanda_installer.py", line 165, in wait_for_async_ssh
    for l in ssh_out_per_node[node]:
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/cluster/remoteaccount.py", line 687, in next
    return next(self.iter_obj)
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/cluster/remoteaccount.py", line 354, in output_generator
    raise RemoteCommandError(self, cmd, exit_status, stderr.read())
ducktape.cluster.remoteaccount.RemoteCommandError: root@ip-172-31-11-60: Command 'curl -fsSL https://vectorized-public.s3.us-west-2.amazonaws.com/releases/redpanda/24.1.1/redpanda-24.1.1-amd64.tar.gz --create-dir -o /opt/redpanda_installs/v24.1.1/redpanda.tar.gz && gunzip -c /opt/redpanda_installs/v24.1.1/redpanda.tar.gz | tar -xf - -C /opt/redpanda_installs/v24.1.1 && rm /opt/redpanda_installs/v24.1.1/redpanda.tar.gz' returned non-zero exit status 35.

JIRA Link: CORE-2852

@vbotbuildovich vbotbuildovich added auto-triaged used to know which issues have been opened from a CI job ci-failure labels May 9, 2024
@piyushredpanda piyushredpanda added the ci-rca/infra CI Root Cause Analysis - Infrastructure Issue label May 10, 2024
@vbotbuildovich
Copy link
Collaborator Author

@travisdowns
Copy link
Member

Looks like fail to install from s3 again.

@travisdowns travisdowns changed the title CI Failure (key symptom) in RandomNodeOperationsTest.test_node_operations CI Failure (fail to install) in RandomNodeOperationsTest.test_node_operations Jun 23, 2024
@travisdowns
Copy link
Member

I suspect these are duplicates of #18607.
We are using curl here rather than requests, but the endpoint is the same and curl error
35 is an SSL-related error just like we got in Python.

{"duplicate": "https://github.com/redpanda-data/redpanda/issues/18607"}

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
auto-triaged used to know which issues have been opened from a CI job ci-failure ci-rca/infra CI Root Cause Analysis - Infrastructure Issue
Projects
None yet
Development

No branches or pull requests

3 participants