CI Failure (ErrorCommiting transaction) in TxUpgradeTest.upgrade_does_not_change_tx_coordinator_assignment_test #18184

Open
vbotbuildovich opened this issue Apr 30, 2024 · 10 comments · Fixed by #20743
Labels: area/transactions; auto-triaged (used to know which issues have been opened from a CI job); ci-failure; ci-rca/test (CI Root Cause Analysis - Test Issue)

Comments

vbotbuildovich (Collaborator) commented Apr 30, 2024

https://buildkite.com/redpanda/redpanda/builds/48458
https://buildkite.com/redpanda/redpanda/builds/48467
https://buildkite.com/redpanda/redpanda/builds/48475
https://buildkite.com/redpanda/vtools/builds/13348
https://buildkite.com/redpanda/redpanda/builds/48482
https://buildkite.com/redpanda/vtools/builds/13346

Module: rptest.tests.transactions_test
Class: TxUpgradeTest
Method: upgrade_does_not_change_tx_coordinator_assignment_test
test_id:    TxUpgradeTest.upgrade_does_not_change_tx_coordinator_assignment_test
status:     FAIL
run time:   152.840 seconds

KafkaException(KafkaError{FATAL,code=INVALID_TXN_STATE,val=48,str="Failed to add partitions to transaction: Broker: Producer attempted a transactional operation in an invalid state"})
Traceback (most recent call last):
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/tests/runner_client.py", line 184, in _do_run
    data = self.run_test()
  File "/opt/.ducktape-venv/lib/python3.10/site-packages/ducktape/tests/runner_client.py", line 276, in run_test
    return self.test_context.function(self.test)
  File "/root/tests/rptest/services/cluster.py", line 103, in wrapped
    r = f(self, *args, **kwargs)
  File "/root/tests/rptest/tests/transactions_test.py", line 1346, in upgrade_does_not_change_tx_coordinator_assignment_test
    self._populate_tx_coordinator(topic.name)
  File "/root/tests/rptest/tests/transactions_test.py", line 1310, in _populate_tx_coordinator
    producer.commit_transaction()
cimpl.KafkaException: KafkaError{FATAL,code=INVALID_TXN_STATE,val=48,str="Failed to add partitions to transaction: Broker: Producer attempted a transactional operation in an invalid state"}

JIRA Link: CORE-2733

vbotbuildovich added the auto-triaged and ci-failure labels Apr 30, 2024

piyushredpanda (Contributor) commented

Presumed to be fixed by #18187

piyushredpanda added the ci-rca/redpanda (CI Root Cause Analysis - Redpanda Issue) label May 8, 2024
mmaslankaprv self-assigned this Jun 28, 2024
mmaslankaprv changed the title from "CI Failure (key symptom) in TxUpgradeTest.upgrade_does_not_change_tx_coordinator_assignment_test" to "CI Failure (ErrorCommiting transaction) in TxUpgradeTest.upgrade_does_not_change_tx_coordinator_assignment_test" Jun 28, 2024
mmaslankaprv added the ci-rca/test (CI Root Cause Analysis - Test Issue) label and removed the ci-rca/redpanda (CI Root Cause Analysis - Redpanda Issue) label Jun 28, 2024
mmaslankaprv added a commit to mmaslankaprv/redpanda that referenced this issue Jun 28, 2024
Most of our transaction tests were built from the same code, which was
often copy-pasted. The tests set the default transaction timeout to
10 seconds, probably because they were based on the test that validated
transaction expiration. Updated the tests so that the timeout is set
only in the tests that need it.

The timeout caused some tests to fail intermittently.

Fixes: redpanda-data#18184

Signed-off-by: Michał Maślanka <michal@redpanda.com>
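
A minimal sketch of the idea in the commit message above: the short transaction timeout is set only by tests that exercise expiration, and every other test uses the client default. The helper `producer_config`, the broker address, and the transactional IDs are hypothetical and are not taken from the Redpanda test suite. `bootstrap.servers`, `transactional.id`, and `transaction.timeout.ms` are standard librdkafka configuration properties.

```python
from typing import Optional


def producer_config(bootstrap_servers: str,
                    transactional_id: str,
                    txn_timeout_ms: Optional[int] = None) -> dict:
    """Build a transactional producer config (hypothetical helper).

    The transaction timeout is set only when a test asks for it;
    otherwise the key is omitted and the client default applies
    (60000 ms in librdkafka).
    """
    conf = {
        "bootstrap.servers": bootstrap_servers,
        "transactional.id": transactional_id,
    }
    if txn_timeout_ms is not None:
        conf["transaction.timeout.ms"] = txn_timeout_ms
    return conf


# A test that does not care about expiration leaves the timeout unset.
upgrade_conf = producer_config("localhost:9092", "upgrade-test-tx")

# Only an expiration test opts in to the short 10-second timeout.
expiration_conf = producer_config("localhost:9092", "expire-test-tx",
                                  txn_timeout_ms=10_000)

# Either dict could then be passed to confluent_kafka.Producer(conf).
```

Making the timeout an explicit opt-in keeps a slow CI run from pushing an unrelated test past a 10-second transaction deadline.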
vbotbuildovich pushed a commit to vbotbuildovich/redpanda that referenced this issue Jun 28, 2024
(cherry picked from commit 797ac33)
vbotbuildovich pushed a commit to vbotbuildovich/redpanda that referenced this issue Jun 28, 2024
(cherry picked from commit 797ac33)
vbotbuildovich reopened this Jul 4, 2024

4 participants