Skip to content

[E2E] group_mode_transactions_test fail to copy all messages #788

@mooc9988

Description

@mooc9988

error stack:

Traceback (most recent call last):
  File "/usr/local/lib/python3.9/dist-packages/ducktape/tests/runner_client.py", line 186, in _do_run
    data = self.run_test()
  File "/usr/local/lib/python3.9/dist-packages/ducktape/tests/runner_client.py", line 246, in run_test
    return self.test_context.function(self.test)
  File "/usr/local/lib/python3.9/dist-packages/ducktape/mark/_mark.py", line 433, in wrapper
    return functools.partial(f, *args, **kwargs)(*w_args, **w_kwargs)
  File "/opt/kafka-dev/tests/kafkatest/tests/core/group_mode_transactions_test.py", line 287, in test_transactions
    concurrently_consumed_message_by_partition = self.copy_messages_transactionally(
  File "/opt/kafka-dev/tests/kafkatest/tests/core/group_mode_transactions_test.py", line 246, in copy_messages_transactionally
    wait_until(lambda: copier.is_done,
  File "/usr/local/lib/python3.9/dist-packages/ducktape/utils/util.py", line 58, in wait_until
    raise TimeoutError(err_msg() if callable(err_msg) else err_msg) from last_exception
ducktape.errors.TimeoutError: copier-2 - Failed to copy all messages in 240s.

found exception in copier:

[2024-02-04 19:36:34,851] ERROR Shutting down after unexpected error in event loop (org.apache.kafka.tools.TransactionalMessageCopier)
java.lang.IllegalStateException: TransactionalId copier-2: Invalid transition attempted from state READY to state ABORTING_TRANSACTION
	at org.apache.kafka.clients.producer.internals.TransactionManager.transitionTo(TransactionManager.java:974)
	at org.apache.kafka.clients.producer.internals.TransactionManager.transitionTo(TransactionManager.java:967)
	at org.apache.kafka.clients.producer.internals.TransactionManager.lambda$beginAbort$3(TransactionManager.java:269)
	at org.apache.kafka.clients.producer.internals.TransactionManager.handleCachedTransactionRequestResult(TransactionManager.java:1116)
	at org.apache.kafka.clients.producer.internals.TransactionManager.beginAbort(TransactionManager.java:266)
	at org.apache.kafka.clients.producer.KafkaProducer.abortTransaction(KafkaProducer.java:835)
	at org.apache.kafka.tools.TransactionalMessageCopier.abortTransactionAndResetPosition(TransactionalMessageCopier.java:285)
	at org.apache.kafka.tools.TransactionalMessageCopier.runEventLoop(TransactionalMessageCopier.java:401)
	at org.apache.kafka.tools.TransactionalMessageCopier.main(TransactionalMessageCopier.java:292)

The failure is caused by timeout of CommitTransactionRequest (1 minute). The corresponding broker was down for more than 1 minute.

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions