
storage - Cannot continue parsing. recived size:0 bytes, expected:582646 bytes. context:parser::skip_batch #3305

Closed
VadimPlh opened this issue Dec 17, 2021 · 1 comment
Labels: area/cloud-storage (Shadow indexing subsystem), kind/bug (Something isn't working)

@VadimPlh (Contributor):

Yesterday I tested truncation of a segment in gc:

  • Produce data
  • Truncate segment
  • Try to consume

When I opened the redpanda log today, I saw a lot of errors like:
storage - Cannot continue parsing. recived size:0 bytes, expected:582646 bytes. context:parser::skip_batch
After deleting the topic and all its segments from gc, it still produces this error (the fetch session count for the current segment is 0).

2021-12-17T10:29:27.254608897Z stderr F DEBUG 2021-12-17 10:29:27,252 [shard 13] cloud_storage - [fiber481 kafka/delete_first_segment/0] - remote_partition.cc:215 - maybe_reset_reader called
2021-12-17T10:29:27.254600393Z stderr F ERROR 2021-12-17 10:29:27,252 [shard 13] storage - Cannot continue parsing. recived size:0 bytes, expected:582646 bytes. context:parser::skip_batch
2021-12-17T10:29:27.254596679Z stderr F DEBUG 2021-12-17 10:29:27,252 [shard 13] cloud_storage - [fiber448~1~12 kafka/delete_first_segment/0] - remote_segment.cc:404 - skip_batch_start called for 35353
2021-12-17T10:29:27.254592821Z stderr F DEBUG 2021-12-17 10:29:27,252 [shard 13] cloud_storage - [fiber448~1~12 kafka/delete_first_segment/0] - remote_segment.cc:357 - accept_batch_start skip because last_kafka_offset 36333 (last_rp_offset: 36334) < config.start_offset: 39000
2021-12-17T10:29:27.254588814Z stderr F DEBUG 2021-12-17 10:29:27,252 [shard 13] cloud_storage - [fiber481 kafka/delete_first_segment/0] - remote_partition.cc:122 - Invoking 'read_some' on current log reader {start_offset:{39000}, max_offset:{9223372036854775807}, min_bytes:0, max_bytes:1048576, type_filter:batch_type::raft_data, first_timestamp:nullopt}
2021-12-17T10:29:27.254584542Z stderr F DEBUG 2021-12-17 10:29:27,252 [shard 13] cloud_storage - [fiber481 kafka/delete_first_segment/0] - remote_partition.cc:268 - maybe_reset_stream completed true false
2021-12-17T10:29:27.254579206Z stderr F DEBUG 2021-12-17 10:29:27,252 [shard 13] cloud_storage - [fiber481 kafka/delete_first_segment/0] - remote_partition.cc:236 - maybe_reset_reader, config start_offset: 39000, reader max_offset: 46155
2021-12-17T10:29:27.254574958Z stderr F DEBUG 2021-12-17 10:29:27,252 [shard 13] cloud_storage - [fiber481 kafka/delete_first_segment/0] - remote_partition.cc:215 - maybe_reset_reader called
2021-12-17T10:29:27.254570619Z stderr F ERROR 2021-12-17 10:29:27,252 [shard 13] storage - Cannot continue parsing. recived size:0 bytes, expected:582646 bytes. context:parser::skip_batch

It still tries to parse the segment inside cloud_storage.
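
For context, the ERROR line means the parser asked the underlying stream to skip over a batch body of 582646 bytes but got 0 bytes back, i.e. the stream ended early, which is consistent with a truncated segment. The following is a hypothetical, self-contained sketch of that condition only; all names and types below are illustrative and are not redpanda's actual parser code:

```cpp
// Hypothetical sketch of the condition behind the error
// "Cannot continue parsing. recived size:0 bytes, expected:N bytes".
// All names here are illustrative; this is not redpanda's parser code.
#include <cstddef>
#include <cstdio>
#include <stdexcept>
#include <string>
#include <vector>

struct input_stream {
    std::vector<char> data;
    std::size_t pos = 0;
    // Skips up to n bytes; returns fewer (possibly 0) at end of stream.
    std::size_t skip(std::size_t n) {
        std::size_t available = data.size() - pos;
        std::size_t skipped = n < available ? n : available;
        pos += skipped;
        return skipped;
    }
};

// Skip over a batch body of `expected` bytes. If the stream ends early
// (e.g. because the segment was truncated), report the mismatch instead
// of pretending the batch was consumed.
void skip_batch(input_stream& in, std::size_t expected) {
    std::size_t received = in.skip(expected);
    if (received < expected) {
        throw std::runtime_error(
          "Cannot continue parsing. received size:" + std::to_string(received)
          + " bytes, expected:" + std::to_string(expected)
          + " bytes. context:parser::skip_batch");
    }
}

int main() {
    input_stream truncated{}; // empty stream: the batch body is gone
    try {
        skip_batch(truncated, 582646);
    } catch (const std::exception& e) {
        // Prints the same shape of message as the ERROR lines above.
        std::fprintf(stderr, "%s\n", e.what());
    }
}
```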

@VadimPlh VadimPlh added kind/bug Something isn't working area/cloud-storage Shadow indexing subsystem labels Dec 17, 2021
Lazin added a commit to Lazin/redpanda that referenced this issue Dec 17, 2021
The partition_record_batch_reader_impl component is not stopping when
the underlying remote_partition is stopped. This manifested in the
following failure scenario: the reader got stuck in an infinite loop;
then the remote_partition was stopped, but the loop did not exit. It
continued to consume CPU even after the entire topic was deleted.

This commit fixes this by checking the abort_source inside the
remote_partition. Fixes redpanda-data#3305
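
For readers following along, here is a minimal, self-contained sketch of the pattern the commit message describes: the reader checks an abort flag before pulling more data, so stopping the partition also stops the reader's loop. The abort_source, reader, and batch types below are simplified stand-ins (the real code uses seastar's abort_source, futures, and redpanda's cloud_storage types), not the actual implementation:

```cpp
// Minimal sketch of "check the abort_source inside the read path".
// All types here are simplified stand-ins for illustration only.
#include <atomic>
#include <cstdio>
#include <optional>

struct abort_source {
    std::atomic<bool> aborted{false};
    void request_abort() { aborted.store(true); }
    bool abort_requested() const { return aborted.load(); }
};

struct batch {
    int offset;
};

// Stand-in for partition_record_batch_reader_impl: keeps producing
// batches until the source is exhausted or the owning partition stops.
struct reader {
    abort_source& as;
    int next_offset = 0;

    std::optional<batch> read_some() {
        // Without this check the loop could keep retrying forever after
        // the remote_partition was stopped, which is the behaviour
        // reported in this issue (CPU burned even after topic deletion).
        if (as.abort_requested()) {
            return std::nullopt; // partition stopped; bail out cleanly
        }
        return batch{next_offset++};
    }
};

int main() {
    abort_source as;
    reader r{as};
    for (int i = 0; i < 5; ++i) {
        if (i == 3) {
            as.request_abort(); // simulates remote_partition::stop()
        }
        auto b = r.read_some();
        if (!b) {
            std::puts("reader observed abort, stopping");
            break;
        }
        std::printf("read batch at offset %d\n", b->offset);
    }
}
```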
@dswang dswang unassigned ztlpn and LenaAn Dec 21, 2021
@Lazin (Contributor) commented Dec 22, 2021:

fixed by #3280 and #3293

@Lazin Lazin closed this as completed Dec 22, 2021
Lazin added a commit to Lazin/redpanda that referenced this issue Dec 24, 2021
The partition_record_batch_reader_impl component is not stopping when
the underlying remote_partition is stopped. This manifested in the
following failure scenario: the reader got stuck in an infinite loop;
then the remote_partition was stopped, but the loop did not exit. It
continued to consume CPU even after the entire topic was deleted.

This commit fixes this by checking the abort_source inside the
remote_partition. Fixes redpanda-data#3305

(cherry picked from commit c6ad84d)