ingester: add experimental support for consuming records from kafka #6929

Merged: 21 commits merged into main from dimitar/ingest/upstream-basic-reader on Dec 15, 2023

Conversation

dimitarvdimitrov (Contributor):

This PR adds experimental support to the ingester to consume write requests from a Kafka topic. The configuration is the same as the one used in #6888; it is hidden because it is very experimental. For the same reason this PR doesn't include a changelog entry.

The consumption logic follows fairly basic manual Kafka consumption (outside a consumer group), with a few twists:

  1. Unmarshalling and Push() invocation happen concurrently (see the sketch after this list). Roughly 14% of the time spent serving a write request goes into unmarshalling it, so overlapping the two speeds up consumption.
  2. The ingester uses a consumer group to persist how far it has consumed, so it picks up from there after a restart. But it doesn't consume as part of a consumer group: partition allocation is manual, based on the ingester ID in the hash ring.
  3. Committing to the consumer group happens every second to slightly speed up consumption.
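A minimal sketch of the concurrency in point 1, assuming a slice of fetched Kafka records and a hypothetical `push` function standing in for the ingester's Push() path (illustrative only, not the PR's actual code):

```go
// Decode records in one goroutine while the main goroutine pushes the decoded
// requests, so unmarshalling overlaps with Push() instead of serializing with it.
reqC := make(chan *mimirpb.WriteRequest)

go func() {
	defer close(reqC)
	for _, rec := range records {
		req := &mimirpb.WriteRequest{}
		if err := req.Unmarshal(rec.Value); err != nil {
			level.Error(logger).Log("msg", "failed to unmarshal record", "err", err)
			continue
		}
		reqC <- req
	}
}()

for req := range reqC {
	// push is a stand-in for the ingester's Push() invocation.
	if err := push(ctx, req); err != nil {
		level.Error(logger).Log("msg", "failed to push write request", "err", err)
	}
}
```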

Upcoming work:

  • consume records for different tenants in parallel
  • improve (add) error handling so that only client errors are swallowed, not server errors

Resolved review threads (details hidden): pkg/ingester/ingester.go, pkg/storage/ingest/reader.go
r.metrics.recordsPerFetch.Observe(float64(numRecords))
}

func (r *PartitionReader) newKafkaReader(at kgo.Offset) (*kgo.Client, error) {
Collaborator:

Not a blocker for this PR, but please add a TODO to your issue to review all consumer-related config options and check which should be fine-tuned. I got some good improvements from fine-tuning the producer's options in the Writer. This can be done in a follow-up PR.
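As an illustration of the kind of consumer fine-tuning this suggests, a hedged sketch using franz-go client options (the values are made up, not recommendations; `brokers`, `topic`, `partition`, and `at` are assumed to be in scope):

```go
client, err := kgo.NewClient(
	kgo.SeedBrokers(brokers...),
	kgo.ConsumePartitions(map[string]map[int32]kgo.Offset{topic: {partition: at}}),
	kgo.FetchMinBytes(1),                   // don't make brokers wait for a minimum batch size
	kgo.FetchMaxBytes(64_000_000),          // cap total bytes returned per fetch
	kgo.FetchMaxPartitionBytes(16_000_000), // cap bytes returned per partition per fetch
	kgo.FetchMaxWait(time.Second),          // upper bound on how long brokers hold a fetch open
)
if err != nil {
	return nil, err
}
```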

}

func (r *PartitionReader) start(ctx context.Context) error {
offset, err := r.fetchLastCommittedOffsetWithRetries(ctx)
Collaborator:

Any specific reason why fetchLastCommittedOffset() creates its own client, instead of moving this call after the creation of the client below and then just using it?

Contributor (author):

It is not possible to reset the offset of an existing client without first consuming from it. So we would have to do createClient(); fetchOffset(); fetchRecords(); adjustOffset(); fetchRecords(), and the first fetch would be discarded.

From the docs:

// SetOffsets sets any matching offsets in setOffsets to the given
// epoch/offset. Partitions that are not specified are not set. It is invalid
// to set topics that were not yet returned from a PollFetches: this function
// sets only partitions that were previously consumed, any extra partitions are
// skipped.

I can add this as a comment
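For context, a rough sketch of the approach being discussed, assuming franz-go's kgo/kadm packages (the function name and error handling are illustrative, not the PR's code). A short-lived client looks up the group's committed offset, and the consumer client is then created directly at that offset, so SetOffsets is never needed:

```go
func newReaderAtCommittedOffset(ctx context.Context, brokers []string, group, topic string, partition int32) (*kgo.Client, error) {
	// Short-lived client used only to look up the consumer group's committed offset.
	admConn, err := kgo.NewClient(kgo.SeedBrokers(brokers...))
	if err != nil {
		return nil, err
	}
	defer admConn.Close()

	committed, err := kadm.NewClient(admConn).FetchOffsets(ctx, group)
	if err != nil {
		return nil, err
	}

	startAt := kgo.NewOffset().AtStart() // fall back to the start if nothing was committed yet
	if o, ok := committed.Lookup(topic, partition); ok && o.At >= 0 {
		startAt = kgo.NewOffset().At(o.At)
	}

	// The consumer client starts consuming directly at the desired offset.
	return kgo.NewClient(
		kgo.SeedBrokers(brokers...),
		kgo.ConsumePartitions(map[string]map[int32]kgo.Offset{topic: {partition: startAt}}),
	)
}
```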

Comment:

I can potentially change SetOffsets to allow this; it's a pretty old function and I've removed some internal restrictions that made it more limited in the past. Not sure on the timeline of that work, though (this isn't the first need to improve SetOffsets that I've seen in the past few months).

Contributor (author):

Thanks for taking a look :) It will be useful if we can simplify the code, but it's more of a nice-to-have than a requirement at this point.

Further resolved review threads (details hidden): pkg/storage/ingest/reader.go, pkg/storage/ingest/pusher.go

go c.unmarshalRequests(ctx, records, recC)
err := c.pushRequests(ctx, recC)
if err != nil {
Collaborator:

We can't return an error here. I would not return any error from pushRequests() for now, and revisit it once we have the actual error handling logic, unless you already have that logic ready for a follow-up PR (in that case keep it as is).

Contributor (author):

Yes, I have the change ready. While I was working on it I realized that the current logic skips records.
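That follow-up error handling isn't part of this PR; purely as a hypothetical illustration of the "swallow client errors, surface server errors" idea from the description (the gRPC status mapping and the helper name are assumptions):

```go
import (
	"github.com/go-kit/log"
	"github.com/go-kit/log/level"
	"google.golang.org/grpc/codes"
	"google.golang.org/grpc/status"
)

// handlePushError is a hypothetical helper: client errors (bad data) are logged
// and swallowed so consumption keeps going; anything else is returned as a
// server error so the records can be retried.
func handlePushError(logger log.Logger, err error) error {
	if err == nil {
		return nil
	}
	if st, ok := status.FromError(err); ok {
		switch st.Code() {
		case codes.InvalidArgument, codes.FailedPrecondition, codes.AlreadyExists:
			level.Warn(logger).Log("msg", "dropping write request due to client error", "err", err)
			return nil
		}
	}
	return err
}
```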

@pracucci (Collaborator) left a review:

Very nice job! I left some minor comments, and a major one about the offset committing. Most comments are nits or non-blocking ones. Feel free to address them in a follow-up PR or skip them if you disagree. Thanks!

Resolved review threads (details hidden): pkg/storage/ingest/reader_test.go
@dimitarvdimitrov merged commit 2e49aab into main on Dec 15, 2023 (28 checks passed).
@dimitarvdimitrov deleted the dimitar/ingest/upstream-basic-reader branch on December 15, 2023 at 14:21.
}
lastOffset := int64(0)
fetches.EachPartition(func(partition kgo.FetchTopicPartition) {
lastOffset = partition.Records[len(partition.Records)-1].Offset
Collaborator:

I would add an extra check on the partition, to make sure it matches the expected one. It should never happen, but I want to make sure we don't have a bug where we've consumed another partition and then commit the wrong offset for it.
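A hedged sketch of the suggested check (`r.partitionID` and `r.logger` are assumed field names):

```go
fetches.EachPartition(func(partition kgo.FetchTopicPartition) {
	// Defensive check: this reader is assigned exactly one partition, so records
	// from any other partition indicate a bug and must not affect the committed offset.
	if partition.Partition != r.partitionID {
		level.Error(r.logger).Log("msg", "received records for unexpected partition",
			"expected", r.partitionID, "got", partition.Partition)
		return
	}
	if len(partition.Records) == 0 {
		return
	}
	lastOffset = partition.Records[len(partition.Records)-1].Offset
})
```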

}
err := collectFetchErrs(fetches)
level.Error(r.logger).Log("msg", "encountered error while fetching", "err", err)
continue
Collaborator:

What if only some fetches returned an error? Aren't we going to lose data because we're not ingesting the successful fetches?

Contributor (author):

I opened #6951 to address these comments.
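For illustration, one way to avoid dropping data from healthy partitions when only some fetches fail is to report the fetch errors without discarding the whole poll; a rough sketch (not necessarily what #6951 does, and `consume` is a hypothetical stand-in for the record handling path):

```go
// Log fetch errors, but keep processing whatever was fetched successfully:
// records from partitions that returned without error are still in `fetches`.
for _, fetchErr := range fetches.Errors() {
	level.Error(r.logger).Log("msg", "encountered error while fetching",
		"topic", fetchErr.Topic, "partition", fetchErr.Partition, "err", fetchErr.Err)
}
fetches.EachRecord(func(rec *kgo.Record) {
	consume(rec) // hand the record to the Push() path as usual
})
```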
