FEAT: Add max_records to kafka task by divyanshu-tiwari · Pull Request #67 · patterninc/caterpillar

Divyanshu Tiwari (divyanshu-tiwari) · 2026-05-27T19:30:36Z

Summary

Adds a max_records field to the kafka read task that stops the reader after N records have been forwarded downstream.
Independent from end_after (wall-clock) and retry_limit (idle-based); the three can be combined.
In group consumer mode, offsets up to the last forwarded record are committed on shutdown via the deferred c.Close().
README updated with the new field, an example, and a behavior note; new test pipeline test/pipelines/kafka_read_max_records.yaml added.

Test plan

Run test/pipelines/kafka_read_max_records.yaml against a populated topic and verify exactly 10 records are emitted before the reader stops.
Re-run the same pipeline in group-consumer mode and confirm offsets for the consumed records are committed (subsequent runs resume past them).
Run against an empty/low-traffic topic and confirm the reader still exits cleanly via end_after / retry_limit when max_records cannot be reached.
go build ./... is clean.

🤖 Generated with Claude Code

Stop the kafka reader after a fixed number of records have been forwarded downstream. Independent from end_after (wall-clock) and retry_limit (idle-based). In group mode, offsets up to the last forwarded record are committed on shutdown via deferred Close(). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Divyanshu Tiwari (divyanshu-tiwari) · 2026-05-27T19:33:11Z

Copilot resolve the merge conflicts in this pull request

Copilot

Pull request overview

Adds a max_records read-mode limit to the Kafka pipeline task so the consumer can stop cleanly after forwarding a fixed number of messages, independent of existing end_after (wall-clock) and retry_limit (idle/error-based) stop conditions.

Changes:

Introduces max_records configuration and enforces the stop condition in the Kafka consumer read loop.
Updates Kafka task README with the new field, examples, and behavior notes.
Adds a new test pipeline YAML demonstrating max_records + end_after.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File	Description
test/pipelines/kafka_read_max_records.yaml	Adds an example pipeline that stops after forwarding 10 Kafka records (with `end_after` safety net).
internal/pkg/pipeline/task/kafka/README.md	Documents `max_records`, including usage example and interaction with `end_after`/`retry_limit`.
internal/pkg/pipeline/task/kafka/kafka.go	Implements `max_records` counter and early exit after N forwarded records.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Adds `validate:"omitempty,gte=0"` so a negative max_records is rejected at config-load time rather than silently behaving as unlimited (which contradicts the documented "0 = unlimited" semantics). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Yash Shrivastava (alephys26)

This is problematic, the kafka consumer would fetch records in batch, but would not forward all of them downstream. This will lead to dropped records.
Add flush of all records here as well as in end_after part. So that we ensure at_least_once delivery.

Yash Shrivastava (alephys26)

Correction: This is valid for the kafka task. The ctx cancellation happens before a new read and the commit happens before the count increment. Both cases are valid.

The only pain point now is the failure of record processing in any downstream task. If it fails there we will drop the messages, since the commits are already there on kafka.

Divyanshu Tiwari (divyanshu-tiwari) · 2026-05-28T09:18:02Z

Correction: This is valid for the kafka task. The ctx cancellation happens before a new read and the commit happens before the count increment. Both cases are valid.

The only pain point now is the failure of record processing in any downstream task. If it fails there we will drop the messages, since the commits are already there on kafka.

It's a known issue, same as the SQS.

Copilot AI review requested due to automatic review settings May 27, 2026 19:30

Divyanshu Tiwari (divyanshu-tiwari) requested a review from a team as a code owner May 27, 2026 19:30

Copilot started reviewing on behalf of Divyanshu Tiwari (divyanshu-tiwari) May 27, 2026 19:30 View session

Copilot started work on behalf of Divyanshu Tiwari (divyanshu-tiwari) May 27, 2026 19:33 View session

Copilot AI reviewed May 27, 2026

View reviewed changes

Comment thread internal/pkg/pipeline/task/kafka/kafka.go Outdated

Copilot stopped work on behalf of Divyanshu Tiwari (divyanshu-tiwari) due to an error May 27, 2026 19:34
The session was cancelled by the user.

Divyanshu Tiwari (divyanshu-tiwari) and others added 2 commits May 28, 2026 01:14

Merge branch 'main' into feat/kafka-max-records

9a9c267

Yash Shrivastava (alephys26) approved these changes May 28, 2026

View reviewed changes

Yash Shrivastava (alephys26) requested changes May 28, 2026

View reviewed changes

Yash Shrivastava (alephys26) approved these changes May 28, 2026

View reviewed changes

Divyanshu Tiwari (divyanshu-tiwari) merged commit 922a7de into main May 28, 2026
7 checks passed

Divyanshu Tiwari (divyanshu-tiwari) deleted the feat/kafka-max-records branch May 28, 2026 09:18

Pattern Security Automation (pattern-security-automation) mentioned this pull request May 28, 2026

Review Required 2026-05-28T09:18:25.4Z #68

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FEAT: Add max_records to kafka task#67

FEAT: Add max_records to kafka task#67
Divyanshu Tiwari (divyanshu-tiwari) merged 3 commits into
mainfrom
feat/kafka-max-records

Divyanshu Tiwari (divyanshu-tiwari) commented May 27, 2026

Uh oh!

Divyanshu Tiwari (divyanshu-tiwari) commented May 27, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Yash Shrivastava (alephys26) left a comment •

edited

Loading

Uh oh!

Yash Shrivastava (alephys26) left a comment

Uh oh!

Divyanshu Tiwari (divyanshu-tiwari) commented May 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Divyanshu Tiwari (divyanshu-tiwari) commented May 27, 2026

Summary

Test plan

Uh oh!

Divyanshu Tiwari (divyanshu-tiwari) commented May 27, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Yash Shrivastava (alephys26) left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Yash Shrivastava (alephys26) left a comment

Choose a reason for hiding this comment

Uh oh!

Divyanshu Tiwari (divyanshu-tiwari) commented May 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Yash Shrivastava (alephys26) left a comment •

edited

Loading