
S3 log support #32

Merged
mheffner merged 16 commits into main from s3-log-support on Mar 5, 2026

Conversation

@mheffner (Contributor) commented Feb 20, 2026

This introduces the framework for supporting S3 event notifications. S3 event notifications will invoke the Lambda function when an AWS service writes new logs to an S3 bucket (the S3 object creation event). While this PR mostly lays the groundwork for future support of S3 log-based AWS services, it currently supports CloudTrail logs from S3 (in addition to the existing CloudTrail CloudWatch support).

A single S3 notification may contain multiple records from the creation of multiple S3 objects, each of which needs to be read, parsed, and converted. By default this will process five S3 objects concurrently and will emit logs in batches of up to 1k to the logs pipeline. This may mean that the ordering of the S3 objects, and of their logs, is not maintained: S3 objects listed later in the event notification may be loaded, parsed, and exported before earlier ones. In theory this can be avoided by setting FORWARDER_S3_MAX_PARALLEL_OBJECTS=1. However, S3 event notifications may fire multiple Lambda invocations concurrently, so ordering is not guaranteed in principle.
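For illustration, here is a minimal sketch of that fan-out and batching shape. Only FORWARDER_S3_MAX_PARALLEL_OBJECTS and the defaults (five concurrent objects, 1k-log batches) come from this PR; the helpers and types below are hypothetical stand-ins, not the forwarder's actual API.

```rust
use futures::stream::{self, StreamExt};

type Log = serde_json::Value;

// Hypothetical stand-ins for the forwarder's real load/parse and export steps.
async fn load_and_parse(_object_key: String) -> Vec<Log> { Vec::new() }
async fn emit_batch(_batch: Vec<Log>) {}

const DEFAULT_MAX_PARALLEL_OBJECTS: usize = 5;
const MAX_BATCH: usize = 1_000;

async fn process_objects(object_keys: Vec<String>) {
    let max_parallel: usize = std::env::var("FORWARDER_S3_MAX_PARALLEL_OBJECTS")
        .ok()
        .and_then(|v| v.parse().ok())
        .unwrap_or(DEFAULT_MAX_PARALLEL_OBJECTS);

    // Up to `max_parallel` objects are loaded and parsed at once; results
    // arrive in completion order, not notification order.
    let mut results = stream::iter(object_keys)
        .map(load_and_parse)
        .buffer_unordered(max_parallel);

    // Flush to the logs pipeline in batches of up to 1k records.
    let mut batch = Vec::with_capacity(MAX_BATCH);
    while let Some(logs) = results.next().await {
        for log in logs {
            batch.push(log);
            if batch.len() >= MAX_BATCH {
                emit_batch(std::mem::take(&mut batch)).await;
            }
        }
    }
    if !batch.is_empty() {
        emit_batch(batch).await;
    }
}
```

Because results are consumed in completion order, a later object that parses quickly can be exported before an earlier, larger one, which is the ordering caveat described above.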

There's some refactoring that I plan to do later to clean up sharing between the CloudWatch and S3 logs support. In addition, the CW logs support should be broken out of the parse module, similar to this new s3logs support.

The PR required some changes to the acker component because we don't know how many acks to expect ahead of time. Instead of buffering all acks, we spawn a listener to consume the acks/nacks concurrently with processing the S3 objects.
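A rough sketch of that shape, assuming a tokio mpsc channel carries the acks; the Ack type and function names are illustrative, not the acker's real API.

```rust
use tokio::sync::mpsc;
use tokio::task::JoinHandle;

// Illustrative ack type, not the forwarder's real one.
enum Ack {
    Ok,
    Nack(String),
}

/// Drain acks/nacks as they arrive instead of buffering a known count up
/// front; the handle resolves with the failure count once all senders drop.
fn spawn_ack_listener(mut ack_rx: mpsc::Receiver<Ack>) -> JoinHandle<usize> {
    tokio::spawn(async move {
        let mut failures = 0;
        while let Some(ack) = ack_rx.recv().await {
            match ack {
                Ack::Ok => {}
                Ack::Nack(reason) => {
                    eprintln!("nack: {reason}");
                    failures += 1;
                }
            }
        }
        failures
    })
}
```

The S3 objects are then processed while the listener runs; once the sender side is dropped, awaiting the handle yields the number of nacks seen.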

Fixes: #5

@mheffner marked this pull request as ready for review on February 25, 2026 at 22:28
@mheffner requested a review from rjenkins on February 25, 2026 at 22:28
let md = MessageMetadata::forwarder(counter.increment());
if let Err(e) = self
.logs_tx
.send(Message::new(Some(md), vec![log], None))

Could this block indefinitely?

Contributor Author

In theory this should not block indefinitely if the function is configured correctly because the maximum retry duration should equal the maximum function duration. This would mean that requests can't sit in the pipeline indefinitely, eventually unblocking the channel.

It's possible this could block slightly past the maximum function runtime, since there's a bit of a race between clearing the queue and then continuing to send here. However, in that case Lambda would likely tear the function down and respin the container. For persistent failures downstream, this would likely create back pressure on Lambda, causing some backoff.


cool

let json_map = self.parse_message_to_map(message, &mut lr);

match json_map {
Ok(None) => {}

can you just return lr here?

mut json_map: serde_json::Map<String, JsonValue>,
) -> LogRecord {
let mut lr = LogRecord {
time_unix_nano: (timestamp * 1_000_000) as u64,

probably not likely but could this silently wrap around?

Contributor Author

Unlikely, but yeah pushing a wrapper that'll fall back to now_nanos just in case.
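For reference, a sketch of what such a guard could look like, assuming the incoming timestamp is in milliseconds (per the `* 1_000_000` above); the actual wrapper in the PR may differ.

```rust
use std::time::{SystemTime, UNIX_EPOCH};

fn now_nanos() -> u64 {
    SystemTime::now()
        .duration_since(UNIX_EPOCH)
        .map(|d| d.as_nanos() as u64)
        .unwrap_or(0)
}

/// Convert a millisecond timestamp to nanoseconds, falling back to the
/// current time if the value is negative or the multiplication overflows.
fn timestamp_nanos(timestamp_ms: i64) -> u64 {
    u64::try_from(timestamp_ms)
        .ok()
        .and_then(|ms| ms.checked_mul(1_000_000))
        .unwrap_or_else(now_nanos)
}
```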


// Wait for the first task to finish if we've hit the concurrency limit, then
// stream its results immediately rather than accumulating them.
while tasks.len() >= max_concurrent {

Does this go one over max concurrent intentionally?

Contributor Author

No, it shouldn't. The > check is mostly just a safeguard.
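For context, a minimal sketch of that drain-before-push pattern (illustrative only, using FuturesUnordered; the PR's actual task handling may differ). Draining until the count drops below the limit before pushing the next task means the set never holds more than `max_concurrent` in-flight tasks.

```rust
use futures::stream::{FuturesUnordered, StreamExt};

// Stand-in for loading, parsing, and exporting one S3 object.
async fn process_object(key: String) -> usize {
    key.len()
}

async fn process_all(keys: Vec<String>, max_concurrent: usize) -> usize {
    let mut tasks = FuturesUnordered::new();
    let mut total = 0;

    for key in keys {
        // Wait for a running task to finish once the limit is reached, so
        // there are never more than `max_concurrent` tasks in flight.
        while tasks.len() >= max_concurrent {
            if let Some(n) = tasks.next().await {
                total += n;
            }
        }
        tasks.push(process_object(key));
    }

    // Drain whatever is still in flight.
    while let Some(n) = tasks.next().await {
        total += n;
    }
    total
}
```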


// Load object from S3
let object_data =
load_s3_object(&self.s3_client, bucket_name, object_key, &self.request_id).await?;

Can this block indefinitely or do we rely on timeouts from the S3 client?

Contributor Author

The S3 client's timeouts and retries should prevent that.
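For reference, a sketch of how timeouts and retries are typically configured on the Rust AWS SDK S3 client; the durations and attempt count are illustrative, not necessarily what this forwarder uses.

```rust
use std::time::Duration;

use aws_config::retry::RetryConfig;
use aws_config::timeout::TimeoutConfig;
use aws_config::BehaviorVersion;

async fn build_s3_client() -> aws_sdk_s3::Client {
    // Bound each attempt and the overall operation so a stuck GET cannot
    // hold the Lambda invocation open indefinitely.
    let timeouts = TimeoutConfig::builder()
        .connect_timeout(Duration::from_secs(5))
        .operation_attempt_timeout(Duration::from_secs(30))
        .operation_timeout(Duration::from_secs(120))
        .build();

    let config = aws_config::defaults(BehaviorVersion::latest())
        .retry_config(RetryConfig::standard().with_max_attempts(3))
        .timeout_config(timeouts)
        .load()
        .await;

    aws_sdk_s3::Client::new(&config)
}
```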

@rjenkins left a comment

nice

@mheffner merged commit 66b2be3 into main on Mar 5, 2026
3 checks passed

Development

Successfully merging this pull request may close these issues.

Support S3 log sources
