Add decoupleafterbatch converter to ensure decouple processor follows batch processor #1255

Merged. 22 commits merged into open-telemetry:main on Apr 22, 2024.

Conversation

@nslaughter (Contributor) commented on Apr 16, 2024:

Problem

When using the batch processor in a Lambda Collector pipeline, it's important that the decouple processor always comes after the batch processor. The batch processor delays processing of some spans/metrics/logs in order to batch them with later-arriving items. If that delay isn't offset by the decouple processor, it can cause issues when the Lambda environment is frozen, potentially leading to data loss and higher costs. In short, the current situation requires users to apply knowledge specific to the Lambda distribution in order to get the behavior they most likely expect.

Solution

This PR introduces a new decoupleafterbatch converter that automatically modifies the user's config, adding a decouple processor to processor chains where needed. The converter checks each pipeline for a batch processor; if one is defined and no decouple processor follows it, a decouple processor is appended to the end of the pipeline (see the example after the list below).

The key aspects are:

  1. When a user configures a batch processor without following it with a decouple processor, the converter will automatically add a decouple processor to the end of the pipeline.

  2. If a decouple processor is already present after the last occurrence of the batch processor, the pipeline is left unchanged.

  3. If batch processors are defined in multiple pipelines (logs, traces, etc.), each pipeline is evaluated independently and a decouple processor is appended only where that pipeline needs one.
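For example, here is a minimal sketch of the intended effect on a single traces pipeline (the receiver and exporter names are placeholders, and the fragment omits the rest of the config):

```yaml
# User-supplied config fragment: batch is the last processor,
# with no decouple processor after it.
service:
  pipelines:
    traces:
      receivers: [otlp]
      processors: [batch]
      exporters: [otlp]
```

After the decoupleafterbatch converter runs, the pipeline would look like this (assuming the converter also registers the decouple processor in the top-level `processors:` section when it isn't already defined):

```yaml
# Rewritten config fragment: decouple now follows batch.
service:
  pipelines:
    traces:
      receivers: [otlp]
      processors: [batch, decouple]
      exporters: [otlp]
```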

Rationale

The decouple processor allows the Lambda function to return while the Collector continues exporting observability data asynchronously. Ensuring it always comes after the batch processor protects users from the potential data loss, delays, and unnecessary cost that can occur when the Lambda environment is frozen in the middle of a batch processor timeout.

Alternatives Considered

  1. Document the requirement and rely on users to always add the decouple processor. This is likely to be overlooked and places the burden on the user to get the expected results.

  2. Disable the batch processor when running in Lambda. This would prevent the performance problems the batch processor can cause in Lambda, but it would also deny users the batch processor's capabilities.

Automating the config change with a converter is the simplest and most robust approach for users. It preserves the benefit of the batch processor, which is recommended in OpenTelemetry best practices, while ensuring that users don't run into unexpected problems caused by the unique behaviors of the Lambda environment.

Testing

Unit tests have been added to verify the converter works as expected: when the user hasn't defined any processors, and when they've defined just a batch processor, just a decouple processor, or both in various valid and invalid orders (one such case is sketched below).
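As one illustration of an invalid order (a hypothetical fragment, not the actual test fixture), consider a logs pipeline where the decouple processor precedes the batch processor:

```yaml
# Hypothetical input: decouple appears before batch,
# so the batch processor's delay is not offset.
service:
  pipelines:
    logs:
      processors: [decouple, batch]
```

Because no decouple processor follows the last batch processor, the expected result is `processors: [decouple, batch, decouple]`.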

The remaining checks run in GitHub Actions.

Please take a look and let me know if you have any feedback or suggestions!

@nslaughter requested a review from a team as a code owner on April 16, 2024 at 16:37.
@tylerbenson (Member) commented:

Thanks for the thorough review @adcharre!

@adcharre (Contributor) left a comment:

Looking good, just one pedantic comment about the readme.

Commit: …er/README.md (Co-authored-by: Adam Charrett <73886859+adcharre@users.noreply.github.com>)
@nslaughter (Contributor, Author) replied:

> Looking good, just one pedantic comment about the readme.

I appreciate the details. Thank you.

@adcharre (Contributor) left a comment:

LGTM

@tylerbenson (Member) left a comment:

Nice work!


## Auto-Configuration

Due to the significant performance improvements with this approach, the OpenTelemetry Lambda Layer automatically configures the decouple processor when the batch processor is used. This ensures the best performance by default.
A reviewer (Member) commented on this addition:

Please also update collector/README.md with similar info.

@nslaughter (Contributor, Author) replied:

Agree. I pushed the changes. Please let me know your thoughts.

@tylerbenson merged commit dff9bc6 into open-telemetry:main on Apr 22, 2024. 12 checks passed.