Update extension overhead explanation #159

tianchu · 2023-08-08T15:01:40Z

Mention DD_SERVERLESS_FLUSH_STRATEGY and aws.lambda.post_runtime_extensions_duration.

sumedham

minor nits

README.md

sumedham · 2023-08-09T19:50:17Z

README.md

+You may notice an increase of your Lambda function's reported duration (`aws.lambda.duration` or `aws.lambda.enhanced.duration`). This is because the Datadog Lambda Extension needs to flush data back to the Datadog API. Although the time spent by the extension flushing data is reported as part of the duration, it's done *after* AWS returns your function's response back to the client. In other words, the added duration *does not slow down* your Lambda function. See this [AWS blog post](https://aws.amazon.com/blogs/compute/performance-and-functionality-improvements-for-aws-lambda-extensions/) for more technical information. To monitor your function's actual performance and exclude the duration added by the Datadog extension, use the metric `aws.lambda.enhanced.runtime_duration`.

-By default, the Extension flushes data back to Datadog at the end of each invocation (for example, cold starts always trigger flushing). This avoids delays of data arrival for sporadic invocations from low-traffic applications, cron jobs, and manual tests. Once the Extension detects a steady and frequent invocation pattern (more than once per minute), it batches data from multiple invocations and flushes periodically at the beginning of the invocation when it's due. This means that *the busier your function is, the lower the average duration overhead per invocation*. In other words, for low-traffic applications, the duration overhead would be noticeable while the associated cost overhead is typically negligible; for high-traffic applications, the duration overhead would be barely noticeable.   
+By default, the Extension flushes data back to Datadog at the end of each invocation (for example, cold starts always trigger flushing). This avoids delays of data arrival for sporadic invocations from low-traffic applications, cron jobs, and manual tests. Once the Extension detects a steady and frequent invocation pattern (more than once per minute), it batches data from multiple invocations and flushes periodically at the beginning of the invocation when it's due. This means that *the busier your function is, the lower the average duration overhead per invocation*. In other words, for low-traffic applications, the duration overhead would be noticeable while the associated cost overhead is typically negligible; for high-traffic applications, the duration overhead would be barely noticeable. To understand the cost overhead casued by the duration used by the Datadog extension to flush data, use the metric `aws.lambda.post_runtime_extensions_duration` or `aws.lambda.enhanced.post_runtime_duration`. 


Suggested change

By default, the Extension flushes data back to Datadog at the end of each invocation (for example, cold starts always trigger flushing). This avoids delays of data arrival for sporadic invocations from low-traffic applications, cron jobs, and manual tests. Once the Extension detects a steady and frequent invocation pattern (more than once per minute), it batches data from multiple invocations and flushes periodically at the beginning of the invocation when it's due. This means that *the busier your function is, the lower the average duration overhead per invocation*. In other words, for low-traffic applications, the duration overhead would be noticeable while the associated cost overhead is typically negligible; for high-traffic applications, the duration overhead would be barely noticeable. To understand the cost overhead casued by the duration used by the Datadog extension to flush data, use the metric `aws.lambda.post_runtime_extensions_duration` or `aws.lambda.enhanced.post_runtime_duration`.

By default, the Extension flushes data back to Datadog at the end of each invocation (for example, cold starts always trigger flushing). This avoids delays of data arrival for sporadic invocations from low-traffic applications, cron jobs, and manual tests. Once the Extension detects a steady and frequent invocation pattern (more than once per minute), it batches data from multiple invocations and flushes periodically at the beginning of the invocation when it's due. This means that *the busier your function is, the lower the average duration overhead per invocation*. In other words, for low-traffic applications, the duration overhead would be noticeable while the associated cost overhead is typically negligible; for high-traffic applications, the duration overhead would be barely noticeable. To understand the compute overhead casued by the duration used by the Datadog extension to flush data, use the metric `aws.lambda.post_runtime_extensions_duration` or `aws.lambda.enhanced.post_runtime_duration`.

It's a little odd to say "cost overhead" and not point to a metric with $ units, lmk if this makes sense

i agree, but compute overhead is also confusing to me. I would just say duration overhead then.

Co-authored-by: sumedham <87997309+sumedham@users.noreply.github.com>

Update extension overhead explanation

a849054

tianchu requested a review from a team as a code owner August 8, 2023 15:01

sumedham approved these changes Aug 9, 2023

View reviewed changes

tianchu and others added 2 commits August 9, 2023 16:39

Apply suggestions from code review

bcca5fe

Co-authored-by: sumedham <87997309+sumedham@users.noreply.github.com>

Update README.md

70ae068

DarcyRaynerDD approved these changes Aug 9, 2023

View reviewed changes

tianchu merged commit e5e2fcc into main Aug 9, 2023

tianchu deleted the tian.chu/update-overhead-doc branch August 9, 2023 20:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update extension overhead explanation #159

Update extension overhead explanation #159

Uh oh!

tianchu commented Aug 8, 2023

Uh oh!

sumedham left a comment

Uh oh!

Uh oh!

sumedham Aug 9, 2023

Uh oh!

sumedham Aug 9, 2023

Uh oh!

tianchu Aug 9, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Update extension overhead explanation #159

Update extension overhead explanation #159

Uh oh!

Conversation

tianchu commented Aug 8, 2023

Uh oh!

sumedham left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sumedham Aug 9, 2023

Choose a reason for hiding this comment

Uh oh!

sumedham Aug 9, 2023

Choose a reason for hiding this comment

Uh oh!

tianchu Aug 9, 2023

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants