
Unable to temporarily disable specific plugin #1763

Closed
dobeerman opened this issue Jan 17, 2022 · 9 comments
Labels
bug Something isn't working

Comments

@dobeerman

Describe the bug

The tracer is initialised at the beginning of the process, with a couple of plugin overrides.

import { tracer as DataDogTracer } from "dd-trace";

export const tracer = DataDogTracer.init()
    .use("http", { blocklist: [/.*sqs\..*\.amazonaws\.com.*/] })
    .use("fs", { enabled: false })
    .use("express", { blocklist: ["/metrics"] });

For some long-running processes we need to disable some plugins either completely or partially (the aws-sdk plugin entirely, or only dynamodb), because they collect a large amount of data, which leads to memory leaks. (We use aws-sdk@2.521.0.)

So, we do the following:

await tracer
    .use("aws-sdk", false)
    .use("http", {
        blocklist: [/.*dynamodb\..*\.amazonaws\.com.*/],
        middleware: false,
    })
    .trace("handle-job", { tags: { name: jobAttributes.name } }, () =>
        handleJob(jobAttributes)
    );

Unfortunately, we still receive a lot of related traces:

[image: screenshot of the resulting traces]

which include tons of aws.request PutItem, http.request POST, tcp.connect and dns.lookup to DynamoDB.

Q: What are we doing wrong? 🤔


Environment: node:12-alpine3.11
Tracer version: ^1.1.2

@dobeerman dobeerman added the bug Something isn't working label Jan 17, 2022
@rochdev
Member

rochdev commented Jan 17, 2022

From the snippet above, it looks like the tracer may not be used as intended. Calls to init and use are meant to happen before any other module is loaded in the process, while calls to trace are meant to trace an operation at any point while the process is running. The fact that both are called in the same place indicates one of them may not be called at the right time.

For example, I would expect to see something like this instead:

datadog.js
import { tracer } from "dd-trace";

tracer
    .init()
    .use("aws-sdk", false)
    .use("http", {
        blocklist: [/.*dynamodb\..*\.amazonaws\.com.*/],
        middleware: false,
    });

export { tracer };

server.js
import { tracer } from "./datadog"; // different file to avoid hoisting

async function someFunction (jobAttributes) {
    await tracer
        .trace("handle-job", { tags: { name: jobAttributes.name } }, () =>
            handleJob(jobAttributes)
        );
}

Most of the time, when a plugin configuration doesn't work it's because it was configured too late and not at the beginning of the process.
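To make the load order concrete (a minimal sketch, not part of the original comment): with ES modules, imports are evaluated in source order, so the import of the tracer file must come before any instrumented module, which is what the "different file to avoid hoisting" note is about. The express and aws-sdk imports below stand in for whatever instrumented modules the app actually uses.

// server.js: the tracer module must be the first import so that init()/use()
// run before any instrumented module (http, express, aws-sdk, ...) is evaluated
import { tracer } from "./datadog"; // keep this as the very first import
import express from "express";      // instrumented modules come after the tracer
import AWS from "aws-sdk";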

@dobeerman
Author

From the snippet above, it looks like the tracer may not be used as intended. Calls to init and use are meant to happen before any other module is loaded in the process, while calls to trace are meant to trace an operation at any point while the process is running. The fact that both are called in the same place indicates one of them may not be called at the right time.

Hmm... The tracer is initialised at the top of the process, but it is initialised with the aws-sdk plugin enabled.

Most of the time, when a plugin configuration doesn't work it's because it was configured too late and not at the beginning of the process.

This is exactly the problem: it is impossible to disable some plugins later in the code. We just need to disable aws-sdk ONLY for a specific scope, which is handle-job.

@rochdev
Member

rochdev commented Jan 17, 2022

Oh, OK, I misunderstood and thought it was for the entire process. This is not currently possible. It is something we are working on, but it is unlikely to land for at least a few months.

However, tracing shouldn't cause any memory leak regardless of the number of spans created, as long as the trace finishes in a timely manner. Can you provide more details about the specific problem these spans are causing?
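As an illustration of "finishes in a timely manner" (a minimal sketch, not a suggestion made in this thread; loadItems and processItem are hypothetical placeholders): instead of keeping one root span open for the entire multi-hour job, each item or batch could be wrapped in its own short-lived trace so that every trace completes quickly and can be flushed.

import { tracer } from "./datadog";

async function handleJobInChunks(jobAttributes) {
    const items = await loadItems(jobAttributes); // hypothetical: fetch the work items

    for (const item of items) {
        // each iteration is a separate, short-lived trace that can be sent independently
        await tracer.trace(
            "handle-job.item",
            { tags: { name: jobAttributes.name } },
            () => processItem(item) // hypothetical: the per-item work
        );
    }
}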

@dobeerman
Author

Can you provide more details about the specific problem these spans are causing?

As part of our business logic we have a service that performs long-running operations on multiple objects. It includes modules for downloading the source file, parsing it, and data matching. The last two modules perform operations on each single data item: PutItem and GetItem in DynamoDB, a request to a third-party service, and UpdateItem in DynamoDB. The amount of data can reach 150K items and the operation can take around 6 hours or more.

In the end we have a large amount of tracing data that has not been pushed to Datadog yet, so it persists in memory and at some point crashes the service with an out-of-memory exception.

@rochdev
Member

rochdev commented Jan 18, 2022

Ok, in this case it's definitely a bug. We're supposed to have a maximum number of spans before a flush occurs automatically to avoid this, which doesn't seem to happen for your use case. Assuming this worked properly and traces would be sent in many smaller chunks without significantly affecting memory usage, would it be fine if these spans were sent?
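For reference (a hedged editorial note, not from the original comment): recent dd-trace releases document flush-related tracer options such as flushInterval and flushMinSpans; whether they are available and effective in the version pinned here (^1.1.2) would need to be checked against that version's documentation.

// datadog.js: illustrative values only, assuming these options are supported
import { tracer } from "dd-trace";

tracer.init({
    flushInterval: 2000, // how often (in ms) finished spans are sent to the agent
    flushMinSpans: 1000, // partially flush a trace once it holds this many finished spans
});

export { tracer };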

@dobeerman
Author

would it be fine if these spans were sent?

👍 I think it's a good idea if these spans end up grouped under parent spans. Or, how would that look in the user interface?

@rochdev
Member

rochdev commented Jan 19, 2022

It would still be a single trace in the UI even if sent in chunks since there would still be a single root operation, just a very large one.

@tlhunter
Member

@dobeerman is this still an issue for you?

@tlhunter
Member

I'll close this for now but we can reopen if it turns out to still be an issue.
