Add support for instrumenting Qless jobs #1237

sco11morgan · 2020-11-06T17:00:25Z

qless is Redis-based job queueing system inspired by Resque.

Addresses #1053

marcotc

Very good work! Super clean so far!

I left a few comments, let me know if you have any question about them.

lib/ddtrace/contrib/qless/qless_job.rb

marcotc · 2020-11-06T19:54:43Z

docs/GettingStarted.md

+| --- | ----------- | ------- |
+| `analytics_enabled` | Enable analytics for spans produced by this integration. `true` for on, `nil` to defer to the global setting, `false` for off. | `false` |
+| `service_name` | Service name used for `qless` instrumentation | `'qless'` |
+| `workers` | An array including all worker classes you want to trace (e.g. `[MyJob]`) | All jobs |


By default, enabling an instrumentation in ddtrace enables it exhaustively: we automatically instrument as much as we can.

Thankfully, it seems like Qless does allow us to completely instrument all workers, like you implemented by extending Qless::Workers::BaseWorker.

This leaves us in a good place: all users have to do is add c.use :qless and they are go to go.

The option you added here, to selectively enable the tracer, is not a pattern we currently use, as more instrumentation is normally more desirable than the contrary.

My question to you is: do you have any requirements today that would be fulfilled by selective instrumentation, vs exhaustive instrumentation.

The option you added here, to selectively enable the tracer

@marcotc which option are you referring to? analytics_enabled or workers?

Looking at all the other integrations, analytics_enabled mostly defaults to false.
By default all worker classes are instrumented, but I'm giving the option to restrict the list (like the resque integration does).

We would not use selective instrumentation. Would you prefer I remove the workers option?

Sorry if I wasn't clear. You are correct that I was only referring to workers. analytics_enabled is correct.

We would not use selective instrumentation.

In this case I do suggest the removal of the workers option. We try our best to write instrumentation that is safe to enable in the whole application, so restricting instrumentation would only happen in case there there's a strong user case for it.

spec/ddtrace/contrib/qless/job.rb

marcotc · 2020-11-06T20:00:21Z

lib/ddtrace/contrib/qless/tracer_cleaner.rb

+        def around_perform(job)
+          return super unless datadog_configuration && tracer
+
+          tracer.shutdown! if forked?


Do you think we'd be able to exercise this fork clean up in a test case?

I tried adding tests for the forked path but I couldn't get the forked process working correctly in tests (we had a conversation about testing forked code back in #1053). I've manually tested it and the tracer is shutdown.

I'll set up some time so we help you with this. We'll try to write a forking test and see what issues we encounter.

@marcotc Let me know when you have some time. Maybe this Friday? I'm in the US Central time zone

marcotc · 2020-11-09T20:47:55Z

lib/ddtrace/contrib/qless/qless_job.rb

+            span.span_type = Datadog::Ext::AppTypes::WORKER
+            span.set_tag(Ext::TAG_JOB_ID, job.jid)
+            span.set_tag(Ext::TAG_JOB_QUEUE, job.queue_name)
+            span.set_tag(Ext::TAG_JOB_TAGS, job.tags)


I was reading about the purpose of job.tags and I noticed they state:

Tagging / Tracking -- Some jobs are more interesting than others. Track those jobs to get updates on their progress. Tag jobs with meaningful identifiers to find them quickly in the UI.

This seems to me like tags can carry uniquely identifiable user information (like email addresses). In contrast with job.queue_name (setup time configuration) or job.jid (opaque UID), it think that job.tags can contain information that the user might not want to have stored by default.

Would you say I'm understanding the tags concept correctly? If so, we should gate this data collection the same way you did for job.data already (thank you for that one, btw!).

marcotc · 2020-11-19T21:53:47Z

👋 @sco11morgan, I'll work on your PR tomorrow to try to figure out the forking tests, I'll let you know of the outcomes (🤞 I'll see a few commits adding such test to your branch).

marcotc · 2020-11-20T22:17:15Z

@sco11morgan, I added test for the default use case using forks.

Turns out the include order was actually inverted for QlessJob and TracerCleaner, and the cleaner was shutting down the tracer before we had a chance to instrument it.

Could you please take a look at my commits and see if they make sense to you?
We can work on merging it soon after.

sco11morgan · 2020-11-20T22:48:45Z

You got the forked test to work! That's great. Good catch on the flipping of the order - makes sense as that means the TracerCleaner is the first layer of middleware so that it can run its "after" last and cleanup the tracer.

ericmustin

This generally lgtm and i think it's good to ship.

i know we've been trying to add error_handler as an option for job error handling https://github.com/DataDog/dd-trace-rb/blob/master/docs/GettingStarted.md#sidekiq. Is that something we want to add here? I think it's fine if not, it may be something we want to add to all integrations as part of future work.

marcotc · 2020-11-26T18:48:42Z

Thank you for the review, @ericmustin!

I think the integration in its current state already provides enough value for users that we can confidently ship it.

I agree that adding the error_handler is definitely something we should have in the future.

marcotc · 2021-01-06T21:12:18Z

Thank you again for your work in this PR, @sco11morgan. We've just released v0.44.0, which includes these changes. Let us know if you have any feedback with this new version.

sco11morgan requested a review from a team November 6, 2020 17:00

Add support for Qless jobs

f6d9739

marcotc reviewed Nov 6, 2020

View reviewed changes

marcotc added community Was opened by a community member feature Involves a product feature integrations Involves tracing integrations labels Nov 6, 2020

Scott Morgan added 3 commits November 6, 2020 15:40

move qless spec job to support dir

0326b70

tag span with interesting qless job info

edc7392

update docs for new option tag_job_data

c26cbd8

marcotc reviewed Nov 9, 2020

View reviewed changes

Scott Morgan added 3 commits November 18, 2020 11:03

optionally allow for qless to tag with job tags

a2a1031

qless: always instrument all workers

e23956c

fix rubocop for: optionally allow for qless to tag with job tags

4a7a7b9

Scott Morgan and others added 4 commits November 19, 2020 16:47

cleanup leftover workers array

094da9c

Merge branch 'master' into instrument-qless

99c45d1

Add fork testing

764c8dd

Skip fork test on platforms that don't support it

42b8197

marcotc requested a review from ericmustin November 23, 2020 16:39

ericmustin approved these changes Nov 26, 2020

View reviewed changes

marcotc merged commit 82a79f1 into DataDog:master Nov 26, 2020

marcotc added this to the 0.44.0 milestone Nov 27, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for instrumenting Qless jobs #1237

Add support for instrumenting Qless jobs #1237

sco11morgan commented Nov 6, 2020

marcotc left a comment

marcotc Nov 6, 2020

sco11morgan Nov 6, 2020

marcotc Nov 9, 2020

sco11morgan Nov 18, 2020

marcotc Nov 6, 2020

sco11morgan Nov 6, 2020

marcotc Nov 9, 2020

sco11morgan Nov 18, 2020

marcotc Nov 9, 2020

sco11morgan Nov 18, 2020

marcotc commented Nov 19, 2020

marcotc commented Nov 20, 2020

sco11morgan commented Nov 20, 2020

ericmustin left a comment

marcotc commented Nov 26, 2020

marcotc commented Jan 6, 2021

Add support for instrumenting Qless jobs #1237

Add support for instrumenting Qless jobs #1237

Conversation

sco11morgan commented Nov 6, 2020

marcotc left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

marcotc commented Nov 19, 2020

marcotc commented Nov 20, 2020

sco11morgan commented Nov 20, 2020

ericmustin left a comment

Choose a reason for hiding this comment

marcotc commented Nov 26, 2020

marcotc commented Jan 6, 2021