Add analyzed span configuration option for Celery #1383

ericmustin · 2020-04-25T11:46:11Z

Summary

This PR addresses open issue 1217 by adding App Analytics support for Celery worker and producer spans. Previously, spans created by the Celery instrumentation could not be enabled as analyzed spans for use in App Analytics. The only workaround was the datadog-agent configuration setting, which is brittle and decoupled from application settings, making it difficult to maintain. This PR exposes an environment variable, DD_CELERY_ANALYTICS_ENABLED, at the application level to enable analyzed spans for Celery.

Example Usage

DD_CELERY_ANALYTICS_ENABLED=true ddtrace-run python app.py
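
As a rough illustration of what the flag toggles, the sketch below reads DD_CELERY_ANALYTICS_ENABLED from a mapping of environment variables and treats it as a boolean that defaults to off, matching the "disabled by default, opt in via env var" behaviour described in this PR. The helper names (`as_bool`, `celery_analytics_enabled`) and the set of accepted truthy strings are assumptions made for illustration; they are not the ddtrace implementation.

```python
import os

# Hypothetical set of strings treated as "on". The values ddtrace actually
# accepts may differ; this set is an assumption for the sketch.
_TRUTHY = {"true", "1"}


def as_bool(value):
    """Interpret an environment-variable string as a boolean (unset -> False)."""
    if value is None:
        return False
    return value.strip().lower() in _TRUTHY


def celery_analytics_enabled(environ=None):
    """Return True when DD_CELERY_ANALYTICS_ENABLED holds a truthy value."""
    environ = os.environ if environ is None else environ
    return as_bool(environ.get("DD_CELERY_ANALYTICS_ENABLED"))
```

Passing a plain dict instead of relying on `os.environ` keeps the check easy to exercise in tests without mutating the process environment.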

Notes

Long-time listener, first-time caller. I tried to follow the documentation and test style of similar integrations where app analytics is disabled by default but can be enabled optionally via env var. Please let me know if there's anything I missed or any way y'all would prefer this documented. Happy to make any changes.
Additionally, it's worth noting that this option enables both worker and producer spans for app analytics. That seemed like a reasonable enough behaviour, but if you feel there ought to be an option to enable only worker spans or only producer spans for app analytics, I'm happy to refactor this. I noticed that celery's configuration allows worker_service_name and producer_service_name to be set separately.
Lastly, it appears that algoliasearch, dogpile_cache, jinja2, and mako also cannot enable app analytics at the moment. If this approach looks good, I would be happy to try to tackle those as well, either in this PR or in a separate one. I just figured I would grab celery first since there's an open issue on it.

Kyle-Verhoog

Just one comment and I think it's good to go!

👍 for docs updates
👍 for great set of tests

Kyle-Verhoog · 2020-04-27T15:19:44Z

ddtrace/contrib/celery/signals.py

@@ -30,6 +30,11 @@ def trace_prerun(*args, **kwargs):
    # propagate the `Span` in the current task Context
    service = config.celery['worker_service_name']
    span = pin.tracer.trace(c.WORKER_ROOT_SPAN, service=service, resource=task.name, span_type=SpanTypes.WORKER)
+    # set analytics sample rate
+    span.set_tag(


I think we only want to set the tag here if config.celery.get_analytics_sample_rate() is not None. 🙂

tbh I was just borrowing how the other integrations do it, like boto for instance, which just naively sets the return of get_analytics_sample_rate(), which would be None in the case that analytics_enabled is not set or set to false.

Happy to change it if setting a tag to None is an antipattern

dd-trace-py/ddtrace/settings/integration.py

Lines 82 to 98 in 4653e97

def get_analytics_sample_rate(self, use_global_config=False):
    """
    Returns analytics sample rate but only when integration-specific
    analytics configuration is enabled with optional override with global
    configuration
    """
    if self._is_analytics_enabled(use_global_config):
        analytics_sample_rate = getattr(self, 'analytics_sample_rate', None)
        # return True if attribute is None or attribute not found
        if analytics_sample_rate is None:
            return True
        # otherwise return rate
        return analytics_sample_rate

    # Use `None` as a way to say that it was not defined,
    # `False` would mean `0` which is a different thing
    return None

Hmm fair enough. However I think it is an antipattern since TMU None will get stringified to 'None' which is meaningless for this tag 😛.

I'm sure it gets handled and dropped somewhere in the backend but I think it's cleaner if we just don't set it. Less data to process and pass around 🙂

makes sense, appreciate the clarification, updated.
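
For readers following the thread, here is a minimal sketch of the guard being discussed: set the analytics tag only when the sample rate is not None. `FakeSpan`, `set_analytics_tag`, and the tag key are hypothetical stand-ins for illustration, and the stringifying `set_tag` is an assumption based on the reviewer's comment above rather than a statement about the real span class.

```python
class FakeSpan:
    """Hypothetical stand-in for a tracer span; stores tags as strings."""

    def __init__(self):
        self.tags = {}

    def set_tag(self, key, value):
        # Assumption (per the review comment): values are stringified,
        # so passing None would store the literal string 'None'.
        self.tags[key] = str(value)


# Placeholder tag name for this sketch, not the real ddtrace constant.
ANALYTICS_TAG = "analytics.sample_rate"


def set_analytics_tag(span, sample_rate):
    """Set the analytics tag only when a sample rate is actually defined."""
    if sample_rate is not None:
        span.set_tag(ANALYTICS_TAG, sample_rate)


naive = FakeSpan()
naive.set_tag(ANALYTICS_TAG, None)    # unguarded call stores 'None'

guarded = FakeSpan()
set_analytics_tag(guarded, None)      # guarded call stores nothing

enabled = FakeSpan()
set_analytics_tag(enabled, True)      # analytics enabled, no explicit rate
```

With the guard in place, a span from an integration that has analytics disabled carries no analytics tag at all, which is the "less data to process and pass around" outcome asked for in the review.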

Kyle-Verhoog

Thanks!! 😄

add analytics option for celery worker and producer

8567745

ericmustin requested a review from a team as a code owner April 25, 2020 11:46

ericmustin added 2 commits April 25, 2020 13:57

[celery] black formatting changes

1f5aed0

[celery] fix syntax for setting span tag

513cae7

Kyle-Verhoog reviewed Apr 27, 2020


[celery] be more defensive about setting tag to None for app analytics

d5a9353

Kyle-Verhoog approved these changes Apr 27, 2020


Merge branch 'master' into add_app_analytics_celery

00e571b

Kyle-Verhoog merged commit 92766da into DataDog:master Apr 28, 2020

Kyle-Verhoog added this to the 0.38.0 milestone May 12, 2020
