[Single Span Sampling] Add single span parser #2095

marcotc · 2022-06-17T21:26:50Z

Follows up from #2091

This PR parses user provided JSON rules into Datadog::Tracing::Sampling::Span::Rule objects.

The code ensures that no errors are propagated back to the user in the form of an exception, but all errors are logged to aid in addressing them.

In the future, we'll likely support parsing this data directly from YAML or a global datadog.conf JSON, thus a non-JSON specific method, parse_list, is also provided in additional to the main parse_json method.

A few other changes:

The TokenBucket now is more strict about its inputs, which helps the validation of Single Span sample rules.
Equality helpers were added to Rule and Matcher to ease testing.

ivoanjo

👍 LGTM

lib/datadog/tracing/sampling/rate_limiter.rb

ivoanjo · 2022-06-29T10:50:37Z

lib/datadog/tracing/sampling/span/matcher.rb

+          def ==(other)
+            return super unless other.is_a?(Matcher)
+
+            name == other.name &&
+              service == other.service
+          end


Minor: Arguably this is not very duck-typing of us; perhaps we could check that other.respond_to?(:name) && other.respond_to?(:service)?

(Would apply to other similar changes in this PR)

Do you think this is needed, even with return super unless other.is_a?(Matcher) being checked?

I'll come back to this in the feature branch if you think this needs to be improved, merging this for now to keep me sane.

Ah! Yes, I wasn't clear about that part -- my suggestion would mean replacing the is_a?(Matcher) and relying only on the respond_to? instead. (But I marked this as minor, so definitely no need to hold anything back just for it)

lib/datadog/tracing/sampling/span/rule_parser.rb

ivoanjo · 2022-06-29T10:58:14Z

lib/datadog/tracing/sampling/span/rule_parser.rb

+            # Parses a list of Hashes containing the parsed JSON information
+            # for Single Span Sampling configuration.
+            # In case of parsing errors, `nil` is returned.
+            #
+            # @param rules [Array<String] the JSON configuration rules to be parsed
+            # @return [Array<Datadog::Tracing::Sampling::Span::Rule>] a list of parsed rules
+            # @return [nil] if parsing failed
+            def parse_list(rules)
+              unless rules.is_a?(Array)
+                # Using JSON terminology for the expected error type
+                Datadog.logger.warn("Span Sampling Rules are not an array: #{JSON.dump(rules)}")
+                return nil
+              end
+
+              parsed = rules.map do |hash|
+                unless hash.is_a?(Hash)
+                  # Using JSON terminology for the expected error type
+                  Datadog.logger.warn("Span Sampling Rule is not a key-value object: #{JSON.dump(hash)}")
+                  return nil
+                end
+
+                begin
+                  parse_rule(hash)
+                rescue => e
+                  Datadog.logger.warn("Cannot parse Span Sampling Rule #{JSON.dump(hash)}: " \


I can live-ish with it I guess, but from your PR description:

In the future, we'll likely support parsing this data directly from YAML or a global datadog.conf JSON, thus a non-JSON specific method, parse_list, is also provided in additional to the main parse_json method.

Is it me or is this method still quite biased towards JSON in its current form? Should we just make it private for now, and revisit it later once we want to expand to yaml and other formats?

Changed to non-json bias.

lib/datadog/tracing/sampling/span/rule_parser.rb

ivoanjo · 2022-06-29T11:06:03Z

lib/datadog/tracing/sampling/span/rule_parser.rb

+                rescue => e
+                  Datadog.logger.warn("Cannot parse Span Sampling Rule #{JSON.dump(hash)}: " \
+                  "#{e.class.name} #{e} at #{Array(e.backtrace).first}")
+                  nil


This semantics is somewhat unexpected -- we seem to be quite strict with all other checks (and bail out from this method entirely if something is off), but if parse_rule fails then we just skip over that rule.

This seems deliberate (the tests have is_expected.to be_empty), but I'm curious why we're more strict in some cases and less in others?

This is not spec'ed out but I liked the strict parsing, I've changed the PR accordingly.

I'll make sure the spec reflects it as well. (Or if there's disagreement at spec-level, I'll make changes to the feature branch accordingly)

spec/datadog/tracing/sampling/span/rule_parser_spec.rb

marcotc added the feature Involves a product feature label Jun 17, 2022

marcotc requested a review from a team June 17, 2022 21:26

marcotc self-assigned this Jun 17, 2022

marcotc force-pushed the single-span-parser branch from b83d795 to eba6923 Compare June 20, 2022 17:39

marcotc changed the title ~~[Single Span Sampling] Add single span sampling rule~~ [Single Span Sampling] Add single span parser Jun 20, 2022

marcotc mentioned this pull request Jun 20, 2022

[Single Span Sampling] Span Sampler #2098

Merged

ivoanjo approved these changes Jun 29, 2022

View reviewed changes

Base automatically changed from single-span-rule to feature-single-span-sampling June 30, 2022 19:05

marcotc force-pushed the single-span-parser branch 2 times, most recently from ddd19e7 to e5ba346 Compare June 30, 2022 19:24

marcotc added 3 commits June 30, 2022 12:24

[Single Span Sampling] Parse user configuration

8645e4c

Better error message on json parse error

d00ea3a

Even better error messages

de3b4a3

marcotc force-pushed the single-span-parser branch from e5ba346 to de3b4a3 Compare June 30, 2022 19:24

Remove stable JSON-related comments

a03ae7f

marcotc merged commit 216a43d into feature-single-span-sampling Jun 30, 2022

marcotc deleted the single-span-parser branch June 30, 2022 19:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Single Span Sampling] Add single span parser #2095

[Single Span Sampling] Add single span parser #2095

marcotc commented Jun 17, 2022

ivoanjo left a comment

ivoanjo Jun 29, 2022

marcotc Jun 30, 2022

marcotc Jun 30, 2022

ivoanjo Jul 1, 2022

ivoanjo Jun 29, 2022

marcotc Jun 30, 2022

ivoanjo Jun 29, 2022

marcotc Jun 30, 2022

[Single Span Sampling] Add single span parser #2095

[Single Span Sampling] Add single span parser #2095

Conversation

marcotc commented Jun 17, 2022

ivoanjo left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment