fix(tracing): do not raise exception if partial flush is triggered without any spans #9349

romainkomorndatadog · 2024-05-22T15:49:07Z

Adds a guard in on_span_finish() so that, with partial flushing enabled, it no longer raises an IndexError when there are no spans to flush (which can happen if tracer.configure() is called between the time a span is created and the time it is finished).

In practice, this turns into:

>>> import ddtrace
>>> with ddtrace.tracer.trace("regression"):
...     ddtrace.tracer.configure(partial_flush_min_spans=1)
...
Partial flush triggered but no spans to flush (was tracer reconfigured?)
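
A minimal sketch of the kind of guard described above, using a toy aggregator rather than the real ddtrace code. `ToyAggregator`, `ToySpan`, `ToyTrace`, and the `flushed` list are hypothetical names for illustration only, and reconfiguring the tracer is modeled as swapping in a fresh aggregator that never saw the span start:

```python
import logging
from collections import defaultdict
from dataclasses import dataclass, field

log = logging.getLogger(__name__)


@dataclass
class ToySpan:
    # Hypothetical stand-in for a tracer span.
    trace_id: int
    name: str
    finished: bool = False


@dataclass
class ToyTrace:
    spans: list = field(default_factory=list)
    num_finished: int = 0


class ToyAggregator:
    """Hypothetical aggregator for illustration; not the ddtrace implementation."""

    def __init__(self, partial_flush_enabled=False, partial_flush_min_spans=1):
        self.partial_flush_enabled = partial_flush_enabled
        self.partial_flush_min_spans = partial_flush_min_spans
        self._traces = defaultdict(ToyTrace)
        self.flushed = []  # batches of spans that would be sent

    def on_span_start(self, span):
        self._traces[span.trace_id].spans.append(span)

    def on_span_finish(self, span):
        span.finished = True
        trace = self._traces[span.trace_id]
        trace.num_finished += 1
        should_partial_flush = (
            self.partial_flush_enabled
            and trace.num_finished >= self.partial_flush_min_spans
        )
        if trace.num_finished != len(trace.spans) and not should_partial_flush:
            return
        finished = [s for s in trace.spans if s.finished]
        if not finished:
            # The guard: code that goes on to touch finished[0] would
            # otherwise raise IndexError here.
            log.debug("Partial flush triggered but no spans to flush (was tracer reconfigured?)")
            del self._traces[span.trace_id]
            return
        # Keep unfinished spans for a later flush and hand off the finished ones.
        trace.spans = [s for s in trace.spans if not s.finished]
        trace.num_finished -= len(finished)
        if not trace.spans:
            del self._traces[span.trace_id]
        self.flushed.append(finished)


# The span starts on one aggregator and finishes on a freshly configured one.
old = ToyAggregator()
new = ToyAggregator(partial_flush_enabled=True, partial_flush_min_spans=1)
span = ToySpan(trace_id=1, name="regression")
old.on_span_start(span)
new.on_span_finish(span)  # logs at debug level instead of raising
```

In this sketch the new aggregator has a finished count of 1 but an empty span list, which is the situation the guard is meant to absorb.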

This also refactors the test for our os.fork() wrapper to have the child process unpatch coverage (just in case, since it occasionally causes exceptions on exit) and exit cleanly (otherwise it would continue running other tests which is not what we want).
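
The "exit cleanly" part can be sketched as below, assuming a POSIX platform. `run_in_forked_child` is a hypothetical helper, not the actual test code, and the `coverage` unpatching step is left out because its details are not shown here:

```python
import os


def run_in_forked_child(child_fn):
    """Fork, run child_fn in the child only, and return the child's exit code."""
    pid = os.fork()
    if pid == 0:
        # Child: always leave via os._exit() so control never returns to the
        # caller (for example, a test runner that would keep running tests).
        exit_code = 1
        try:
            child_fn()
            exit_code = 0
        finally:
            os._exit(exit_code)
    # Parent: wait for the child and report how it exited.
    _, status = os.waitpid(pid, 0)
    if os.WIFEXITED(status):
        return os.WEXITSTATUS(status)
    return -1  # child was terminated by a signal
```

Using `os._exit()` rather than `sys.exit()` skips exit handlers and exception propagation in the child, so the parent only ever sees an exit status.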

Checklist

Change(s) are motivated and described in the PR description
Testing strategy is described if automated tests are not included in the PR
Risks are described (performance impact, potential for breakage, maintainability)
Change is maintainable (easy to change, telemetry, documentation)
Library release note guidelines are followed or label changelog/no-changelog is set
Documentation is included (in-code, generated user docs, public corp docs)
Backport labels are set (if applicable)
If this PR changes the public interface, I've notified @DataDog/apm-tees.

Reviewer Checklist

Title is accurate
All changes are related to the pull request's stated goal
Description motivates each change
Avoids breaking API changes
Testing strategy adequately addresses listed risks
Change is maintainable (easy to change, telemetry, documentation)
Release note makes sense to a user of the library
Author has acknowledged and discussed the performance implications of this PR as reported in the benchmarks PR comment
Backport labels are set in a manner that is consistent with the release branch maintenance policy

datadog-dd-trace-py-rkomorn · 2024-05-22T15:59:42Z

Datadog Report

Branch report: romain.komorn/AIT-10242/dont_crash_when_spans_list_empty
Commit report: 13f14b6
Test service: dd-trace-py

❌ 1 Failed (0 Known Flaky), 169460 Passed, 1077 Skipped, 9h 44m 55.2s Total duration (25m 13.56s time saved)

❌ Failed Tests (1)

test_slow_imports - test_serverless.py - Details

 Expected status 0, got 1.
 === Captured STDOUT ===
 === End of captured STDOUT ===
 === Captured STDERR ===
 Traceback (most recent call last):
   File "/root/project/ddtrace/internal/packages.py", line 250, in is_distribution_available
     import importlib.metadata as importlib_metadata
   File "tests/internal/test_serverless.py", line 125, in find_spec
     raise ImportError(f"module {fullname} was imported!")
 ImportError: module importlib.metadata was imported!
 ...

brettlangdon

can we add a regression test?

the simple case we had should work well:

with tracer.trace("test"):
    tracer.configure(partial_flush_enabled=True, partial_flush_min_spans=1)

ddtrace/_trace/processor/__init__.py

pr-commenter · 2024-05-22T16:18:33Z

Benchmarks

Benchmark execution time: 2024-06-18 08:06:16

Comparing candidate commit a49062d in PR branch romain.komorn/AIT-10242/dont_crash_when_spans_list_empty with baseline commit 0fb7afa in branch main.

Found 0 performance improvements and 0 performance regressions! Performance is the same for 221 metrics, 9 unstable metrics.

brettlangdon

do we want to backport this? I would think no since it isn't a bug fix. if there isn't a release note we cannot release a patch with this anyways, so it would have to wait for another change to be backported.

we might want to change this to a fix(tracing): do not raise exception if partial flush is triggered without any spans, add a release note, and then keep the backports. wdyt?

ddtrace/_trace/processor/__init__.py

...asenotes/notes/fix-tracing-dont_raise_exception_on_empty_partial_flush-131cd3268101f255.yaml

tests/internal/test_settings.py

codecov-commenter · 2024-05-22T17:36:52Z

Codecov Report

Attention: Patch coverage is 17.64706% with 28 lines in your changes missing coverage. Please review.

Project coverage is 10.10%. Comparing base (deadfcd) to head (1193055).
Report is 9 commits behind head on main.

Files                                          Patch %   Lines
ddtrace/_trace/processor/__init__.py           33.33%    12 Missing ⚠️
tests/tracer/test_processors.py                0.00%     12 Missing ⚠️
tests/contrib/subprocess/test_subprocess.py    0.00%     4 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##             main    #9349       +/-   ##
===========================================
- Coverage   75.61%   10.10%   -65.51%     
===========================================
  Files        1336     1347       +11     
  Lines      125991   125077      -914     
===========================================
- Hits        95271    12643    -82628     
- Misses      30720   112434    +81714
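
The figures in this report can be cross-checked with a little arithmetic. The sketch below rests on two assumptions that are not stated in the report: that 33.33% patch coverage with 12 missing lines means 18 changed lines in the processor module, and that the project percentages are truncated rather than rounded to two decimals:

```python
import math

# Missing lines per file, as listed in the report.
missing = {
    "ddtrace/_trace/processor/__init__.py": 12,
    "tests/tracer/test_processors.py": 12,
    "tests/contrib/subprocess/test_subprocess.py": 4,
}
total_missing = sum(missing.values())

# Assumption: 18 changed lines in the processor (6 covered, 12 missing gives
# 33.33%); the two 0.00% files contribute only their missing lines.
changed = 18 + 12 + 4
covered = changed - total_missing
patch_pct = 100 * covered / changed


def pct_truncated(hits, lines):
    """Coverage percentage truncated (not rounded) to two decimals."""
    return math.floor(10000 * hits / lines) / 100


# Hits and Lines rows of the coverage diff for base (main) and head (#9349).
base_pct = pct_truncated(95271, 125991)
head_pct = pct_truncated(12643, 125077)
delta = round(head_pct - base_pct, 2)
```

Under those assumptions the numbers line up with the report: 28 missing lines, a patch coverage of 6 out of 34 lines, and a project coverage change from 75.61 to 10.10.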

romainkomorndatadog · 2024-06-17T14:13:32Z

So after yet another twist and turn... @brettlangdon and I decided that we don't want to alter the behavior of the span aggregator in this PR.

Any spans opened prior to forking could end up getting submitted if they're closed in the forked process (which is not currently the case). Deciding whether that's desirable behavior is outside of the scope of this change.

I'm leaving in the change to the test_fork() test just because it bugs me that we're forking and not having the child exit out as soon as it can, though.

github-actions · 2024-06-18T09:11:40Z

The backport to 2.7 failed:

The process '/usr/bin/git' failed with exit code 1

To backport manually, run these commands in your terminal:

# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add .worktrees/backport-2.7 2.7
# Navigate to the new working tree
cd .worktrees/backport-2.7
# Create a new branch
git switch --create backport-9349-to-2.7
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 fffab017cb2de72d10b35585350c7fa65756e785
# Push it to GitHub
git push --set-upstream origin backport-9349-to-2.7
# Go back to the original working tree
cd ../..
# Delete the working tree
git worktree remove .worktrees/backport-2.7

Then, create a pull request where the base branch is 2.7 and the compare/head branch is backport-9349-to-2.7.

…thout any spans (#9349)

Co-authored-by: Brett Langdon <brett.langdon@datadoghq.com>
Co-authored-by: Federico Mon <federico.mon@datadoghq.com>
Co-authored-by: Emmett Butler <723615+emmettbutler@users.noreply.github.com>
(cherry picked from commit fffab01)

…thout any spans [backport 2.8] (#9574)

Backport fffab01 from #9349 to 2.8.

Co-authored-by: Romain Komorn <136473744+romainkomorndatadog@users.noreply.github.com>
Co-authored-by: Federico Mon <federico.mon@datadoghq.com>

…thout any spans [backport 2.10] (#9576)

Backport fffab01 from #9349 to 2.10.

Co-authored-by: Romain Komorn <136473744+romainkomorndatadog@users.noreply.github.com>

chore(internal): log a warning if partial flushes has zero spans to send

c1f25f2

romainkomorndatadog added the changelog/no-changelog label (A changelog entry is not required for this PR.) May 22, 2024

romainkomorndatadog self-assigned this May 22, 2024

romainkomorndatadog requested a review from a team as a code owner May 22, 2024 15:49

romainkomorndatadog requested review from erikayasuda and brettlangdon May 22, 2024 15:49

romainkomorndatadog added backport 2.7 backport 2.8 backport 2.9 labels May 22, 2024

romainkomorndatadog mentioned this pull request May 22, 2024

fix(ci_visibility): use default tracer in CI Visibility (#9328) #9350

Merged

17 tasks

brettlangdon reviewed May 22, 2024

ddtrace/_trace/processor/__init__.py

brettlangdon reviewed May 22, 2024

ddtrace/_trace/processor/__init__.py

add test

005a279

romainkomorndatadog changed the title from "chore(internal): log a warning if partial flushes has zero spans to send" to "fix(tracing): do not raise exception if partial flush is triggered without any spans" May 22, 2024

relnote

76271f6

romainkomorndatadog requested a review from a team as a code owner May 22, 2024 17:08

romainkomorndatadog requested a review from P403n1x87 May 22, 2024 17:08

romainkomorndatadog added 2 commits May 22, 2024 18:13

force test to run with partial spans

90d5346

comment to force test failure

8bd0aaf

brettlangdon reviewed May 22, 2024

romainkomorndatadog and others added 5 commits May 23, 2024 09:12

Update releasenotes/notes/fix-tracing-dont_raise_exception_on_empty_partial_flush-131cd3268101f255.yaml

Co-authored-by: Brett Langdon <brett.langdon@datadoghq.com>

e38af19

stash

4c430ca

fix test

f2a9bc5

Merge branch 'romain.komorn/AIT-10242/dont_crash_when_spans_list_empty' of github.com:DataDog/dd-trace-py into romain.komorn/AIT-10242/dont_crash_when_spans_list_empty

c2d6fe0

better log message

fbf212e

romainkomorndatadog requested a review from brettlangdon May 23, 2024 10:35

brettlangdon reviewed May 23, 2024

ddtrace/_trace/processor/__init__.py

romainkomorndatadog added 5 commits June 17, 2024 10:12

comment out originally commented line

4f73a8a

Merge branch 'romain.komorn/AIT-10242/dont_crash_when_spans_list_empty' of github.com:DataDog/dd-trace-py into romain.komorn/AIT-10242/dont_crash_when_spans_list_empty

0467e22

revert os.fork instrumentation changes

d7434ef

revert changes to re-add trace on new traces found

14c1cc4

return test to initial PR plan

07649c9

romainkomorndatadog removed request for juanjux and christophe-papazian June 17, 2024 14:11

remove duplicate test

a5bb63b

emmettbutler approved these changes Jun 17, 2024

romainkomorndatadog and others added 3 commits June 17, 2024 16:05

Merge branch 'main' into romain.komorn/AIT-10242/dont_crash_when_spans_list_empty

a86c5ad

switch logging levels to debug, telemetry to 'ERROR'

5120910

Merge branch 'main' into romain.komorn/AIT-10242/dont_crash_when_spans_list_empty

1193055

romainkomorndatadog enabled auto-merge (squash) June 18, 2024 07:27

Merge branch 'main' into romain.komorn/AIT-10242/dont_crash_when_spans_list_empty

a49062d

gnufede approved these changes Jun 18, 2024

juanjux approved these changes Jun 18, 2024

romainkomorndatadog merged commit fffab01 into main Jun 18, 2024
225 of 230 checks passed

romainkomorndatadog deleted the romain.komorn/AIT-10242/dont_crash_when_spans_list_empty branch June 18, 2024 09:11

github-actions bot mentioned this pull request Jun 18, 2024

fix(tracing): do not raise exception if partial flush is triggered without any spans [backport 2.8] #9574

Merged

17 tasks

github-actions bot mentioned this pull request Jun 18, 2024

fix(tracing): do not raise exception if partial flush is triggered without any spans [backport 2.9] #9575

Open

17 tasks

github-actions bot mentioned this pull request Jun 18, 2024

fix(tracing): do not raise exception if partial flush is triggered without any spans [backport 2.10] #9576

Merged

17 tasks

romainkomorndatadog mentioned this pull request Jun 18, 2024

fix(tracing): do not raise exception if partial flush is triggered without any spans [backport 2.7] #9587

Open

17 tasks
