Clean up detection sentry events + tests by RobertJoonas · Pull Request #5833 · plausible/analytics

RobertJoonas · 2025-10-27T09:30:11Z

Changes

This PR cleans up Sentry reporting for detection. Doing so, we're also changing the way we observe detection failures. Instead of "unhandled vs handled" (which makes sense in verification), a detection result is either a success or a failure. The telemetry events are also renamed accordingly in this PR.

A detection failure can be two things - an issue with the customer website, or an issue on our side. The issues on our side are further split into Browserless issues vs our own. Hence the telemetry bit is branched out quite a lot, but I think it should help us prioritize any future issues better.

Tests

Automated tests have been added

Changelog

This PR does not make a user-facing change

Documentation

This change does not need a documentation update

Dark mode

This PR does not change the UI

Otherwise, it can sometimes remain unclear in the diagnostics, whether it was InstallationV2 or InstallationV2CacheBust that timed out.

The current production logs show two types of verification timeouts: * service_error: "Unhandled Browserless response status: 408" (vast majority of cases) * service_error: :timeout (only a few cases) The latter happens when we hit the Req receive_timeout (endpoint_timeout + 2s). I've seen Browserless not respect the timeout param from time to time, so it's better to keep the timeout logic "in-house" only.

...but still consider them "unhandled" for telemetry, also notifying Sentry and logging the warning.

…llation

Also rename current liveview modules and routes, removing the v2 suffix

Also fix dockerignore and elixir.yml referencing a wrong priv path

…entry

apata · 2025-10-27T10:07:59Z

extra/lib/plausible/installation_support/detection/diagnostics.ex

+      String.contains?(extra, "net::") -> failure(:client_issue)
+      String.contains?(String.downcase(extra), "execution context") -> failure(:client_issue)


nitpick, non-blocking: This remap from :browserless_client_error to :client_issue and :unknown_issue loses some information in the process. At this point, extra is known, but once we map it to :unknown_issue, it gets thrown away and in the checks module we go to the case

_unknown_failure -> {true, true, "Unknown failure"}

It seems there's actually two codes that we want to emit from the Detection check, :browserless_client_error and :browserless_client_error_silenced

The information doesn't get lost. These error-grouping atoms are just for determining which kind of telemetry we want. In any case, if it's an interesting case, the whole diagnostics struct (e.g. %{..., service_error: %{code: :browserless_client_error, extra: "whatever message"}}, ...) is logged and captured by Sentry as well.

apata · 2025-10-27T10:10:01Z

extra/lib/plausible/installation_support/detection/diagnostics.ex

+
+  def interpret(%__MODULE__{service_error: %{code: code}}, _url)
+      when code in [:domain_not_found, :invalid_url] do
+    failure(:client_issue)


nitpick, non-blocking: this does not seem like a client issue (browser driven by Browserless.io) issue, it's invalid input issue: we don't even start the browser in these scenarios

Fair point, thanks! I've renamed it to :customer_website_issue which should make it more clear. 04942e6

RobertJoonas and others added 27 commits October 23, 2025 09:35

add module name to service_error when check times out

9745d99

Otherwise, it can sometimes remain unclear in the diagnostics, whether it was InstallationV2 or InstallationV2CacheBust that timed out.

make service_error into a map with code and extra

d24414e

interpret temporary service errors

9e4a927

...but still consider them "unhandled" for telemetry, also notifying Sentry and logging the warning.

separate sentry messages (verification)

3e9848a

make Verification.ChecksTest more DRY

5da2f6d

organize tests into describe blocks

1a193a0

test verification telemetry and logging

b6a90f4

fix codespell

623827f

get rid of legacy verification

e1b7ae2

rename Checks.InstallationV2 -> Checks.VerifyInstallation

f239090

delete Live.Installation and rename Live.InstallationV2 -> Live.Insta…

439d3b2

…llation

rename installationv2 (live) files as well

a579adc

delete old change-domain routes

3075e25

Also rename current liveview modules and routes, removing the v2 suffix

rename domain_change_v2 files, removing v2 suffix

d5f8cb0

remove legacy JS verifier code

742f669

Also fix dockerignore and elixir.yml referencing a wrong priv path

rename verification_v2_test -> verification_test

1b769cc

remove v2 prefix from logs and sentry messages

53463af

clean up duplicate external_sites_controller_test.exs tests

fb8bdf6

remove flag

a95dcb0

fix typespec

5b9c88e

pass timeout as query param to Browserless too

afd57df

Fixup external sites controller test module (#5826)

164b7de

fix test description

30469eb

Merge branch 'verification-fixes' into cleanup-scriptv2-flag

ebc786c

Merge remote-tracking branch 'origin/master' into cleanup-scriptv2-flag

4347e8a

clean up detection sentry events + tests

c2163df

RobertJoonas force-pushed the cleanup-detection-sentry branch from 1931b19 to c2163df Compare October 27, 2025 09:39

Base automatically changed from cleanup-scriptv2-flag to master October 27, 2025 09:53

Merge remote-tracking branch 'origin/master' into cleanup-detection-s…

322cc51

…entry

apata reviewed Oct 27, 2025

View reviewed changes

apata approved these changes Oct 27, 2025

View reviewed changes

improve naming

04942e6

RobertJoonas added this pull request to the merge queue Oct 27, 2025

Merged via the queue into master with commit 7540511 Oct 27, 2025
16 checks passed

RobertJoonas deleted the cleanup-detection-sentry branch October 27, 2025 10:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Clean up detection sentry events + tests#5833

Clean up detection sentry events + tests#5833
RobertJoonas merged 29 commits intomasterfrom
cleanup-detection-sentry

RobertJoonas commented Oct 27, 2025 •

edited

Loading

Uh oh!

apata Oct 27, 2025

Uh oh!

RobertJoonas Oct 27, 2025 •

edited

Loading

Uh oh!

apata Oct 27, 2025

Uh oh!

RobertJoonas Oct 27, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		String.contains?(extra, "net::") -> failure(:client_issue)
		String.contains?(String.downcase(extra), "execution context") -> failure(:client_issue)

Uh oh!

Conversation

RobertJoonas commented Oct 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Tests

Changelog

Documentation

Dark mode

Uh oh!

apata Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

RobertJoonas Oct 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

apata Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

RobertJoonas Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

RobertJoonas commented Oct 27, 2025 •

edited

Loading

RobertJoonas Oct 27, 2025 •

edited

Loading