-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bouncer giving out down test-helpers for ~16 hours #377
Comments
Another option is to take RUM route: count volume or percentage of erratic measurements coming to the collector. |
So it turned out that actually the httpth was also wrongly configured on the bouncer and we were giving out an address of a down service for that until ~21:00 2019-10-26 UTC. |
For people accessing this incident issue from the internet, the incident has been fully resolved since Saturday evening. The incident affects measurements from the following time period: 2019-10-24 16:00 UTC - 2019-10-26 23:00 UTC. We keep the issue ticket open until we have taken all the steps we consider necessary to prevent it re-occurring in the future. |
The way to check if the measurement you are looking at is a false positive is:
|
As part of #356 the echo and http test helpers were redeployed to new hosts (
mia-echoth.ooni.nu
&mia-httpth.ooni.nu
) however the bouncer was not updated to reflect these changes.The old hosts which had a different IP (
37.218.247.110
for echo &37.218.247.95
for http) were shutdown at around 19:00 UTC+2.Detection: alert from user
Timeline:
~17:00 UTC 24-10-2019 the old test helper hosts are shutdown
03:00 UTC 25-10-2019 a user informs us via twitter of this issue
09:00 UTC 25-10-2019 the bouncer is updated with the IPs of the new services
What went well:
What went wrong:
What we should do to prevent it happening the future:
The text was updated successfully, but these errors were encountered: