-
-
Notifications
You must be signed in to change notification settings - Fork 5.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Can/should we add privacy-respecting usage metrics
?
#4456
Comments
I personally fully support your idea, as long as it's effectively anonymized, and look forward to others' opinions. |
I think that collecting data anonymously on the usage of Uptime Kuma is a good idea. |
Yes, the idea is undeniably good and would be of great benefit to the developers of the project. From privacy or ethical point of view, I think that the end user should be given the right to choose whether this feature is active or inactive, although the data is anonymous. |
Hard requirement: any data collection must be opt-in only.
Perfect, but focus also on the "informed" bit: exactly what data is collected, for what purpose, and how is it used?
I agree with this requirement, but it's not well supported by your summary.
For each of these supposed benefits, you need to make a clearer case in order to justify monitoring. "It would be interesting" is generally not good enough. For example:
Positing a particular kind of metric you want to track like this is a good start, but expand on it:
|
Also be aware that adding these features is going to subject you to the GDPR. You will have to comply with it, which means things like having a publicly accessible data protection officer. |
@ddevault You are correct, I think a public site explaining with the following content will be necessary
As for tooling, a tool like divviup by the ISRG might be a good choice.
Actually, the GDPR only covers personally identifying data. Since I do not intend to ever store such data, such a data protection policy is simple. I have written them before and can do that again. As for the duration of collection, I would say this depends entirely on how our users' upgrade behaviour (likely different between major - minor - patch updates) works, as I think only via updates new metrics can be introduced or clientside-disabled. |
privacy-respecting usage metrics
?
This comment was marked as spam.
This comment was marked as spam.
I am with @ddevault If this is implemented then at max opt-in only and with very clear defined limited scope. |
I'm also in favor of adding such telemetry as it will be very beneficial as you laid out nicely. Of course, I agree that it should be Opt-In only. Thoughts on the implementationWhen asking for the admin's consent, please let the initial message be short and concise. From personal experience: the more text there is, the more I suspect it to be because of corporate legal reasons. Therefore i suspect nothing good, am too lazy to read on and just decline.
Thoughts on evaluating the resultsPlease keep in mind that while this data might help you to prioritize bug fixes or enhancements, new features should still be considered with a reasonable high priority: you cannot measure what is not there yet. 😉 To prioritize between several new features, the number of Another thing to remember when looking at the data: an apparently low usage might not necessarily indicate little potential but might as well show opportunities for UX improvements. An example from personal experience: when I first started using Uptime Kuma, it was not very intuitive for me to find that there is a simple incident system included. I stumbled across it by accident, then did not remember how I found it in the first place and had to figure out again how I got there in the first place. The reason: you can only add an incident if you created a status page and if you are visiting said page and if you are editing it right now. Should you not meet either one of these conditions, you may never know about it, because not even the documentation mentions this feature. Therefore, in this example, a change in UX might also increase the usage of the feature and consequently any prioritization related to it. |
@louislam I am following the uptime-kuma journey since the very beginning and would be very interested to have your take on all this. |
I am 100% in the favor of adding this feature ☺ |
So this topic here has been unpinned from the main page. |
+1 for implementing such metrics. It would be useful to have some data from "extreme" users that push the limits of the app, that'd help for performance updates. |
I advice to use Opentelemetry on the demo server of Uptime Kuma or we can ship Opentelemetry inside of docker image and allow user to have full control to opt-in and opt-out for ofc privacy reason and only collects anonymous data and send that data to services like Grafana or Prometheua etc., whatever Louis choose... Where that data will be used to shape Uptime Kuma future. What's your thoughts on this @CommanderStorm @chakflying |
OTel is an
(https://opentelemetry.io/docs/what-is-opentelemetry/)
Metrics also allow de-identification => not privacy preserving. => DivviUp seems to be still the best option as fast as I can see, but I don't have the time to spend on such a project for the foreseeable future |
DiviUp only have client libraries for Rust and Typescript even if we or someone else wants to do this, how this can be done? |
The process would be:
|
So, first we can implement divviup on the Demo Website first to check how many instances are running all around the world then we can start integrating Divviup on this repo. Luckily Demo Website server writen in Typescript so i can give it a shot and start implementing on the Demo first. I started to initialize the Task first Now Have to add measurement to send data(need help in that) |
I have questions: Who is in the end collecting this data? |
Ofc, the server to collect the data is owned & maintained by Louislam himself, Only Louis has access to that data or maybe he want he can allow Frank and Nelson(apologize for mentioning name's as i don't to ping and disturb someone) to see the data and maintain the project more deeply. |
|
The reason i give u earlier is my opinion on how this can be happen things can change by time. if Louis wants it to keep tottaly transparent to his users or make it private it's his choice on it, we are not going to collect sensitive user data we will only store data such as:
He's busy completing the release of |
Zaid, I appreciate your input, but this is not about simply collecting "non-sensitive" data. It’s about ensuring the project remains transparent and aligned with its open-source principles. Whether the data is sensitive or not, storing it on a private server with access restricted to a few people goes against those values. Regarding Louis, I don’t need to be told about pinging etiquette. The reason I brought him into this discussion is because this decision could fundamentally change the direction of the project. As the founder, his input is critical before moving forward. |
Alright sounds good, P.S |
I don't care enough to fight about this. Here is a bit of input:
In case people are curious, this is how other oss project solve this (via the same tech):
|
Exactly that's what i am trying to say, Big Tech Companies never public their metrics information idk why he is keep saying the same thing anyways Good u close this as right now most important is Release of v2.0.0, Good Luck with it! |
Dear community,
I would like to discuss if adding privacy-respecting usage metrics is something we want to learn from and to inform our actions.
This discussion is based on two impulses
In the talk, he goes into why free software projects might want to collect data on how the software is used, and how this can be done in a privacy-respecting fashion.
Core requirements:
Implementing such metrics would have these concrete benefits:
v2.0
release, we focused a lot on improving performance for large (more than 500 monitors) deployments and reducing storage requirements.It would be valuable to know if our prioritisation of these features is correct ("Are we pandering to he 20% or the 80%?") to make sure that we are offering a good serivce to most of our users.
Going on wild tangents which only matter to the minority of our users might not be the best use of maintainer time (despite optimisation being fun => sometimes nessesary ^^).
This would also have downsides:
I would especially appreciate feedback from the regular contributors (I apologize for the ping) @louislam, @chakflying, @Zaid-maker, @marco-doerig, @Saibamen, @Computroniks, @MrEddX, @AnnAngela, @cyril59310, @apio-sys
PS: I know that privacy is a charged topic, but please let's keep the discussion civil ^^
The text was updated successfully, but these errors were encountered: