
Usage telemetry of Hamilton features #248

Closed
skrawcz opened this issue Dec 16, 2022 · 6 comments · Fixed by #255

Comments

@skrawcz
Collaborator

skrawcz commented Dec 16, 2022

Is your feature request related to a problem? Please describe.
To better serve the Hamilton community, finer-grained usage metrics would be very helpful.

In the project's current state, we don't know anything about usage of the feature set that Hamilton offers, other than what people ask in the Slack help channel.

It would be great to know what is really being used, e.g. which decorators, which experimental modules, etc.
That way when deciding on future improvements and adjustments we could:

  1. Make an informed decision as to how likely a change is to impact the community.
  2. Understand the impact of new feature additions and adoption.
  3. Understand when features should move on from being experimental.
  4. Understand how quickly people adjust and upgrade their Hamilton versions.
  5. Understand where people encounter the most errors -- and help improve documentation and/or error messages.

Describe the solution you'd like
It would be great to know in an anonymous fashion:

  1. Provide the ability to opt out of sending any tracking information.
  2. What decorators are used in a Hamilton DAG definition.
  3. What graph adapters are used.
  4. How many functions comprise a DAG & what are the in/out edge counts.
  5. Python version
  6. Operating system type
  7. Operating system version
  8. Source of errors at DAG construction time, i.e. which part of the Hamilton code base is throwing it -- ideally down to the line of Hamilton code that caused it.
  9. Source of errors at DAG execution time -- is it user code, or Hamilton code.

Of course we'd have an explicit policy on its usage, and make it clear to users how to opt out.
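To make the shape of this concrete, here's a rough sketch of what an anonymized event payload and opt-out check could look like (the HAMILTON_TELEMETRY_ENABLED variable and the field names are just placeholders, not a final design):

```python
import os
import platform
import sys
import uuid


def telemetry_enabled() -> bool:
    # Placeholder opt-out: setting the env variable to "false" disables all tracking.
    return os.environ.get("HAMILTON_TELEMETRY_ENABLED", "true").lower() != "false"


def build_event(decorators: list, graph_adapter: str, num_functions: int) -> dict:
    # Only anonymous, aggregate information: no function names, IPs, or user data.
    return {
        "event_id": str(uuid.uuid4()),
        "decorators_used": sorted(set(decorators)),  # e.g. ["config.when", "parametrize"]
        "graph_adapter": graph_adapter,              # adapter class name only
        "num_functions": num_functions,              # DAG size, not its shape
        "python_version": platform.python_version(),
        "os_type": sys.platform,
        "os_version": platform.release(),
    }
```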

Describe alternatives you've considered
N/A

Additional context
Telemetry usage tracking is becoming more standard in open source. It helps maintainers better serve the community.

E.g. data-diff does this -- see their tracking code and privacy policy.

@elijahbenizzy
Collaborator

elijahbenizzy commented Dec 16, 2022

I think we want to be very clear about what not to include, although most of this is implied above:

(1) IP Address, anything identifying (implied by anonymous)
(2) function names
(3) Any information about return sizes
(4) Any information about graph shape (other than size)

Another Q is how we disambiguate users anonymously -- one option is to include a tracking ID as an env variable (generate a token), but I think that's too high a lift.
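For reference, the option I mean would look something like this (HAMILTON_TELEMETRY_ID is a made-up name): respect an opaque token if the user sets one, otherwise fall back to a random per-run ID so nothing is derived from the machine:

```python
import os
import uuid


def anonymous_client_id() -> str:
    # Made-up env variable: an opaque user-supplied token if present,
    # otherwise a random per-run ID -- nothing machine- or user-identifying.
    return os.environ.get("HAMILTON_TELEMETRY_ID", str(uuid.uuid4()))
```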

@elshize

elshize commented Dec 20, 2022

I personally have no issue with the idea of telemetry, though as you both mentioned, it is crucial to (1) be clear and transparent as to what is collected and how, and (2) provide an easy way to disable it altogether.

Another thing to consider is that certain environments where Hamilton is deployed may simply not allow sending anything outside of the runtime environment. By allow I mean things like firewalls rather than policies. If that is the case, it should not affect overall functionality (no crash, significant slowdown, or anything like that). I'm sure you've already thought of that, but it doesn't hurt to mention it...

@elijahbenizzy
Collaborator

> I personally have no issue with the idea of telemetry, though as you both mentioned, it is crucial to (1) be clear and transparent as to what is collected and how, and (2) provide an easy way to disable it altogether.
>
> Another thing to consider is that certain environments where Hamilton is deployed may simply not allow sending anything outside of the runtime environment. By allow I mean things like firewalls rather than policies. If that is the case, it should not affect overall functionality (no crash, significant slowdown, or anything like that). I'm sure you've already thought of that, but it doesn't hurt to mention it...

Yeah I think that's a great call -- specifically adding another requirement:

  • errors = log, not failure
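
i.e. something along these lines, where the actual send (whatever it ends up being) gets wrapped so a blocked firewall or outage only shows up as a debug log:

```python
import logging
from typing import Callable

logger = logging.getLogger(__name__)


def safe_track(event: dict, send: Callable[[dict], None]) -> None:
    """Send a telemetry event; never let a failure propagate to user code."""
    try:
        send(event)  # e.g. an HTTP POST with a short timeout
    except Exception as e:
        # errors = log, not failure
        logger.debug("Telemetry send failed: %s", e)
```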

@elshize

elshize commented Dec 21, 2022

> errors = log, not failure

Yep, it would be sad if I pushed an update to production and it failed because it can't send metrics. It wouldn't actually happen to me because my staging env would fail first, but you get the point.

I also can't be the only person in this situation, which is something to think about when analyzing the data. It could be that a big chunk of it will come from development/local environments and not necessarily be fully representative of what is run in production.

It could be a good idea to write all telemetry to a file in case sending fails, and maybe provide a simple way to submit that manually. I honestly don't know if that's worth the effort or not; I imagine not many people would do that, and if they did, it would probably be a one-time thing, but maybe it would provide some useful information to y'all...
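Something like this is what I have in mind (the ~/.hamilton/ path is just a guess on my part, not an existing convention):

```python
import json
import logging
from pathlib import Path

logger = logging.getLogger(__name__)


def spool_event_locally(event: dict) -> None:
    """Append an unsent telemetry event to a local file for optional manual submission later."""
    try:
        spool_dir = Path.home() / ".hamilton"  # assumed location
        spool_dir.mkdir(parents=True, exist_ok=True)
        with open(spool_dir / "unsent_telemetry.jsonl", "a") as f:
            f.write(json.dumps(event) + "\n")
    except Exception as e:
        # Even the fallback should never fail the user's run.
        logger.debug("Could not spool telemetry event: %s", e)
```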

I have never tried to collect telemetry in a similar scenario (I work on an internal solution, so telemetry is a different, simpler story), so I have no idea if any of what I say makes sense, but just throwing my thoughts out there :)

@skrawcz linked a pull request Dec 21, 2022 that will close this issue
@skrawcz
Collaborator Author

skrawcz commented Dec 21, 2022

Started a draft PR (#255) to sketch out some of what has been discussed here.

@skrawcz self-assigned this Dec 21, 2022
@skrawcz
Collaborator Author

skrawcz commented Dec 27, 2022

PR is up for those interested, with tests and all - #255.
