feat: telemetry, error tracking, CLI & config manager #538

AyushExel · 2023-10-05T19:58:51Z

No description provided.

Conditions required to send errors (ALL conditions must be met or no errors will be reported):
- `sentry_sdk` package is installed (Maybe we make it a dependency?)
- `sync=True` in settings
- pytest is not running
- running in a pip package installation
- running in a non-git directory
- online environment

CLI config commands are needed to allow the user to be able to edit them, but maybe it'll be too large for this PR.

TODOs:
- [x] remove verbose sentry log msg (next PR)
- [x] needs `lancedb.__version__` to tag the version of the error origin

---------

Co-authored-by: Lance Release <lance-dev@lancedb.com>
Co-authored-by: Rob Meng <rob.xu.meng@gmail.com>
Co-authored-by: Will Jones <willjones127@gmail.com>
Co-authored-by: Chang She <759245+changhiskhan@users.noreply.github.com>
Co-authored-by: rmeng <rob@lancedb.com>
Co-authored-by: Chang She <chang@lancedb.com>
Co-authored-by: Rok Mihevc <rok@mihevc.org>
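For readers skimming the thread, the all-or-nothing gate on error reporting could be sketched roughly as below. This is not the code in this PR: `ReportingEnv` and `should_report_errors` are hypothetical names, and how each flag is actually detected (pip install, git directory, connectivity) is deliberately left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ReportingEnv:
    """Hypothetical snapshot of the checks listed in the PR description."""

    sentry_sdk_installed: bool
    sync: bool
    pytest_running: bool
    pip_install: bool
    git_directory: bool
    online: bool


def should_report_errors(env: ReportingEnv) -> bool:
    """Return True only when every listed condition holds."""
    return all(
        (
            env.sentry_sdk_installed,
            env.sync,
            not env.pytest_running,  # never report from test runs
            env.pip_install,
            not env.git_directory,  # skip development checkouts
            env.online,
        )
    )
```

Keeping the decision a pure function of already-detected flags makes it easy to unit test each condition without touching the network or the filesystem.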

Depends on
- #492

Current approach:
* All events are batched and sent once each time the `rate_limit` interval elapses. Events past the `max_events` limit are dropped within each `rate_limit` window. This means we're not capturing exact usage, so we'll need to compare relative usage.
* Currently `rate_limit` is set to 60 seconds, meaning 1 request is made every 60 seconds. `max_events` is set to 25, which means at most 25 events are captured per window and anything past that is dropped. These numbers will need to be tuned according to our needs.

Introduced Events class to track events without disrupting any workflow. Allows setting rate limits.

EDIT:
- [ ] Ohh need to turn on sentry integration too
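A minimal sketch of the batching behaviour described above, assuming an injectable `send` callable and clock. The class name `EventBatcher` and its attributes are illustrative only and are not the `Events` class added in this PR.

```python
import time
from typing import Callable, Dict, List, Optional


class EventBatcher:
    """Hypothetical rate-limited batcher: buffer events, flush once per window."""

    def __init__(
        self,
        send: Callable[[List[Dict]], None],
        rate_limit: float = 60.0,  # seconds between flushes
        max_events: int = 25,  # events kept per window; extras are dropped
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.send = send
        self.rate_limit = rate_limit
        self.max_events = max_events
        self.clock = clock
        self.events: List[Dict] = []
        self.window_start = clock()

    def __call__(self, name: str, properties: Optional[Dict] = None) -> None:
        try:
            # Keep the event only while the current window has room.
            if len(self.events) < self.max_events:
                self.events.append({"event": name, "properties": properties or {}})
            now = self.clock()
            # Flush once the window has elapsed, then start a new window.
            if now - self.window_start >= self.rate_limit:
                self.send(self.events)
                self.events = []
                self.window_start = now
        except Exception:
            # Telemetry must never disrupt the caller's workflow.
            pass
```

Because events beyond `max_events` are silently dropped, counts from such a batcher are only useful for relative comparisons, which matches the caveat in the description.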

Usage:
- `lancedb`
- `lancedb --help`
- `lancedb diagnostics --enabled`
- `lancedb diagnostics --disabled`
- `lancedb config`

python/lancedb/cli/cli.py

docs/src/cli_config.md

python/lancedb/utils/config.py

changhiskhan

just realized there are no tests for these.
could you add tests when you get a chance?

for click: https://click.palletsprojects.com/en/8.1.x/testing/
for sentry/posthog: use pytest-mock

you'll need to have some option to turn on diagnostics during CI, but it should only send to the mock endpoint, and then you can assert the event attributes are what you expect.
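As a rough illustration of the mock-endpoint idea, here is a sketch using the standard library's `unittest.mock` (which pytest-mock wraps) rather than pytest-mock itself. The sender `post_events`, the helper `capture_posted_payload`, the URL, and the event name are all made up for the example and are not part of lancedb.

```python
import json
import urllib.request
from unittest import mock


def post_events(url: str, payload: dict) -> int:
    """Hypothetical sender: POST the payload as JSON and return the HTTP status."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=5) as response:
        return response.status


def capture_posted_payload(payload: dict):
    """Send through a patched urlopen so nothing leaves the machine."""
    with mock.patch("urllib.request.urlopen") as fake_urlopen:
        # The sender uses urlopen as a context manager, so configure __enter__.
        fake_urlopen.return_value.__enter__.return_value.status = 200
        status = post_events("https://telemetry.invalid/batch", payload)
    sent_request = fake_urlopen.call_args.args[0]
    return status, json.loads(sent_request.data)
```

A test can then assert on the decoded body, for example that each event in the batch carries the expected name and properties.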

AyushExel · 2023-10-06T08:07:08Z

python/tests/test_telemetry.py

+    # TODO: don't hardcode these here. Instead create a module level json schema in lancedb.utils.events for better evolvability
+    batch_keys = ["api_key", "distinct_id", "batch"]
+    event_keys = ["event", "properties", "timestamp", "distinct_id"]
+    property_keys = ["cli", "install", "platforms", "version", "session_id", "blud"]


Added loose tests here for now. Ideally we could hardcode the JSON schema as a global var and assert against it directly here, but I don't want to edit the already reviewed files, so I'll do that in another PR.
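One possible shape for the follow-up mentioned here, sketched under the assumption that the required keys live in module-level constants. `BATCH_KEYS`, `EVENT_KEYS`, and `missing_keys` are hypothetical names; only the batch-level and event-level keys from the snippet above are covered, and the property keys are left out.

```python
from typing import Dict, List

# Hypothetical module-level schema, mirroring the keys in the test snippet.
BATCH_KEYS = {"api_key", "distinct_id", "batch"}
EVENT_KEYS = {"event", "properties", "timestamp", "distinct_id"}


def missing_keys(payload: Dict) -> List[str]:
    """Return the required keys absent from a batch payload, as dotted paths."""
    missing = sorted(BATCH_KEYS - set(payload))
    for index, event in enumerate(payload.get("batch", [])):
        for key in sorted(EVENT_KEYS - set(event)):
            missing.append(f"batch[{index}].{key}")
    return missing
```

A test would then reduce to `assert missing_keys(payload) == []`, and the schema can evolve in one place.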

And sentry can't be tested the same way, as it's integrated via the SDK and there are no manual API calls. Nothing to worry about there though: the whole sentry workflow is wrapped in an exception handler (https://github.com/lancedb/lancedb/pull/538/files#diff-603e1842b90966a1a2eb9e41f61c62a6db0fd6b3d5a71cec2f0355387c3d7eb7R33)

AyushExel · 2023-10-07T05:54:01Z

Wanted to use `responses` for requests testing, but I don't think adding a dependency for a single test would be the best idea, so I just put something together. Take a look and merge if all looks good.

AyushExel and others added 5 commits September 30, 2023 23:51

Basic click CLI setup (#528)

7a6f5c9

Usage: `lancedb`, `lancedb --help`, `lancedb diagnostics --enabled`, `lancedb diagnostics --disabled`, `lancedb config`

move metadata to top level keys

ffe2840

exception safe and don't track vars

183816c

AyushExel commented Oct 5, 2023

python/lancedb/cli/cli.py

AyushExel commented Oct 5, 2023

docs/src/cli_config.md

AyushExel commented Oct 5, 2023

python/lancedb/utils/config.py

AyushExel commented Oct 5, 2023

python/lancedb/utils/config.py

AyushExel added 4 commits October 6, 2023 01:36

black

e1fe449

Merge branch 'main' of https://github.com/lancedb/lancedb into ayush/…

bbb8388

…telemetry_exp

isort

50d295a

fix docs typos

a696ed6

changhiskhan approved these changes Oct 6, 2023

AyushExel added 6 commits October 6, 2023 09:49

add cli tests

217828b

add loose tests

1a7ddb9

add copyright message

cf66021

isorttttt

228942d

black comon now

7e1ee70

spell correction

50b1dd8

AyushExel commented Oct 6, 2023

AyushExel added 3 commits October 6, 2023 17:55

note for future

304af94

make event testing robust

213ed5a

isort

3e91698

AyushExel requested a review from changhiskhan October 7, 2023 05:21

AyushExel merged commit a1377af into main Oct 8, 2023
11 checks passed

AyushExel deleted the ayush/telemetry_exp branch October 8, 2023 17:41

glenn-jocher mentioned this pull request Apr 5, 2024

Potential Violation of AGPL-3.0 License Terms in lancedb/lancedb #1197

Closed

alexkohler pushed a commit to alexkohler/lancedb that referenced this pull request Apr 20, 2024

[Rust] Fix main branch CI failure and bump arrow version (lancedb#538)

18b682a
