Metrics Parsing #39

z3z1ma · 2021-07-20T09:59:15Z

Description

Bring your metrics into revision control, bring them into your data model, sync it directly with Metabase.

dbt-metabase metrics \
  --dbt_database test \
  --dbt_path tests/fixtures/metric/ \
   --schema public \
   --metabase_host localhost:3000 \
   --metabase_user alex@... \
   --metabase_password "..." \
   --metabase_database unit_testing \
   --metabase_use_http --verbose

Expression syntax: https://www.metabase.com/docs/latest/users-guide/expressions.html

  - name: Number of Customers with Large Orders
    description: Customers who are big spenders should be tracked independently of total,
      any customer who orders over 20 AUD of jaffle is counted
    metric: countif([customer_lifetime_value] > 20)

We will parse: countif([customer_lifetime_value] > 20) into['count-where', ['>', ['field', 41, None], 20]]
The parser should be able to handle any type of expression allowing users to use the nice excel like syntax built by metabase team directly alongside your data models. They become centralized, self contained, and gain all the advantages of dbt/jinja.

Parsing Examples

Purposefully convoluted examples showing robustness and possibilities (most metrics are simple in theory with preprocessing logic in the model)

Input: Sum(case([site_dispenser_count] + 1 > 1 or [site_dispenser_count] - 1 > 1, [site_dispenser_count] + 1))
Output: ['sum', ['case', [[['or', ['>', ['+', ['field', 1, 'site_dispenser_count'], 1], 1], ['>', ['-', ['field', 1, 'site_dispenser_count'], 1], 1]], ['+', ['field', 1, 'site_dispenser_count'], 1]]]]]

Input: Sum([table.order] + [qty] * 2 + 4 + 5 - 4 + 5)
Output: ['sum', ['+', ['-', ['+', ['field', 1, 'table.order'], ['*', ['field', 1, 'qty'], 2], 4, 5], 4], 5]]

Input: SumIf([site_dispenser_count], [site_city] > "Phoenix" or [site_state] = "Arizona")
Output: ['sum-where', ['field', 1, 'site_dispenser_count'], ['or', ['>', ['field', 1, 'site_city'], '"Phoenix"'], ['=', ['field', 1, 'site_state'], '"Arizona"']]]

Input: Distinct(case([site_city] = "Phoenix", [site_panel_count])) / distinct([site_dispenser_count])
Output: ['/', ['distinct', ['case', [[['=', ['field', 1, 'site_city'], '"Phoenix"'], ['field', 1, 'site_panel_count']]]]], ['distinct', ['field', 1, 'site_dispenser_count']]]

Type of change

New feature (non-breaking change which adds functionality)
This change requires a documentation update

How Has This Been Tested?

CI test for metric propagation and synchronization

Test Configuration:

Python version: 3.8

Checklist:

My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

References #25

…ersion of dbt-metabase having improved yaml parsing. large formatting update to conform to black.

…special or semantic until ready to deprecate

…e properly used if found in metabase api response.

…synchronization, and formatting

… without depends on/test_metadata dont throw

…xpected input format to be defined in readme. otherwise automatic resolution of target field using relation test will prepend target run schema which should be fine in 95% of use cases. cases outside that can use manifest.json parsing or set fk_ref in yml.

…d ensured support for schema agnostic fk targets (schema resolved from manifest.json)

…sync will now only fail hard if timeout is explicit, otherwise default behaviour if --sync is true is to attempt sync for 30 seconds and proceed with aligning what can be aligned successfully. more formatting and a few comments for clarity of intent. also added option to pass custom cert bundle to verify.

…ranslated some args to store action based.

…usly with seemless function alongside primary artifact parser (manifest.json).

…f regex to permit catching last arg of either ref or source always being the target table. if pointing to an alias, we are collecting aliases during yml parsing to be passed to metabased client and translated to metabase table names as needed. this functionality should be unnoticed by the user but provide more resiliency as well as more user friendly outcome whilst still being very specific in our logging.

…the getter calls will just return none

…se with prev version.

…on path strings allowing relative paths for --dbt_path or --dbt_manifest_path simplified

…ntees us `schema.table` format. This allows us to guarantee the incorrectly formatted ref (which should be `schema.table`) is originating from yml. Log the warning and infer correct schema for our users using target schema which covers the 90% use case.

…ial types

…ing as empty list in function call

…nit__ to nonemptystr class, cleaned some logging calls to use lazy interpolation

…r metric testing

…ync opts

remigabillet · 2021-09-20T13:13:33Z

hey @z3z1ma What's the latest on this PR btw? I think this feature is HUGE! I'd love to see it land.

z3z1ma · 2021-09-20T16:35:10Z

hey @z3z1ma What's the latest on this PR btw? I think this feature is HUGE! I'd love to see it land.

Hey @remigabillet

Yeah totally. I think I just need to rebase on the latest stable master branch and put in a couple unit tests. It's actually not much left to do at all provided there aren't too many conflicts. The metric parser is in its own module so I expect it to be easy. Let me run this down so we get a branch in the main repo we can use until goulline is ready to merge it to an RC. I'll handle this, this week. It is indeed a superpowered feature.

remigabillet · 2021-09-20T17:34:44Z

Sounds great @z3z1ma. Ping me when it's ready for review.

z3z1ma · 2021-10-03T21:14:47Z

closing this in favor of #66 which is the same PR but from a fresh branch

falador_wiz1 and others added 30 commits June 12, 2021 14:06

Add .idea to gitignore

c4ab897

merge changes from read-from-artifacts fork integrating with latest v…

5a463c7

…ersion of dbt-metabase having improved yaml parsing. large formatting update to conform to black.

fix grammatical error and update readme to say semantic type

c2684ab

fix error in field lookup key setting and continue to support either …

cf85f1e

…special or semantic until ready to deprecate

internally use semantic but support meta refs to special which will b…

2d1d7e0

…e properly used if found in metabase api response.

safe importing of dependent modules in addition to bugfixes, logical …

ef7fd20

…synchronization, and formatting

added more verbose comments tracking intent as well as ensuring nodes…

24e05e7

… without depends on/test_metadata dont throw

setting fields to PK type is worthy of info logging

ecb6458

added debug log for validating parsed schema/fields for fk targets an…

3248209

…d ensured support for schema agnostic fk targets (schema resolved from manifest.json)

corrected typo in exclude var and added support for verbosity flag. t…

4a1bf2e

…ranslated some args to store action based.

updates to handle aliases when ran via dbt_path (yml parser) ubiquito…

c201431

…usly with seemless function alongside primary artifact parser (manifest.json).

a blank dict attribute is okay here since we know our refs our clean …

c12cd2b

…the getter calls will just return none

correct referencing of semantic type and not special

fb6070c

arg var renamed back to --database to ensure seamless compatibility/u…

b550424

…se with prev version.

improve typing ensuring python 3.6+ compatibility, expanduser called …

c4022ac

…on path strings allowing relative paths for --dbt_path or --dbt_manifest_path simplified

explicit Any type hint for consistency

a78a93b

use mapping for column and express last bit of typing for bool args

35e5eff

re added clarification on semantic types being formerly known as spec…

1d9fdd4

…ial types

Following best practices, declare default args for lists as none sett…

cce2f8e

…ing as empty list in function call

docstrings to reflect default arg is None

fae0cf5

Merge remote-tracking branch 'upstream/master'

e8c99a1

updates to ensure use of warning instead of warn on logger, added __i…

a7c113e

…nit__ to nonemptystr class, cleaned some logging calls to use lazy interpolation

simplified typing and ensure type tests pass

cbfc841

more typing updates

bcb75fb

added mutablemapping types

c10d482

explicit type for reader as being either manifest or yml

0837a5d

Falador_wiz1 added 19 commits July 22, 2021 11:21

Merge branch 'exposures' into metrics

790acb4

use table id + metric name for uniqueness, typing linting updates

6d7f9f4

logging error to warn since we have an acceptable default

de3b87b

more verbose logging

28a1a5d

move extra args to debug

61cb4bf

update to handle countif and count properly and tested with both

217483e

fixed jaffle shop mismatch from sql file to yaml and added fixture fo…

c60a913

…r metric testing

added type ignore to pyparsing since it is not typed and no stubs

f5b7d5f

fix test to match updated yaml which was updated to match model sql

83baf90

close files using context when loading streams

389e012

replace print with logging

b04cbeb

small spacing adjustment to emphasize metrics

55f70f7

logging message more verbose

44d46f3

filter tests

02de9dc

support filters as keys and boolean expressions when detected

7850268

filter support

8926e8f

slight change to support capturing orphaned metrics for future full s…

2ba1f55

…ync opts

remove uneeded sign op precedence since scalars can simply be negative

24e20b9

add coalesce to expanded grammar since it follows suite

e9fe026

gouline added this to the v0.9.x milestone Jul 28, 2021

z3z1ma and others added 4 commits August 1, 2021 11:56

Merge branch 'master' into metrics

83b8119

re-add metrics func, remove uneeded log

e364672

typing update and remove double import from merge

4fbc965

fic docs arg from merge

59810ac

z3z1ma closed this Oct 3, 2021

z3z1ma mentioned this pull request Oct 3, 2021

feat: Metric Parsing #66

Closed

10 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Metrics Parsing #39

Metrics Parsing #39

z3z1ma commented Jul 20, 2021 •

edited

remigabillet commented Sep 20, 2021

z3z1ma commented Sep 20, 2021 •

edited

remigabillet commented Sep 20, 2021

z3z1ma commented Oct 3, 2021

Metrics Parsing #39

Metrics Parsing #39

Conversation

z3z1ma commented Jul 20, 2021 • edited

Description

Parsing Examples

Type of change

How Has This Been Tested?

Checklist:

remigabillet commented Sep 20, 2021

z3z1ma commented Sep 20, 2021 • edited

remigabillet commented Sep 20, 2021

z3z1ma commented Oct 3, 2021

z3z1ma commented Jul 20, 2021 •

edited

z3z1ma commented Sep 20, 2021 •

edited