Enable person breakdowns querying for all ordering funnels #5043

neilkakkar · 2021-07-08T14:29:04Z

Changes

Resolves #5030

Checklist

All querysets/queries filter by Organization, by Team, and by User
Django backend tests
Jest frontend tests
Cypress end-to-end tests
Migrations are safe to run at scale (e.g. PostHog Cloud) – present proof if not obvious
New/changed UI is decent on smartphones (viewport width around 360px)

neilkakkar · 2021-07-08T14:29:50Z

ee/clickhouse/queries/funnels/test/test_funnel.py

                    timestamp="2020-01-01T13:00:00Z",
                )
                _create_event(
                    team=self.team,
                    event="buy",
                    distinct_id=f"person_{num}_{i}",
-                    properties={"key": "val", "some_breakdown_val": f"{num}"},
+                    properties={"key": "val", "some_breakdown_val": num},


This enables testing for both, numeric and string values. Did a quick search on Metabase, and not everyone sends strings always

EDsCODE

Are you expecting the breakdown to contain double quotes now or did you handle it somewhere that I'm missing?

EDsCODE · 2021-07-09T17:39:08Z

ee/clickhouse/queries/funnels/funnel.py

@@ -66,7 +64,8 @@ def _format_single_funnel(self, result, with_breakdown=False):
                serialized_result.update({"average_conversion_time": None})

            if with_breakdown:
-                serialized_result.update({"breakdown": result[-1][1:-1]})  # strip quotes
+                serialized_result.update({"breakdown": result[-1]})


hmm, do you not handle the double quotes now when the breakdown val is a string?

I don't handle them (so the tests have double quotes now). The reason being: with JSONExtractRaw used everywhere, the keys on which we filter props have double quotes when they're strings, and they are strings when they're input as numbers. I attempted playing with this extraction such that we get no quotes in our keys, but that turned out to be very annoying when you don't know your data type apriori. (I have a test for this now, where the values of a property are both strings and integers)

Us removing double quotes ourselves has implications: the front-end has to guess whether the key had double quotes or not when they try to get the persons for this breakdown, which makes things hard.

Hence, I propose: we don't modify the keys at all. If we want to change the presentation, we let the front-end strip the quotes, if they exist. But, when asking for persons for a breakdown, we expect the front-end to send the specific key which we sent ourselves, sans modification.

I think this approach makes things cleaner.

Thoughts? Also cc: @liyiy @samwinslow (or whoever is going to implement the frontend bit 😅 )

To give a concrete example of JSONExtractRaw:

SELECT JSONExtractRaw('{"a": "hello", "b": [-100, 200.0, 300], "c": "chrome", "d": 12}', 'c');

returns the string "Chrome"

SELECT JSONExtractRaw('{"a": "hello", "b": [-100, 200.0, 300], "c": "chrome", "d": 12}', 'd');

returns the string 12.

we expect the front-end to send the specific key which we sent ourselves, sans modification.

Do you mean with or without quotes here then? I think sending with quotes would start getting confusing especially since it's inconsistent with the rest of our app

Otherwise, I would agree as long as it's handled somewhere and stays consistent with how we're displaying the breakdown values elsewhere

So, I'm not talking about how we display the values at all, just about how we should pass them to the front-end.

To display the problem with stripping quotes more concretely:

SELECT JSONExtractRaw('{"a": "1", "b": 1}', 'a'); -- returns the string "1" SELECT JSONExtractRaw('{"a": "1", "b": 1}', 'b'); -- returns the string 1

In effect, we can't arbitrarily strip quotes, because it changes the key.

I'm all for the front-end removing the quotes, if they exist, to display breakdown values without quotes, but I don't think we can do this in the backend, since it changes our filter keys for props :$ '"1"' != '1'

AH! I see, there's the property filtering, which removes this from JSONExtractRaw: https://github.com/PostHog/posthog/blob/master/ee/clickhouse/models/property.py#L89

I guess we just need to follow this everywhere for breakdowns as well, and we should be good to go (we don't yet). I'll fix this like so instead then!

SELECT trim(BOTH '\"' FROM JSONExtractRaw('{"a": "1", "b": 1}', 'a')); is same as SELECT trim(BOTH '\"' FROM JSONExtractRaw('{"a": "1", "b": 1}', 'b'));

So we (will be) good.

This makes sense to me. I mean, we could actually convert to the appropriate type in the backend really, but in a smarter way that woud result in '"1"' → '1' (string), '1' → 1 (number).

I actually tried that with JSONExtract('{"a": "1", "b": 1}', 'a', <type>) - but it didn't let me infer the types using type inference as well :$ - so annoying.

I.e. JSONExtract('{"a": "1", "b": 1}', 'a', JSONType('{"a": "1", "b": 1}', 'a')) doesn't work 😢

Yeah, this'd have to happen in Django at earliest

EDsCODE

lgtm

Twixes

Code looks good to me but 55dde4c broke tests

neilkakkar · 2021-07-12T16:29:32Z

I resolved this for trends + funnels both. Some existing tests were reliant on internal sorting, so I've added an explicit sort on top of this to ensure there's no test failures here.

I don't think the order in the list means anything with breakdowns, we don't sort it based on anything, but let me know if the tests were testing for something I'm not aware of.

You can see these specific changes here: b74ba00

neilkakkar added 8 commits July 7, 2021 16:07

first pass mix and match everything

833e4c3

resolve merge conflicts

1a083dd

cleanup

396cd8e

resolve merge conflicts

510b02f

small refactoring of get_query

5c8ae7b

remove top level filter props

524bfdc

save the file

69f5372

enable person breakdowns querying for regular + strict funnels

a33bf5e

neilkakkar commented Jul 8, 2021

View reviewed changes

timgl temporarily deployed to posthog-pr-5043 July 8, 2021 14:31 Inactive

neilkakkar mentioned this pull request Jul 8, 2021

Mix and Match ordering and visualization of funnels #5031

Merged

6 tasks

Base automatically changed from refactor to master July 8, 2021 17:25

resolve merge conflicts

8bbb981

timgl temporarily deployed to posthog-pr-5043 July 9, 2021 10:32 Inactive

neilkakkar added 2 commits July 9, 2021 11:35

move mixin testing to more appropriate location

d9607f1

add unordered, testing quality of life improvements

0ed31e2

timgl temporarily deployed to posthog-pr-5043 July 9, 2021 11:53 Inactive

clean up

253a51d

timgl temporarily deployed to posthog-pr-5043 July 9, 2021 12:01 Inactive

neilkakkar marked this pull request as ready for review July 9, 2021 12:02

neilkakkar requested review from Twixes and EDsCODE July 9, 2021 12:02

neilkakkar changed the title ~~Enable person breakdowns querying for regular + strict funnels~~ Enable person breakdowns querying for all ordering funnels Jul 9, 2021

add strict+unordered connectivity test for time to convert

18c9b46

timgl temporarily deployed to posthog-pr-5043 July 9, 2021 12:23 Inactive

EDsCODE reviewed Jul 9, 2021

View reviewed changes

This was referenced Jul 9, 2021

Funnel cohort breakdown #5053

Merged

Sprint 1.27.0 3/2 - Jul 5 to Jul 16 (Funnels #2) #4968

Closed

neilkakkar mentioned this pull request Jul 12, 2021

Run broken down analyses in one SQL query like others #5072

Closed

2 tasks

trim after extracting

55dde4c

timgl temporarily deployed to posthog-pr-5043 July 12, 2021 14:43 Inactive

EDsCODE approved these changes Jul 12, 2021

View reviewed changes

Twixes reviewed Jul 12, 2021

View reviewed changes

clean up, fix tests, trim everywhere

b74ba00

timgl temporarily deployed to posthog-pr-5043 July 12, 2021 16:28 Inactive

Twixes merged commit c554bb5 into master Jul 12, 2021

Twixes deleted the persons_and_breakdowns branch July 12, 2021 16:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable person breakdowns querying for all ordering funnels #5043

Enable person breakdowns querying for all ordering funnels #5043

neilkakkar commented Jul 8, 2021

neilkakkar Jul 8, 2021

EDsCODE left a comment

EDsCODE Jul 9, 2021

neilkakkar Jul 12, 2021

neilkakkar Jul 12, 2021

EDsCODE Jul 12, 2021

neilkakkar Jul 12, 2021 •

edited

Loading

neilkakkar Jul 12, 2021

neilkakkar Jul 12, 2021

Twixes Jul 12, 2021

neilkakkar Jul 12, 2021

Twixes Jul 12, 2021

EDsCODE left a comment

Twixes left a comment

neilkakkar commented Jul 12, 2021 •

edited

Loading

Enable person breakdowns querying for all ordering funnels #5043

Enable person breakdowns querying for all ordering funnels #5043

Conversation

neilkakkar commented Jul 8, 2021

Changes

Checklist

Choose a reason for hiding this comment

EDsCODE left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

neilkakkar Jul 12, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

EDsCODE left a comment

Choose a reason for hiding this comment

Twixes left a comment

Choose a reason for hiding this comment

neilkakkar commented Jul 12, 2021 • edited Loading

neilkakkar Jul 12, 2021 •

edited

Loading

neilkakkar commented Jul 12, 2021 •

edited

Loading