Revert friends-of-friends follow recommendation query to using a CTE #29619

ClearlyClaire · 2024-03-15T19:31:51Z

Also fixes ordering back to the initial intent.

Note that I have not verified whether this had better performances than the previous version. I also used subqueries instead of the usual approach of using Account.not_excluded_by_account and Account.not_domain_blocked_by_account because:

we never actually verified if that approach had better performance
it's a lot more painful to use without ActiveRecord helpers

Also fixes ordering back to the initial intent

mjankowski · 2024-03-15T19:40:42Z

On the ordering change here -- was this an oversight we missed in review? Or was the intention to change the ordering, but due to the perf issues we're going back to original ordering?

ClearlyClaire · 2024-03-15T19:45:51Z

The change of ordering in my earlier PR was an oversight of mind, I misunderstood the original ordering, and there were no tests to catch the change. I just noticed the ordering had been changed by starting from the old CTE.

mjankowski · 2024-03-15T19:57:59Z

Cool - given that, I agree that the spec changes here do recapture what the original pre-refactor ordering looked like. I'll defer to you all on the query change / perf implications / etc.

Do we want to fix specs and the refactor version (swap the order) first here? just so we have a baseline of a logically correct change? (happy to do this as quick separate PR if you'd like)
Other than playing around with local data setup and reviewing some explain/analyze combinations ... is there any way we can benchmark this better?

renchap

We tested the queries on mastodon.social and perf is similar to the previous code

ClearlyClaire · 2024-03-18T13:19:52Z

We tested the queries on mastodon.social and perf is similar to the previous code

To add to that, we tested multiple versions of the query for multiple sets of active mastodon.social users (random 400 users, 5 with very few follows, 5 with large amounts of follows and blocks) and the results were always pretty consistent: the code in this PR is roughly on par with the old query that was missing filtering, while the code before this PR was way slower (10 to 50 times slower). The code in main...mjankowski:mastodon:fix-order-in-floof-foof-fooferoni-changes was consistently ~1.5× slower than that of this PR, which could be due to some extra work instead of reusing the CTE for filtering.

Other than playing around with local data setup and reviewing some explain/analyze combinations ... is there any way we can benchmark this better?

This is a good question. I guess understanding the query plan is the way to go, but even that is specific to what's actually in the database. Also, extracting the query is a bit of a pain depending on how the query is written.

I've been toying with populating fake data, but I think we'd need to take shortcuts in generation for it to be usable.

mjankowski · 2024-03-18T14:04:48Z

One thing I suspect may have been happening with the poorly performing query is that, for the setup of first_degree, when I was reviewing the changes and thinking about what was being built - I was thinking about that aspect specifically as first grabbing a big array of IDs (which, in itself could have been expensive) before the larger query was built, but what was actually happening is that query is getting included as a subquery in the larger query, which could certainly get expensive depending on the surrounding rows/joins/etc. That said, more than one thing changed at once, so it may be multiple causes.

Before I attempt this -- are we open to repeating the style/compositional improvement of these changes, if we can preserve the current more performant query? (putting aside like, framework sql-quoting style and whatnot). That current WIP branch (that you linked) restores the composed scope approach - but adds an additional join which is not present in this restored/performant version. I suspect that contributes to the ~1.5x perf drop on that branch.

…astodon#29619)

Revert friends-of-friends follow recommendation query to using a CTE

ff60e3b

Also fixes ordering back to the initial intent

ClearlyClaire added performance Runtime performance build-image Build a container image for this PR labels Mar 15, 2024

renchap approved these changes Mar 18, 2024

View reviewed changes

ClearlyClaire added this pull request to the merge queue Mar 18, 2024

Merged via the queue into main with commit d506307 Mar 18, 2024
53 checks passed

ClearlyClaire deleted the revert-cte branch March 18, 2024 13:02

lutoma pushed a commit to ohaisocial/mastodon that referenced this pull request Mar 19, 2024

Revert friends-of-friends follow recommendation query to using a CTE (m…

69ea5c4

…astodon#29619)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revert friends-of-friends follow recommendation query to using a CTE #29619

Revert friends-of-friends follow recommendation query to using a CTE #29619

ClearlyClaire commented Mar 15, 2024 •

edited

mjankowski commented Mar 15, 2024

ClearlyClaire commented Mar 15, 2024

mjankowski commented Mar 15, 2024

renchap left a comment

ClearlyClaire commented Mar 18, 2024

mjankowski commented Mar 18, 2024

Revert friends-of-friends follow recommendation query to using a CTE #29619

Revert friends-of-friends follow recommendation query to using a CTE #29619

Conversation

ClearlyClaire commented Mar 15, 2024 • edited

mjankowski commented Mar 15, 2024

ClearlyClaire commented Mar 15, 2024

mjankowski commented Mar 15, 2024

renchap left a comment

Choose a reason for hiding this comment

ClearlyClaire commented Mar 18, 2024

mjankowski commented Mar 18, 2024

ClearlyClaire commented Mar 15, 2024 •

edited