Update user listen history query to use new indexed table #2185

isaacsolo · 2021-12-21T17:20:26Z

Description

Query user_listening_history table directly instead of plays. This will enable pruning old plays. The only difference between the response should be deduplication.

Tests

Tested against prod snapshot. Confirmed de-duplication.

Remote box pointed to prod snapshot can take from 2 seconds to 30s. About the same as master pointed to prod snapshot.. This approach avoids querying plays directly but does query track. Not sure if there's a better way.

How will this change be monitored?

Manually checking /history.

isaacsolo · 2021-12-21T22:19:19Z

discovery-provider/src/api/v1/helpers.py

@@ -371,7 +371,6 @@ def format_offset(args, max_offset=MAX_LIMIT):
 def get_default_max(value, default, max=None):
    if not isinstance(value, int):
        return default
-    elif max is None:


I think new linter asked for this change.

isaacsolo · 2021-12-21T22:21:10Z

discovery-provider/src/queries/get_track_history.py

-    # bundle peripheral info into track results
-    tracks = populate_track_metadata(session, track_ids, tracks, current_user_id)
-
-    if args.get("with_users", False):


with_users is always true and user data is already populated due to model relationship.

raymondjacobson

Did we miss a step where we need to migrate existing user history into this table?

raymondjacobson · 2022-01-03T16:04:01Z

discovery-provider/src/queries/get_user_listening_history.py

+    tracks = helpers.query_result_to_list(sorted_track_results)
+
+    # bundle peripheral info into track results
+    tracks = populate_track_metadata(session, track_ids, tracks, current_user_id)


I wonder if we can shave a round trip to the DB by just having populate_track_metadata grab the raw track themselves

Oh yeah I'll see.

Actually, I don't see a way we can avoid querying for track data. Do you see a potential optimization?

Yeah i think you're right here. In the future, probably we could rewrite populate_track_metadata to be join based rather than 2 round trips

raymondjacobson · 2022-01-03T16:05:46Z

discovery-provider/src/api/v1/users.py

-            "with_users": True,
-        }
-        track_history = get_track_history(get_tracks_args)
+        get_tracks_args = GetUserListeningHistoryArgs(


while I agree with your change (that we should be using current user id), that's not how this worked before and it makes this endpoint semantically kind of weird because you could specify user_id and it effectively does nothing.

can we keep that existing behavior and pass both current_user_id and user_id through to the queries file?

To get the behavior you added, we could just do
if (user_id != current_user_id) return [] here

Yup will do 👍

isaacsolo · 2022-01-03T17:17:59Z

Did we miss a step where we need to migrate existing user history into this table?

Like this? https://github.com/AudiusProject/audius-protocol/pull/2133/files

It's not in prod yet so I'm gonna wait until it's populated before merging this.

raymondjacobson · 2022-01-21T08:32:41Z

discovery-provider/src/queries/get_user_listening_history.py

+    tracks = helpers.query_result_to_list(sorted_track_results)
+
+    # bundle peripheral info into track results
+    tracks = populate_track_metadata(session, track_ids, tracks, current_user_id)


Yeah i think you're right here. In the future, probably we could rewrite populate_track_metadata to be join based rather than 2 round trips

pull-request-size bot added the size/L label Dec 21, 2021

isaacsolo force-pushed the is-listen-history-api branch from 9256907 to a3890cb Compare December 21, 2021 17:36

Update user listen history query to use new indexed table

fe91ef8

isaacsolo force-pushed the is-listen-history-api branch from a3890cb to fe91ef8 Compare December 21, 2021 19:14

isaacsolo commented Dec 21, 2021

View reviewed changes

isaacsolo marked this pull request as ready for review December 21, 2021 22:21

isaacsolo requested a review from raymondjacobson December 21, 2021 22:21

isaacsolo added 4 commits December 21, 2021 23:59

remove unnecessary args

d785794

revert add users to tracks

ac667b3

clean up integration test

74156fa

Add arg details

f790027

raymondjacobson reviewed Jan 3, 2022

View reviewed changes

add extra user id back

40f7fd9

pull-request-size bot added size/XL and removed size/L labels Jan 3, 2022

isaacsolo requested a review from raymondjacobson January 4, 2022 19:42

raymondjacobson approved these changes Jan 21, 2022

View reviewed changes

pull-request-size bot added size/L and removed size/XL labels Jan 21, 2022

Merge branch 'master' into is-listen-history-api

e9f9b0b

isaacsolo force-pushed the is-listen-history-api branch from 75de877 to e9f9b0b Compare January 21, 2022 18:46

isaacsolo added 2 commits January 21, 2022 19:01

sort imports

62c7290

sort test imports

0fa9731

isaacsolo merged commit f13f1dd into master Jan 21, 2022

isaacsolo deleted the is-listen-history-api branch January 21, 2022 19:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update user listen history query to use new indexed table #2185

Update user listen history query to use new indexed table #2185

isaacsolo commented Dec 21, 2021 •

edited

isaacsolo Dec 21, 2021

isaacsolo Dec 21, 2021

raymondjacobson left a comment

raymondjacobson Jan 3, 2022

isaacsolo Jan 3, 2022

isaacsolo Jan 3, 2022

raymondjacobson Jan 21, 2022

raymondjacobson Jan 3, 2022

isaacsolo Jan 3, 2022 •

edited

isaacsolo commented Jan 3, 2022

raymondjacobson Jan 21, 2022

Update user listen history query to use new indexed table #2185

Update user listen history query to use new indexed table #2185

Conversation

isaacsolo commented Dec 21, 2021 • edited

Description

Tests

How will this change be monitored?

Choose a reason for hiding this comment

Choose a reason for hiding this comment

raymondjacobson left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

isaacsolo Jan 3, 2022 • edited

Choose a reason for hiding this comment

isaacsolo commented Jan 3, 2022

Choose a reason for hiding this comment

isaacsolo commented Dec 21, 2021 •

edited

isaacsolo Jan 3, 2022 •

edited