
Database transaction get_unread_event_push_actions_by_room churns database CPU #11887

Closed
jaywink opened this issue Feb 2, 2022 · 11 comments · Fixed by #13005
Labels
S-Minor (Blocks non-critical functionality, workarounds exist.), T-Defect (Bugs, crashes, hangs, security vulnerabilities, or other reported issues.)

Comments

@jaywink (Member) commented Feb 2, 2022

Description

We have a host in EMS with a room that is receiving approximately 2 events per second. A few users sitting in the room are churning database CPU through constant syncing, which causes the synchrotron process to spend a large amount of its transaction time in get_unread_event_push_actions_by_room.

[Grafana screenshots: synchrotron transaction time dominated by get_unread_event_push_actions_by_room]

The database query is:

SELECT COUNT(CASE WHEN notif = ? THEN ? END), COUNT(CASE WHEN highlight = ? THEN ? END), COUNT(CASE WHEN unread = ? THEN ? END) FROM event_push_actions ea WHERE user_id = ? AND room_id = ? AND stream_ordering > ?

Two of the users in the room each account for roughly half of the database load.

Database metrics showing the query being mostly CPU time.


More context in https://matrix.to/#/!ixSKkmjfDhyFWsKSEY:federator.dev/$ZFJ7IDlk0gt_l8ht9hEK3DZVg5mhucbaEtM4qCKPiQk?via=matrix.org&via=vector.modular.im

Version information

  • Homeserver: EMS, customer in context link

  • Version: v1.51.0

  • Install method: Docker, official images

  • Platform: EMS

@clokep (Contributor) commented Feb 2, 2022

We did some investigation into this today: it was due to a room with a high rate of events combined with users with very old read receipts. This caused the query in _get_unread_counts_by_pos_txn (mentioned above) to do a table scan instead of using the indexes.

We pretty much did a:

SELECT count(stream_ordering), user_id FROM event_push_actions WHERE room_id = '...' GROUP by user_id;

This showed a few users with ~100k pending entries, while most users had far fewer. This is because those few users never read that room (and thus never updated their read receipts).

A short-term fix was to delete the rows for those users, which will cause incorrect unread message counts for those rooms.
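A minimal sketch of that mitigation, assuming direct database access (the exact statement used is not shown in the thread; the room and user IDs are placeholders):

-- Hypothetical one-off cleanup: drop the backlog of push actions for an
-- affected user in the busy room so the count query has far fewer rows to
-- scan. That user's unread count for the room will be wrong until it is
-- rebuilt by a later receipt or rotation.
DELETE FROM event_push_actions
 WHERE room_id = '!<room>'
   AND user_id = '@<user>';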

A long-term fix is probably to make the notification count calculation incremental instead of querying across all messages. That should make it robust against users who never read a room.

@clokep (Contributor) commented Feb 2, 2022

#11893 was also found while investigating this, but we do not believe it is directly causing this issue.

@richvdh (Member) commented Feb 2, 2022

vaguely related: #5569

clokep added the S-Minor and T-Defect labels on Feb 14, 2022
@erikjohnston (Member)

My understanding was that we moved push actions to a summary table after 24 hours? Though that doesn't happen for "highlight" push actions IIRC.

Next time this goes off we should figure out why the query takes ages to run: are there lots of push actions in the last 24 hours, or are they highlights? Sounds like last time the table was filled with push actions for the affected user and so it got table-scanned.

@erikjohnston (Member)

Maybe a solution is to aggregate more aggressively if there are lots of push actions, e.g. move rows into the push actions summary table once there are more than 100. The reason not to do that is that the counts won't be as accurate, but that hardly matters when the count is in the hundreds.
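Roughly, aggregating more aggressively would mean folding a user's older rows for a busy room into the summary table and deleting them from event_push_actions. A sketch, assuming the existing event_push_summary columns; the :cutoff threshold and parameter names are illustrative, not from the thread:

-- Illustrative only: roll older notif rows into event_push_summary, leaving
-- only the most recent rows per user/room in event_push_actions.
UPDATE event_push_summary
   SET notif_count = notif_count + (
           SELECT COUNT(*) FROM event_push_actions
            WHERE user_id = :user_id AND room_id = :room_id
              AND stream_ordering <= :cutoff AND notif = 1
       ),
       stream_ordering = :cutoff
 WHERE user_id = :user_id AND room_id = :room_id;

DELETE FROM event_push_actions
 WHERE user_id = :user_id AND room_id = :room_id
   AND stream_ordering <= :cutoff;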

@erikjohnston (Member)

Also, it looks like we will rotate at a maximum of ~3333 events/s (a limit of 10000 rows per iteration followed by a delay of 3s).

@richvdh (Member) commented Feb 23, 2022

Next time this goes off we should figure out why the query takes ages to run: are there lots of push actions in the last 24 hours, or are they highlights? Sounds like last time the table was filled with push actions for the affected user and so it got table-scanned.

Just lots of regular unread messages.
The query-plan looks like:

explain SELECT COUNT(CASE WHEN notif = 1 THEN 1 END), COUNT(CASE WHEN highlight = 1 THEN 1 END), COUNT(CASE WHEN unread = 1 THEN 1 END) FROM event_push_actions ea WHERE user_id = '@<user>' AND room_id = '!<room>' AND stream_ordering > 13329084;
                                                           QUERY PLAN                                                            
---------------------------------------------------------------------------------------------------------------------------------
 Aggregate  (cost=22045.65..22045.66 rows=1 width=24)
   ->  Bitmap Heap Scan on event_push_actions ea  (cost=2043.15..21616.92 rows=28582 width=6)
         Recheck Cond: ((room_id = '!<room>'::text) AND (user_id = '@<user>'::text))
         Filter: (stream_ordering > 13329084)
         ->  Bitmap Index Scan on event_push_actions_room_id_user_id  (cost=0.00..2036.00 rows=28745 width=0)
               Index Cond: ((room_id = '!<room>'::text) AND (user_id = '@<user>'::text))
(6 rows)

@erikjohnston (Member)

Sounds like we need to aggregate more aggressively then.

@erikjohnston (Member)

This is where we'd need to add the additional rotation logic:

logger.info("Rotating notifications")
caught_up = await self.db_pool.runInteraction(
    "_rotate_notifs", self._rotate_notifs_txn
)
if caught_up:
    break
await self.hs.get_clock().sleep(self._rotate_delay)

Right now, it works by rotating all push actions whose stream ordering is older than one day, processing at most 10000 rows per iteration.

I think we can add a second loop where we check whether there are lots of push actions for a given room, and rotate old rows until at most e.g. 100 rows are left in the event_push_actions table. This should be safe if we change the bounds of how we fetch unprocessed counts at:

WHERE ? <= stream_ordering AND stream_ordering < ?

to instead use the old stream ordering that we store in event_push_summary.
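In other words, the lower bound of the count over event_push_actions would come from the per-user/room stream ordering already recorded in event_push_summary, something like the following sketch (not the final query; the exact shape is an assumption):

-- Sketch: only count rows newer than the summarised position; the stored
-- summary counts are then added on top.
SELECT COUNT(CASE WHEN notif = 1 THEN 1 END)
  FROM event_push_actions ea
 WHERE ea.user_id = ? AND ea.room_id = ?
   AND ea.stream_ordering > (
           SELECT stream_ordering FROM event_push_summary
            WHERE user_id = ea.user_id AND room_id = ea.room_id
       );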

erikjohnston self-assigned this on Jun 7, 2022
@erikjohnston (Member)

Ok, so my suggestion in the previous comment doesn't work, as it results in table scanning event_push_actions, which is not tenable for larger hosts.

@erikjohnston (Member)

New plan:

Have a new table called event_push_counts, which is like event_push_summary but keeps track of the full count (rather than just the count up to one day ago). This will allow fetching the unread counts to work the same way as it does today, except that the range we need to scan in event_push_actions is much smaller.

To keep it updated:

  1. On a new event (see the SQL sketch after this list) we:
    1. Fetch all rows from event_push_actions for the room/user from the stream_ordering stored in event_push_counts (or, if no row exists there, from event_push_summary).
    2. Add that count to event_push_counts and update its stream_ordering.
  2. When we get a receipt from a user in a room, recalculate event_push_counts by counting the remaining matching rows in event_push_actions.
  3. On rotation of push actions into event_push_summary, no action needs to be taken, as it should be a no-op.
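A rough sketch of step 1 in SQL; event_push_counts is the proposed table, so its columns and the parameter names here are assumptions:

-- Sketch of step 1: on a new event, fold any rows newer than the last counted
-- position into event_push_counts and advance its stream_ordering.
UPDATE event_push_counts
   SET notif_count = notif_count + (
           SELECT COUNT(*) FROM event_push_actions
            WHERE user_id = :user_id AND room_id = :room_id
              AND stream_ordering > event_push_counts.stream_ordering
              AND notif = 1
       ),
       stream_ordering = :new_stream_ordering
 WHERE user_id = :user_id AND room_id = :room_id;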

erikjohnston added a commit that referenced this issue Jun 15, 2022
Fixes #11887 hopefully.

The core change here is that `event_push_summary` now holds a summary of counts up until a much more recent point, meaning that the range of rows we need to count in `event_push_actions` is much smaller.

This needs two major changes:
1. When we get a receipt we need to recalculate `event_push_summary` rather than just delete it
2. The logic for deleting `event_push_actions` is now divorced from calculating `event_push_summary`.

In future it would be good to calculate `event_push_summary` while we persist a new event (it should just be a case of adding one to the relevant rows in `event_push_summary`), as that will further simplify the get counts logic and remove the need for us to periodically update `event_push_summary` in a background job.
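For change 1, the receipt handler would re-count the outstanding rows past the receipt and write that back, rather than deleting the summary row. A sketch, assuming a unique index on (user_id, room_id) in `event_push_summary`; the PR's actual query may differ:

-- Sketch of change 1: on receipt, recompute the summary from the receipt's
-- stream ordering onwards instead of deleting the row.
INSERT INTO event_push_summary (user_id, room_id, notif_count, stream_ordering)
VALUES (
    :user_id,
    :room_id,
    (SELECT COUNT(*) FROM event_push_actions
      WHERE user_id = :user_id AND room_id = :room_id
        AND stream_ordering > :receipt_stream_ordering AND notif = 1),
    :receipt_stream_ordering
)
ON CONFLICT (user_id, room_id) DO UPDATE
   SET notif_count = EXCLUDED.notif_count,
       stream_ordering = EXCLUDED.stream_ordering;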