Graph linearisation #187
Problem context
Rooms are a directed acyclic graph. Events contain a `depth` field which gives an approximate ordering. The `depth` can lie: e.g. for the events A -> B -> C, nothing stops C saying `depth: 11` even though the depth is actually `3` (starting at `1` with A). Basically, we should not use the `depth` provided by servers.

However, we do. We rely on the `depth` to work out topological tokens when backfilling via the CS API call `/messages`. If two or more events share the same `depth` value, we tiebreak based on the stream position (which is just a monotonically increasing integer). This is "okay", but it allows attackers to inject events into any point in room history by fiddling with the `depth`. This doesn't affect state, just the timeline sent to clients.
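To make the tiebreak concrete, here is a small self-contained sketch; the `event` struct and values are invented for illustration (they are not GMSL types), but the sort mirrors the `(depth, stream position)` rule described above:

```go
package main

import (
	"fmt"
	"sort"
)

// event is a stand-in for a timeline event: Depth comes from the
// (untrusted) event JSON, StreamPos is the local monotonic counter.
type event struct {
	ID        string
	Depth     int64 // attacker-controlled
	StreamPos int64 // assigned locally, monotonically increasing
}

func main() {
	timeline := []event{
		{ID: "$A", Depth: 1, StreamPos: 1},
		{ID: "$B", Depth: 2, StreamPos: 2},
		{ID: "$C", Depth: 3, StreamPos: 3},
		// A newly received event claiming a low depth: despite being the
		// most recent locally (highest stream position), it sorts into
		// the middle of history.
		{ID: "$evil", Depth: 2, StreamPos: 4},
	}
	// Order used for topological tokens: by depth, then stream position.
	sort.Slice(timeline, func(i, j int) bool {
		if timeline[i].Depth != timeline[j].Depth {
			return timeline[i].Depth < timeline[j].Depth
		}
		return timeline[i].StreamPos < timeline[j].StreamPos
	})
	// Prints $A, $B, $evil, $C in that order: $evil lands before $C even
	// though it arrived last.
	fmt.Println(timeline)
}
```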
Proposed solution

We need to calculate `depth` for every event we receive based on `prev_events`. We may also need to retrospectively update the depth of events when we get told about new events.
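As a minimal sketch of the per-event calculation, assuming a hypothetical `depthStore` lookup (not a real GMSL or Dendrite API), the depth would be one more than the deepest known `prev_event`:

```go
package linearise

import "fmt"

// depthStore is a hypothetical lookup of depths we have already computed
// locally; it is not a real GMSL or Dendrite interface.
type depthStore interface {
	// Depth returns the locally computed depth for an event ID, or an
	// error if we do not have the event.
	Depth(eventID string) (int64, error)
}

// computeDepth derives an event's depth from its prev_events rather than
// trusting the depth claimed in the event JSON: one more than the deepest
// parent we know about.
func computeDepth(store depthStore, prevEventIDs []string) (int64, error) {
	var max int64
	for _, id := range prevEventIDs {
		d, err := store.Depth(id)
		if err != nil {
			// A missing prev_event means we cannot linearise yet; the
			// caller would need to backfill or defer the event.
			return 0, fmt.Errorf("missing prev_event %q: %w", id, err)
		}
		if d > max {
			max = d
		}
	}
	return max + 1, nil
}
```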
The naive solution would be to recalculate the depth for every single event when a new event arrives. This takes Θ(n+m) time per edge, where n is the number of nodes and m the number of edges, so inserting m edges takes Θ(m²) time. There are algorithms which perform much better than this, notably Katriel–Bodlaender, which maintains a topological ordering online as edges are inserted.

We should implement this in GMSL, or even as a stand-alone library with an interface for the ordinal data structure.
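As a starting point, the interface for the ordinal data structure might look something like the following sketch; the package, type and method names are assumptions for illustration, not an agreed API:

```go
package order

// Ordinal is an opaque position in the total order maintained by the
// structure. Ordinals are only meaningful relative to each other.
type Ordinal interface {
	// Less reports whether this ordinal sorts before other.
	Less(other Ordinal) bool
}

// TopologicalOrder maintains an online topological ordering of a DAG as
// edges arrive, in the spirit of Katriel–Bodlaender: inserting an edge may
// reorder existing nodes, but queries stay cheap.
type TopologicalOrder interface {
	// AddNode registers a node (an event ID) and returns its ordinal.
	AddNode(id string) (Ordinal, error)
	// AddEdge records that `from` precedes `to` (i.e. `to` lists `from`
	// in its prev_events) and reorders internally if needed.
	AddEdge(from, to string) error
	// OrdinalOf returns the current ordinal of a node, if known.
	OrdinalOf(id string) (Ordinal, bool)
}
```

Keeping `Ordinal` opaque leaves the algorithm free to relabel positions internally, which online topological ordering algorithms need to do, without callers depending on concrete values.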
We should then make a separate component in Dendrite which consumes the roomserver output log and calculates depths for events, and then sends them on (or alternatively, keep the `syncapi` in charge of doing this).
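Purely as an illustration of that component's shape, and assuming hypothetical types rather than Dendrite's actual roomserver output API, the consumer could look roughly like:

```go
package depthcalc

// OutputEvent is a hypothetical stand-in for a roomserver output log entry.
type OutputEvent struct {
	EventID      string
	PrevEventIDs []string
}

// Consumer recalculates depths for events coming off the roomserver output
// log and forwards them (e.g. to the syncapi) with the computed depth.
type Consumer struct {
	depths  map[string]int64                  // locally computed depths
	forward func(eventID string, depth int64) // downstream sink, e.g. syncapi
}

// NewConsumer returns a consumer that forwards computed depths to sink.
func NewConsumer(sink func(eventID string, depth int64)) *Consumer {
	return &Consumer{depths: make(map[string]int64), forward: sink}
}

// OnEvent handles one output event: derive its depth from prev_events and
// pass it on. Retrospective updates (when a late edge changes earlier
// depths) are deliberately out of scope for this sketch.
func (c *Consumer) OnEvent(ev OutputEvent) {
	var max int64
	for _, id := range ev.PrevEventIDs {
		if d, ok := c.depths[id]; ok && d > max {
			max = d
		}
	}
	depth := max + 1
	c.depths[ev.EventID] = depth
	c.forward(ev.EventID, depth)
}
```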
NB: We still have the outstanding problem of what to do if the depths change and hence the linearisation of the DAG changes.

NBB: We still need to handle chunks of DAG (e.g. we joined the room, got some events, left the room for a while, then rejoined). The above algorithm would need to be applied to each chunk independently, with book-keeping for a chunk along the lines of (from @erikjohnston):