diff options
author | Richard van der Hoff <1389908+richvdh@users.noreply.github.com> | 2021-03-15 13:51:02 +0000 |
---|---|---|
committer | GitHub <noreply@github.com> | 2021-03-15 13:51:02 +0000 |
commit | af2248f8bf1cf11a230577650e84885387f1920f (patch) | |
tree | 819e8b9e69910f73eca7c492e94c05b9642752a7 /changelog.d/9601.feature | |
parent | Fix additional type hints from Twisted 21.2.0. (#9591) (diff) | |
download | synapse-af2248f8bf1cf11a230577650e84885387f1920f.tar.xz |
Optimise missing prev_event handling (#9601)
Background: When we receive incoming federation traffic, and notice that we are missing prev_events from the incoming traffic, first we do a `/get_missing_events` request, and then if we still have missing prev_events, we set up new backwards-extremities. To do that, we need to make a `/state_ids` request to ask the remote server for the state at those prev_events, and then we may need to then ask the remote server for any events in that state which we don't already have, as well as the auth events for those missing state events, so that we can auth them. This PR attempts to optimise the processing of that state request. The `state_ids` API returns a list of the state events, as well as a list of all the auth events for *all* of those state events. The optimisation comes from the observation that we are currently loading all of those auth events into memory at the start of the operation, but we almost certainly aren't going to need *all* of the auth events. Rather, we can check that we have them, and leave the actual load into memory for later. (Ideally the federation API would tell us which auth events we're actually going to need, but it doesn't.) The effect of this is to reduce the number of events that I need to load for an event in Matrix HQ from about 60000 to about 22000, which means it can stay in my in-memory cache, whereas previously the sheer number of events meant that all 60K events had to be loaded from db for each request, due to the amount of cache churn. (NB I've already tripled the size of the cache from its default of 10K). Unfortunately I've ended up basically C&Ping `_get_state_for_room` and `_get_events_from_store_or_dest` into a new method, because `_get_state_for_room` is also called during backfill, which expects the auth events to be returned, so the same tricks don't work. That said, I don't really know why that codepath is completely different (ultimately we're doing the same thing in setting up a new backwards extremity) so I've left a TODO suggesting that we clean it up.
Diffstat (limited to 'changelog.d/9601.feature')
-rw-r--r-- | changelog.d/9601.feature | 1 |
1 files changed, 1 insertions, 0 deletions
diff --git a/changelog.d/9601.feature b/changelog.d/9601.feature new file mode 100644 index 0000000000..5078d63ffa --- /dev/null +++ b/changelog.d/9601.feature @@ -0,0 +1 @@ +Optimise handling of incomplete room history for incoming federation. |