After moving from lemmy.ml to programming.dev, I’ve noticed that web responses are fulfilled much more quickly, even for content on federated instances like lemmy.ml and lemmy.world.
It seems like this shouldn’t make such a big difference. If a large instance is overloaded, it’s overloaded, whether the traffic is coming from clients with accounts on that instance or from other federated instances.
Can this be explained entirely by response caching?
Caching is definitely part of the answer. Another answer I saw recently is that the Admins of an instance called Programming.dev are pretty likely to know what they are doing lol