Does Lemmy really benefit from Rust? Is code execution speed the bottleneck?

Buttons · edit-2 2 years ago

Does Lemmy really benefit from Rust? Is code execution speed the bottleneck?

Espi@kbin.social · 2 years ago

I would say that it’s extremely unlikely.

Websites in general are never limited by raw code execution, they are mostly limited by IO. Be that disk IO as files are read and written, database IO as you need to execute complex queries to gather all the data to build the user timeline, and network IO to transfer data to and from the user. For decentralized social media like Kbin or Lemmy its even more IO limited as each instance needs to go back and forth to other instances to keep up-to-date data.

Websites usually benefit much more from caching and in-memory databases to keep frequently used data in fast storage.

This is why simple, high level, object oriented, garbage collected languages have become so common. All the CPU performance penalties they incur don’t actually affect the website performance.

TortoiseWrath@tortoisewrath.com · edit-2 2 years ago

Not relevant to lemmy (yet), but this does break down a bit at very large scales. (Source: am infra eng at YouTube.)

System architecture (particularly storage) is certainly by far the largest contributor to web performance, but the language of choice and its execution environment can matter. It’s not so important when it’s the difference between using 51% and 50% of some server’s CPU or serving requests in 101 vs 100 ms, but when it’s the difference between running 5100 and 5000 servers or blocking threads for 101 vs 100 CPU-hours per second, you’ll feel it.

Languages also build up cultures and ecosystems surrounding them that can lend themselves to certain architectural decisions that might not be beneficial. I think one of the major reasons they migrated the YouTube backend from Python to C++ isn’t really anything to do with the core languages themselves, but the fact that existing C++ libraries tend to be way more optimized than their Python equivalents, so we wouldn’t have to invest as much in developing more efficient libraries.

terebat · 2 years ago

It is fairly relevant to lemmy as is. Quite a few instances have ram constraints and are hitting swap. Consider how much worse it would be in python.

Currently most of the issues are architectural and can be fixed with tweaking how certain things are done (i.e., image hosting on an object store instead of locally).

th3raid0r@tucson.social · edit-2 2 years ago

In lemmy’s case, my perusal of the DB didn’t really suggest that the queries would be that complex and I suspect that moving it to a higher performance NoSQL DB might be possible, but I’d have to take a look at a few more queries to be sure.

I wonder if this could be made to work with Aerospike Community Edition…

Obviously it could be more effort than it’s worth though.

terebat · 2 years ago

The issues I’ve seen more are around images. Hosting the images on an object store (cloudflare r2, s3) and linking there would reduce a lot of federation bandwidth, as that’s probably cause higher ram/swap usage too.

pict-rs supports storing in object stores, but when getting/serving images, it still serves through the instance as the bottleneck IIRC. That would do quite a bit to free up some resources and lower overall IO needed by the server.

hungrybread@lemmygrad.ml · 2 years ago

There’s no need to migrate the database, that shouldn’t be an issue at this size. Caching should be implemented as another comment suggested.

alertsleeper · 2 years ago

Would you be so kind as to recommend some resources about caching? I’ve read the basics, but have yet to dive deep on it

hungrybread@lemmygrad.ml · 2 years ago

The basic idea is to keep data as close to the processor as possible, so with a database that means storing the result of commonly used queries in memory.

Baldur Nil · 2 years ago

Good resources.

TortoiseWrath@tortoisewrath.com · 2 years ago

Oh shit does lemmy not have response caching? Yeah, that’s gonna be an issue pretty soon.

hungrybread@lemmygrad.ml · 2 years ago

I have no idea, just inferred that from some other posts.

Baldur Nil · 2 years ago

https://www.reddit.com/r/Lemmy/comments/14h965f/comment/jpdemet

TortoiseWrath@tortoisewrath.com · edit-2 2 years ago

Ehhhhhhh. Using a relational database for Lemmy was certainly a choice, but I don’t think it’s necessarily a bad one.

Within Lemmy, by far the most expensive part of the database is going to be comment trees, and within the industry the consensus on the best database structure to represent these is… well, there isn’t one. The efficiency of this depends way more on how you implement it within a given database model than on the database model itself. Comment trees are actually a pretty difficult problem; you’ll notice a lot of platforms have limits on comment depth, and there’s a reason for that. Getting just one level of replies to work efficiently can be tricky, regardless of the choice of DBMS.

Looking at the schema Lemmy uses, I see a couple opportunities to optimize it down the road. One of the first things I noticed is that comment replies don’t seem to be directly related back to the top-level post, meaning you’re restricted to a breadth-first search of the comment tree at serving time. Most comments will be at pretty shallow depths, so it sometimes makes sense to flatten the first few levels of this structure so you can get most relevant comments in a single query and rebuild the tree post-fetching. But this makes nomination (i.e. getting the “top 100” or whatever comments to show on your page) a lot more difficult, so it makes sense that it’s currently written the way it is.

If it’s true (as another commenter said) that there’s no response caching for comment queries, that’s a much bigger opportunity for optimization than anything else in the database.

Baldur Nil · edit-2 2 years ago

That’s correct. I wonder if YouTube still uses Python to this day (seems like they migrated to C++?)

Not saying there isn’t a difference in language performance, but for most world problems the architecture and algorithms matter more than the language for performance. Unless you’re in a very constrained environment such as lower end smartphones or embedded systems.

TortoiseWrath@tortoisewrath.com · 2 years ago

I wonder if YouTube still uses Python to this day

We do not.

Baldur Nil · 2 years ago

Also this makes you think (assuming it’s true lol): https://www.reddit.com/r/Lemmy/comments/14h965f/comment/jpdemet