I'm working on a TL;DR bot for Lemmy, powered by GPT-3.5

𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 · 1 year ago

I'm working on a TL;DR bot for Lemmy, powered by GPT-3.5

Salamander · 1 year ago

I am very happy to hear that you will open source it!

I am curious - have you tested how well it handles a direct link to a scientific article in PDF format?

𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 · 1 year ago

It only handles HTML currently, but I like your idea, thank you! I’ll look into implementing reading PDFs as well. One problem with scientific articles however is that they are often quite long, and they don’t fit into the model’s context. I would need to do recursive summarization, which would use much more tokens, and could become pretty expensive. (Of course, the same problem occurs if a web page is too long; I just truncate it currently which is a rather barbaric solution.)

Salamander · 1 year ago

Thanks for your response!

I imagined that this would be harder to pull off. There is also the added complexity that the layout contains figures and references… Sill, it’s pretty cool, I’ll keep an eye on this project, and might give self-hosting it a try once it’s ready!

𝕊𝕚𝕤𝕪𝕡𝕙𝕖𝕒𝕟 · 1 year ago

LLMs can do a surprisingly good job even if the text extracted from the PDF isn’t in the right reading order.

Another thing I’ve noticed is that figures are explained thoroughly most of the time in the text so there is no need for the model to see them in order to generate a good summary. Human communication is very redundant and we don’t realize it.