[question] Alt-text service, self hosted
Is there a service that I could plug into my misskey/firefish/whatev that provides an alt-text for an image I’m posting. I don’t post images that often, so the few times, it would be nice to have an automatic OCR or LLM/Stable Diffusion description of a picture attached to it.
Of course, without depending on Google or similars.
Any ideas? cc @[email protected] @[email protected]
I feel that auto-generated descriptions are going to generally be terrible, even with the new GPT AIs. There’s too much context needed to do a good job to be able to just feed an image into some code and get something useful.
On the other hand, transcriptions should be able to be done more accurately, particularly with a bit of extra logic to recognise forms like Twitter posts.
Some database of alt-texts might be possible by scraping for alt-texts and transcriptions from the fediverse, reddit, etc, but a quick search didn’t come up with anything.
@[email protected] maybe just OCR would be enough for a lot of cases
Yeah, that’s what I mean by transcriptions; if you’re mostly posting screenshots of social media posts then it’d probably work quite well, but if it’s photos you’re definitely going to need something more complicated.