- cross-posted to:
- Aii
- [email protected]
- [email protected]
- cross-posted to:
- Aii
- [email protected]
- [email protected]
A couple of years ago I decided to turn this blog into a podcast. At the time, I decided to make up a stupid rule: whatever model I use to clone my voice and generate article transcripts needs to be an open model.



The guy has pretty big constraints on his setup: Voice Cloning AND open source …
To be fair there are quite a few open source TTS AI engines that support voice cloning. Coqui TTS springs to mind. They do take some configuring and training to get right, especially for voice cloning, but it’s definitely doable opensource.
Kokoro is designed to turn epubs into audio books, and designed to be lightweight. I think he’s looking at the wrong tools for his use case.
And also a leaderboard that doesn’t even have all the TTS models? There are a TON of them.