Deleted Comment
Totally agree on the pain points - I covered similar thoughts in my post: https://lielvilla.com/blog/death-of-demo/
The big difference from LLMs is that we don’t really have production-grade, standardized benchmarks for long-form TTS. We need things like volume-stability across segments, speech-rate consistency, and pronunciation accuracy over a hard corpus.
I wrote up what this could look like here: https://lielvilla.com/blog/death-of-demo/
Deleted Comment
https://wonderpods.app/ - create custom podcasts for kids