For the segment of mom and pop store who need an explainer video or Facebook ad made in Canva and don't want to pay someone to record, they want easy of use, realism and editability/speed.
My friend who runs a Shopify store asked for this. They are not going to fiddle with VST plugins or local/cloud GPUs.
Aren't they better off hiring cheap on Fiverr for someone else to do the entire video? The traditional reason against this was that you'd want your narrator to sound like a native speaker. But if AI fixes that, is there any downside to outsourcing video voice-overs to cheap labor countries?
How is that better? The AI should be cheaper and the with less hassle (creating a job, reviewing freelancers, negotiating) with less risk of poor quality/reworks and disputes and yes accent is a big one.
The ideal TTS product for such a person would be something like: sign up and pay > choose voice > paste text > download audio
My friend who runs a Shopify store asked for this. They are not going to fiddle with VST plugins or local/cloud GPUs.