return2ozma@lemmy.world to Technology@lemmy.worldEnglish · 2 days agoAudible unveils plans to use AI voices to narrate audiobookswww.theguardian.comexternal-linkmessage-square176fedilinkarrow-up1367arrow-down118
arrow-up1349arrow-down1external-linkAudible unveils plans to use AI voices to narrate audiobookswww.theguardian.comreturn2ozma@lemmy.world to Technology@lemmy.worldEnglish · 2 days agomessage-square176fedilink
minus-squarevenusaur@lemmy.worldlinkfedilinkEnglisharrow-up18·2 days agoSure there are. ElevenLabs is one. You can probably tell they’re not human but they’re really decent.
minus-squareEcho Dot@feddit.uklinkfedilinkEnglisharrow-up8arrow-down1·1 day agoThey still don’t understand the context of what they’re reading though so they can’t apply tone correctly.
minus-squarevenusaur@lemmy.worldlinkfedilinkEnglisharrow-up1·1 day agoFair. Definitely some awkward phrasing, but it’ll get better.
minus-squaressillyssadass@lemmy.worldlinkfedilinkEnglisharrow-up3arrow-down1·1 day agoFrom what I’ve been able to hear it’s not that bad. They’re pretty good at having a general tone. But they may fail when it comes to emotional tones, like anger or sadness. But for just reading a book aloud there shouldn’t be any issue.
minus-squareLandless2029@lemmy.worldlinkfedilinkEnglisharrow-up4·1 day agoJust tried it. Still a machine buy much better than default TTS.
minus-squarevenusaur@lemmy.worldlinkfedilinkEnglisharrow-up2·1 day agoIn 10 years it’s probably gonna be really impressive.
Sure there are. ElevenLabs is one. You can probably tell they’re not human but they’re really decent.
They still don’t understand the context of what they’re reading though so they can’t apply tone correctly.
Fair. Definitely some awkward phrasing, but it’ll get better.
From what I’ve been able to hear it’s not that bad. They’re pretty good at having a general tone. But they may fail when it comes to emotional tones, like anger or sadness. But for just reading a book aloud there shouldn’t be any issue.
Just tried it. Still a machine buy much better than default TTS.
In 10 years it’s probably gonna be really impressive.