Audible unveils plans to use AI voices to narrate audiobooks (www.theguardian.com)
Posted by return2ozma@lemmy.world to Technology@lemmy.world · English · 2 months ago · 189 comments
Cross-posted to: [email protected]
venusaur@lemmy.world · 2 months ago
Sure there are. ElevenLabs is one. You can probably tell they’re not human but they’re really decent.

Echo Dot@feddit.uk · 2 months ago
They still don’t understand the context of what they’re reading though, so they can’t apply tone correctly.

ssillyssadass@lemmy.world · 2 months ago
From what I’ve been able to hear, it’s not that bad. They’re pretty good at having a general tone. But they may fail when it comes to emotional tones, like anger or sadness. But for just reading a book aloud there shouldn’t be any issue.

venusaur@lemmy.world · 2 months ago
Fair. Definitely some awkward phrasing, but it’ll get better.

Landless2029@lemmy.world · 2 months ago
Just tried it. Still a machine, but much better than default TTS.

venusaur@lemmy.world · 2 months ago
In 10 years it’s probably gonna be really impressive.