ChatGPT's voice, self-hosted?

mrashley@lemmy.ca · 6 months ago

ChatGPT's voice, self-hosted?

kakes@sh.itjust.works · 6 months ago

Regarding the TTS specifically, I remember looking into TorToiSe TTS back when this stuff was first coming out. You can generate ElevenLabs-quality audio with it, but it’s insanely slow. In fact, when I was looking into it, it seemed like ElevenLabs might have been using a version of TorToiSe TTS (one that was much faster at the time), given how similar the output is.

According to the linked GitHub page, they seem to have solved the speed issues now, so it might be worth looking into. Of course, the other commenters have suggested solutions that come pre-integrated with an LLM, but if you’re just looking for TTS, this could be worth checking out. Also worth noting: it requires an NVIDIA GPU.

requiem@lemmy.world · 6 months ago

https://github.com/vndee/local-talking-llm

adr1an@programming.dev · 6 months ago

This is what OP is looking for. It exists! Other repos only cover part of it (e.g. either Ollama or TTS).

ReedReads@lemmy.zip · 6 months ago

Localai.io

https://github.com/mudler/LocalAI

B0rax@feddit.de · 6 months ago

You mean just the text-to-speech part? Look into Piper.
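
For anyone who wants to try Piper from a script, here is a minimal sketch, not an official integration. It assumes the `piper` command-line tool is on your PATH and that you have already downloaded a voice model; the model filename in the usage note is only an example, and the helper names `build_piper_command` and `speak_to_file` are made up for illustration. Piper reads the text from standard input and writes a WAV file.

```python
import shutil
import subprocess
from typing import List, Optional


def build_piper_command(model_path: str, output_path: str,
                        executable: str = "piper") -> List[str]:
    """Return the argument list for a Piper run that writes a WAV file."""
    return [executable, "--model", model_path, "--output_file", output_path]


def speak_to_file(text: str, model_path: str, output_path: str) -> Optional[str]:
    """Pipe `text` to Piper on stdin.

    Returns the output path, or None when the Piper executable
    is not installed (assumption: it is called `piper`).
    """
    command = build_piper_command(model_path, output_path)
    if shutil.which(command[0]) is None:
        return None
    # Piper reads the text to speak from standard input.
    subprocess.run(command, input=text.encode("utf-8"), check=True)
    return output_path
```

Calling `speak_to_file("Hello from my own server.", "en_US-lessac-medium.onnx", "hello.wav")` should leave a `hello.wav` next to the script, provided that model file exists locally.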

Ashyr@sh.itjust.works · 6 months ago

When can I get one of these voices to read an epub on my phone? I’d love to have something like that.

pythia@lemmy.dbzer0.com · edit-2 6 months ago

I use these two all the time for TTS:

https://github.com/JarodMica/ai-voice-cloning / https://github.com/gitmylo/audio-webui

PeachMan@lemmy.world · 6 months ago

epub2tts: https://github.com/aedocw/epub2tts

It looks like that project uses Coqui TTS: https://github.com/coqui-ai/TTS
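
If you would rather call Coqui TTS directly on book-length text, a rough sketch might look like the following. This is not how epub2tts is implemented; it is just one plausible approach. The helper names `split_into_chunks` and `synthesize_chunks`, the 200-character limit, and the choice of model are all assumptions for illustration. It assumes the Coqui `TTS` Python package is installed for the synthesis step; the chunking helper only needs the standard library.

```python
import re
from typing import List


def split_into_chunks(text: str, max_chars: int = 200) -> List[str]:
    """Group whole sentences into chunks of at most `max_chars` where possible.

    A single sentence longer than `max_chars` is kept as its own chunk.
    """
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    chunks: List[str] = []
    current = ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def synthesize_chunks(chunks: List[str],
                      model_name: str = "tts_models/en/ljspeech/tacotron2-DDC",
                      prefix: str = "chunk") -> List[str]:
    """Write one WAV file per chunk with Coqui TTS and return the file paths."""
    # Imported lazily so the chunking helper works without Coqui installed.
    from TTS.api import TTS

    tts = TTS(model_name=model_name)
    paths = []
    for index, chunk in enumerate(chunks):
        path = f"{prefix}_{index:04d}.wav"
        tts.tts_to_file(text=chunk, file_path=path)
        paths.append(path)
    return paths
```

Splitting on sentence boundaries keeps each request short, which tends to help with both memory use and pronunciation; the resulting WAV files can then be joined with whatever audio tool you prefer.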

mrashley@lemmy.ca · 6 months ago

Oh WOW! Thanks to all who commented. Next time I get a chance I’m going to check these all out! 👍🏻 I hope others find this thread helpful too!

Lemmy Tagginator@utter.online · 6 months ago

New Lemmy Post: ChatGPT’s voice, self-hosted? (https://lemmyverse.link/lemmy.world/post/15336896)
Tagging: #SelfHosted
