this post was submitted on 11 Feb 2025
40 points (97.6% liked)

Selfhosted

42108 readers
410 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules:

  1. Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.

  2. No spam posting.

  3. Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.

  4. Don't duplicate the full text of your blog or github here. Just post the link for folks to click.

  5. Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).

  6. No trolling.

Resources:

Any issues on the community? Report it using the report flag.

Questions? DM the mods!

founded 2 years ago
MODERATORS
 

Title question mostly. I've played with XTTS-v2 and it worked pretty well, but I'm wondering if folks are using anything else special. I'd like to train my own voice finetune which is what I did with XTTS-v2, and then use it with home assistant's voice feature. Welcome all opinions on it!

top 16 comments
sorted by: hot top controversial new old
[–] [email protected] 2 points 1 day ago

You might consider asking this in the [email protected] community, too.

[–] [email protected] 4 points 2 days ago (1 children)

Piper works pretty well. I'm only using it because it was easier to find a custom glados voice.

Kokoro has good default voices. I also started trying out Speaches recently. It provides an open ai api wrapper around several options

[–] [email protected] 1 points 2 days ago

Any tips on getting speaches to work with Home assistant? Got speaches working but haven't gone the next step yet.

[–] [email protected] 10 points 2 days ago (1 children)

if you need English - right now it's kokoro-fastapi https://github.com/remsky/Kokoro-FastAPI set this container up and use it as an openai TTS endpoint using this hacs integration https://github.com/sfortis/openai_tts

[–] [email protected] 1 points 2 days ago

Very nice! I'll check this out!

[–] [email protected] 2 points 2 days ago

Don’t know much about the training side of things, but I have Piper set up with home assistant using the Wyoming protocol and it just goes. Some of the out-of-the-box voices are pretty decent too.

[–] [email protected] 2 points 2 days ago (1 children)

Pretty much just personal preference at this point. XTTS is certainly not the most efficient though.

[–] [email protected] 2 points 2 days ago (1 children)

any personal preferences you recommend?

[–] [email protected] 3 points 2 days ago* (last edited 2 days ago) (1 children)

Pico, Piper, Mary, and Google all run locally and off of CPU only.

I think all the rest require cloud accounts or acceleration hardware to work quickly.

I'm personally fine with Mary or Piper, but I know some people like the fancier ones.

[–] [email protected] 1 points 2 days ago (1 children)

Google? Have you verified that?

[–] [email protected] 2 points 2 days ago (1 children)
[–] [email protected] 1 points 2 days ago (1 children)

The docs don't say it's completely offline. Can you turn off your LAN connection and it still works? Have you tried this? Or just firewall off out bound access to Google services?

This comment:

Contrary to what the name suggests, the integration only does text-to-speech and does not translate messages sent to it.

doesn't say it doesn't call out to Google services; it says only that it doesn't use translation services. I didn't see anything else that implies it doesn't send data to Google.

[–] [email protected] 0 points 1 day ago* (last edited 1 day ago) (2 children)

If it doesn't require an API key in the config, it's offline. My HA works totally offline unless I need to do updates, and it's always worked for me.

You can also view the code, cuz open source.

[–] [email protected] 1 points 16 hours ago

Yeah, so I dug into it, and it's definitely not offline. It uses gtts, which ultimately makes calls to google.com for the tts. You can track it down yourself, but you'll eventually end up here, which talks about how to change the google host name in case it's blocked.

I'm not sure why you believe not needing an API key means it isn't calling a Google API, especially in this case where it clearly states it's using an unofficial channel - which is the same trick third party YouTube clients use to access YouTube videos without using API keys.

[–] [email protected] 0 points 1 day ago (1 children)

👍 Thanks. I'm surprised, and still skeptical, but thanks.

[–] [email protected] 1 points 1 day ago

I was curious and it uses gTTS.

It calls what's probably the "speak" button on translate.google.com

This project is leveraging the undocumented Google Translate speech functionality