this post was submitted on 15 Jan 2025
39 points (86.8% liked)

Technology

60462 readers
3886 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 2 years ago
MODERATORS
top 12 comments
sorted by: hot top controversial new old
[–] [email protected] 1 points 34 minutes ago

Kinda off topic but does anyone know a reliable thing that does the opposite, i.e. a good speech to text tool? There's a few audiobook series that I'd love to read but are only available in audio format

[–] [email protected] 1 points 2 hours ago

Sweet. I'll see if I can get working later on. Maybe I missed it, how much RAM is required for this?

[–] [email protected] 3 points 6 hours ago

I thought about creating something like this some time ago, guess I should think less and do more

[–] [email protected] 2 points 6 hours ago (1 children)

Looks like cuda or cpu, and i just bought a radeon card.

[–] [email protected] 1 points 3 hours ago

Does ZLUDA help here?

[–] [email protected] 1 points 8 hours ago* (last edited 8 hours ago)

Been hoping for something like this. Hello waiting ebooks!

[–] [email protected] 11 points 10 hours ago (1 children)

I’m curious enough to try this.

Audiobooks are made or broken by the quality of the narration. It’s not enough to simply narrate written word, the narrator has to “act” as well. Given the quality of other arts engaged by AI I’m going into this skeptical.

[–] [email protected] 1 points 1 hour ago

You should give Ender's Game a listen, the one narrated by Harlan Ellison and Stefan Rudnicki (I borrowed from my local library). The performance is excellent.

[–] [email protected] 2 points 11 hours ago* (last edited 11 hours ago) (1 children)

I'll give it a shot.

I tried something similar a year or so ago with my own script. What I found was, the longer the book, the harder the ai has on having the same voice.

I eventually got around this limitation by using piper. https://github.com/rhasspy/piper

It was good enough for my purposes. But it definitely sounded mechanical.

[–] [email protected] 2 points 9 hours ago (1 children)

Let me know if it's worth the time.

[–] [email protected] 1 points 9 hours ago

For sure, just give me a nudge as I may forget to come back haha. I'll try it out tonight if I get a chance.

[–] [email protected] 7 points 12 hours ago

Interesting. Just a couple of years ago AI generated audiobooks were unlistenable for me due to the dead voice delivery but the example here actually sounds like it would be alright to listen to.

Definitely want to give this a try for some books that have never had an audio book recording. Thanks for posting!