this post was submitted on 13 May 2025
236 points (94.0% liked)

Technology

70001 readers
4749 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
top 50 comments
sorted by: hot top controversial new old
[–] [email protected] 6 points 1 hour ago

It's Amazon, what did you expect? Enshittification and monopoly abuse, no surprise.

[–] [email protected] 1 points 1 hour ago

So you can take the square root of that:

5x+7integral from 5z to 9x derivative of deltaT minus minus multiply times 3. Figure 1

Figure 1 shows a typical lizard living in a square root.

[–] [email protected] 18 points 1 hour ago (1 children)

trained on stolen books? then I guess I can download these from anywhere I may find for free as well, right?

[–] [email protected] 1 points 1 hour ago

I like your way of thinking

[–] [email protected] 52 points 9 hours ago (4 children)

I can get that for free. There are apps that will read an ebook to you already. The whole point of paying the premium on audible is the superior reading/acting. Not put up with mispronounced words, weird cadence and an inability to handle acronyms

[–] [email protected] 1 points 1 hour ago

I've tried one that works surprisingly well. Each sentence had great pacing, cadence, and correct enunciation- even had tone right when someone was shouting or angry or sad.

I wouldn't really recommend it, though. While I couldn't pick any single thing out that was wrong, overall it just didn't quite flow. It's like watching someone try to act that is technically doing everything right, but it just isn't good. It basically didn't understand the greater context of the story and was saying lines.

It was uncanny valley, but exclusively with voice.

[–] [email protected] 1 points 1 hour ago* (last edited 1 hour ago) (2 children)

Is there an offline tool that generates realistic audio for epubs as Mp3 ? Something like the free Ai tool, Vibe which is for transcription. Is there something similar for TTS, runs locally without complicated setup ( most are complicated using python and etc just for installation)

[–] [email protected] 1 points 28 minutes ago* (last edited 27 minutes ago) (1 children)

I've loaded epubs into the app ReadEra, which lets you read it like any other novel app or will, in real time, read it to you. It's not the most natural of speech, but was good enough for my commute when I was in the midst of a compelling book.

[–] [email protected] 1 points 12 minutes ago

Download TTS Server, and change the engine in Readera to use it. Use the Microsoft Azure settings in TTS, much more realistic. Little slow though is my only complaint as it sends/receives a paragraph at time, resulting in a pause now and again.

[–] [email protected] 1 points 1 hour ago

Great question! I need to come back to this thread to see if something is suggested.

[–] [email protected] 1 points 2 hours ago (1 children)

Looking for iOS recommendations, preferably without a subscription that can read epub/pdf

[–] [email protected] 1 points 27 minutes ago

I'm an android user, so not sure if it's on iOS but I've used ReadEra

[–] [email protected] 1 points 2 hours ago

I thought people mainly paid for the large library

[–] [email protected] -4 points 9 hours ago

Beautiful, it works. Why not.

[–] [email protected] 8 points 9 hours ago

Is voice AI trained on stolen data? I was under the impression that was LLMs.

[–] [email protected] 6 points 10 hours ago* (last edited 10 hours ago) (3 children)

This is clearly the future despite the outrage here.

There are at least 389 living languages with over 1M speakers. That alone means it's impossible to reach some people and they get left out. Most of these languages dont even have enough professional voice actors to cover the bandwidth.

There are thousands of books released every year. That's impossible to cover even in English alone.

Its an objective net good to have more accessible audio books and the privileged people who do care about this stuff can very much afford to vote with their wallets for non-ai voices.

In fact since AI moat is so minimal this will very quickly be adapted by open source solution providing audio book access to millions if not billions of people to whom this was not an option. Its amazing.

[–] [email protected] 2 points 1 hour ago

dont even have enough professional voice actors to cover the bandwidth

I'm pretty sure they'd be a lot more people ready to do that job if there was a good remuneration. Heck that sounds a lot more fun that a LOT of jobs out there!

[–] [email protected] 1 points 5 hours ago

but for a service like audible.

[–] [email protected] 11 points 6 hours ago (2 children)

Most of these languages dont even have enough professional voice actors to cover the bandwidth.

And you think anyone is training AI voice models for those languages? Have you even seen how long it takes even large companies like Google to support the languages with hundreds of millions of speakers?

[–] [email protected] 2 points 2 hours ago* (last edited 2 hours ago) (1 children)

That's the benefit of using AI and machine learning - once you have enough source material, you can throw it all in and it'll eventually spit out a model.
Which is exactly what Meta did with their Massively Multilingual Speech project which supports text-to-speech and speech-to-text for 1107 different languages.

Is it actually any good in 99% of them, I don't have a clue, but it exists.

[–] [email protected] 1 points 1 hour ago

Seems more like a proof of concept project for that paper than something they are pursuing seriously judging by the GitHub location in some example folder that hasn't seen any significant updates in over a year. If it is so great I would assume they would pursue it more actively and replace existing models with it two years later.

[–] [email protected] -1 points 4 hours ago (2 children)

It becomes easier and cheaper every day. Today's open source LLMs are better than last year's best model.

[–] [email protected] 3 points 1 hour ago

Is it? I just tried again yesterday for a simple script since coding is the one thing apparently AI will replace people like me and it could not put together a working JavaScript script.

I have yet to see tangible results not announced by the people with sunken cost exploding their balls.

[–] [email protected] 4 points 4 hours ago

You're fundamentally misunderstanding the comment you replied to, they are not saying that voice AI are bad, they are saying there is not enough training data to improve the AI for these languages. How will it improve without good training data?

load more comments
view more: next ›