this post was submitted on 09 Apr 2024
32 points (97.1% liked)

Open Source

Is there any good speech to text transcription app?

top 14 comments
[–] [email protected] 1 points 7 months ago
[–] [email protected] 0 points 7 months ago (1 children)

The offline AI that I tried a few months ago probably needed training on the noise environment to get decent results. I forget which ones I tried but likely the ones with extensions already in Oobabooga Textgen. I was messing with text to speech mostly, but some of the ones that do TTS also have packages and examples for STT. Nothing I tried for offline generation was good enough to speak as an AI prompt without manual corrections.

[–] [email protected] 1 points 7 months ago

Did you try Whisper?

[–] [email protected] 4 points 7 months ago

FUTO Voice Input got damn good for German and English at least. They use Whisper afaik.

[–] [email protected] 1 points 7 months ago

Check out Talon Voice. It takes a bit of getting your head around, but I use it not just for transcribing, but also for coding and general computer navigation.

[–] [email protected] 7 points 7 months ago* (last edited 7 months ago) (2 children)

https://github.com/mkiol/dsnote

But basically look for something that utilizes the Whisper AI model. This list has a few options, but I am not sure if there is newer stuff: https://github.com/sindresorhus/awesome-whisper
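If you'd rather script it than use an app, here is a minimal sketch of calling Whisper directly from Python. It assumes the `openai-whisper` package and `ffmpeg` are installed; the helper names `transcribe` and `missing_requirements` and the `base` model choice are just for illustration, not something any of the apps above necessarily do.

```python
import importlib.util
import shutil
import sys


def transcribe(audio_path: str, model_name: str = "base") -> str:
    """Transcribe an audio file locally and return the recognised text."""
    # Imported lazily so this file still loads where the package is missing.
    import whisper  # assumption: provided by the `openai-whisper` package

    model = whisper.load_model(model_name)  # fetches the weights on first use
    result = model.transcribe(audio_path)   # relies on ffmpeg to decode audio
    return result["text"].strip()


def missing_requirements() -> list:
    """List what still has to be installed before transcribe() can run."""
    missing = []
    if importlib.util.find_spec("whisper") is None:
        missing.append("openai-whisper")
    if shutil.which("ffmpeg") is None:
        missing.append("ffmpeg")
    return missing


if __name__ == "__main__":
    problems = missing_requirements()
    if problems:
        print("Missing:", ", ".join(problems))
    elif len(sys.argv) > 1:
        print(transcribe(sys.argv[1]))
```

Run it with the path to a recording as the only argument. Larger models are generally more accurate but slower, so it is worth trying a small one first.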

[–] [email protected] 1 points 7 months ago

That's good to know, thank you.

[–] [email protected] 4 points 7 months ago

Yeah, Whisper is really good. It can also deal with a bunch of non-English languages.

[–] [email protected] 21 points 7 months ago (2 children)

For Android there is FUTO Voice Input, available as an F-Droid repo: https://app.futo.org/fdroid/repo

[–] [email protected] 8 points 7 months ago (2 children)

I can't believe how good this is! I'm on a really busy street (in the middle of Sydney, George Street actually), and it's put commas in, it handles question marks and all my texts. I can't believe how good this system is, and to think that it is local and open source is so impressive.

[–] [email protected] 2 points 7 months ago

Holy crap, the quality of the generated input is incredible.

[–] [email protected] 4 points 7 months ago (1 children)

I know, right? It's a really amazing piece of work when it comes right down to it. They have a donation thing, but since I don't have Google Play Services, I can't use it, and they don't have the crypto donation set up yet.

[–] [email protected] 3 points 7 months ago

Thanks for letting me know about the donation thing. I suffer from RSI so I'm always happy to donate to things that help. Much appreciated. And as before, this is written with the keyboard voice to text. It's now very, very noisy in a bar.

[–] [email protected] 1 points 7 months ago

Works great. Thank you.