this post was submitted on 26 Aug 2024
TechTakes
I realize it's probably a toy example, but specifically for "cats" you could achieve similar results by running a thesaurus/synonym set over your stem words. That has the added benefit that a client could add custom synonyms for more domain-specific stuff that the LLM would probably not know and would not reliably learn in-prompt or through fine-tuning. (Although I'd argue that if I'm looking for cats, I don't want to also see videos of tigers, or results based on the LLM's "understanding" of what a cat might be.)
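To make the synonym-set idea concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the `stem` function is a deliberately naive plural stripper rather than a real stemmer, and `BASE_SYNONYMS`, `expand_query`, and `search` are hypothetical names, not part of any library.

```python
def stem(word: str) -> str:
    """Naive stand-in for a real stemmer: lowercase, strip a trailing 's'."""
    word = word.lower()
    if word.endswith("s") and len(word) > 3:
        return word[:-1]
    return word


# Hypothetical built-in thesaurus, keyed by stem.
BASE_SYNONYMS = {"cat": {"kitten", "feline"}}


def expand_query(query: str, custom_synonyms=None) -> set:
    """Return the stems of the query words plus the stems of their synonyms."""
    # Copy the base thesaurus so client additions never mutate it.
    synonyms = {key: set(values) for key, values in BASE_SYNONYMS.items()}
    for key, extra in (custom_synonyms or {}).items():
        synonyms.setdefault(stem(key), set()).update(extra)

    terms = set()
    for word in query.split():
        stemmed = stem(word)
        terms.add(stemmed)
        terms.update(stem(s) for s in synonyms.get(stemmed, ()))
    return terms


def search(query: str, labels_by_video: dict, custom_synonyms=None) -> list:
    """Return the ids of videos that have a label matching any expanded term."""
    terms = expand_query(query, custom_synonyms)
    return sorted(
        video_id
        for video_id, labels in labels_by_video.items()
        if terms & {stem(label) for label in labels}
    )


videos = {"v1": ["Kittens"], "v2": ["tiger"], "v3": ["moggy"]}
print(search("cats", videos))                            # ['v1']
print(search("cats", videos, {"cat": {"moggy"}}))        # ['v1', 'v3']
```

Note that "tiger" only matches if someone explicitly adds it as a synonym, which is the kind of control a client-editable thesaurus gives you.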
For the labeling of videos itself, the most valuable labels would be added by humans, and/or full-text search on the transcript of the video if applicable, speech-to-text being more in the realm of traditional ML than in the realm of GenAI.
As a minor quibble, your use case of GenAI is not really "Generative", which is the main thing it's being sold as.
@zogwarg I've written up a quick explanation at https://gist.githubusercontent.com/Ovid/17b19faf2fb7e0019e375e97f0a4c8af/raw/196735daa5274ded8f2363a41d78a490e8325f67/vector.txt
And yes, this is still GenAI. "Gen" doesn't just mean "generating text". It also relates to "understanding" (cough) the meaning of your prompt and having a search space where it can match your meaning with the meaning of other things. That's where it starts to "generate" ideas. For vector databases, instead of generating words based on the meaning, it's generating links based on the meaning.
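For readers who haven't seen it, the "match your meaning with the meaning of other things" step in a vector database usually boils down to a nearest-neighbour lookup over embedding vectors. A rough sketch in Python, assuming hand-made three-dimensional vectors in place of real model embeddings (`TOY_INDEX`, `cosine_similarity`, and `nearest` are hypothetical names for illustration only):

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def nearest(query_vec, index, k=2):
    """Return the names of the k entries most similar to query_vec."""
    ranked = sorted(
        index.items(),
        key=lambda item: cosine_similarity(query_vec, item[1]),
        reverse=True,
    )
    return [name for name, _ in ranked[:k]]


# Made-up vectors; a real system would get these from an embedding model.
TOY_INDEX = {
    "cat video": [0.9, 0.1, 0.0],
    "tiger documentary": [0.7, 0.6, 0.0],
    "tax tutorial": [0.0, 0.1, 0.9],
}

print(nearest([1.0, 0.0, 0.0], TOY_INDEX))
# ['cat video', 'tiger documentary']
```

With these toy numbers the tiger documentary ranks second for a cat-like query, because closeness in the vector space is all the lookup knows about.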
fosstodon is the programming dot dev of mastodon and I mean that in every negative way you can imagine
your posts all give me slimy SEO vibes and you haven’t shown any upward trajectory since claiming that only generative AI lacks a separation between code and data (fucking what? seriously, think on this) so you’re getting trimmed
back when I used the wider fediverse more frequently I had fosstodon on mute for a significant amount of time
glad to know it’s still Like That