this post was submitted on 06 Jan 2025

[–] [email protected] 0 points 1 week ago* (last edited 1 week ago) (2 children)

In the Bible's Book of Revelation, John (the author) is witnessing the end of the world and sees four horsemen being unleashed upon the world to spread a curse/trial/whatever wherever they ride. Each horseman brings something different: famine, disease, war (or strife), and death. Death is the last, IIRC, and rides upon a pale horse. I think that's what they're referencing. This person is saying that OpenAI is going to die soon.

[–] [email protected] 0 points 1 week ago

this is correct as to the background of the term itself. the reason ed uses it here is that it is the term he selected some months ago when he listed “some likely pale horses that signal the bubble popping”

[–] [email protected] 0 points 1 week ago* (last edited 1 week ago) (4 children)

it won't. it's backed by microsoft. they can literally afford to burn the cash on this until it becomes profitable, and it will: AI has so many low hanging fruits to optimize, it's insane.

[–] [email protected] 0 points 1 week ago

They will only pay as long as it benefits them.

[–] [email protected] 0 points 1 week ago (1 children)

Okay, explain. What kinds of low hanging fruit?

[–] [email protected] 0 points 1 week ago* (last edited 1 week ago) (1 children)

quants are pretty basic. switching from floats to ints (faster instruction sets) is another. those are the well-known ones. both those are related to information theory, but there are other things I legally can't mention. shrug. suffice to say the model sizes are going to be decreasing dramatically.

edit: the first two points require reworking the base infrastructure to support, which is why they haven't hit widespread adoption. but the research showing that 3 bits is as good as 64 is intuitive once you tie the original inspiration for some of the AI designs. that reduction alone means you can get a 21x reduction in model size, which is pretty solid.
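
For reference, a rough sketch of where a 21x figure would come from, assuming it is just the ratio of bits per weight (64 / 3) and ignoring the scale factors and other metadata that real quantized formats carry. The parameter count below is hypothetical, picked only to make the numbers concrete; it is not from this thread.

```python
# Back-of-the-envelope weight storage at different bits per weight.
# Assumption: size scales linearly with bits per weight, with no overhead.

def model_size_bytes(num_params: int, bits_per_weight: int) -> float:
    """Storage needed for the weights alone, in bytes."""
    return num_params * bits_per_weight / 8

NUM_PARAMS = 7_000_000_000  # hypothetical parameter count, illustration only

size_64 = model_size_bytes(NUM_PARAMS, 64)
size_3 = model_size_bytes(NUM_PARAMS, 3)
reduction = size_64 / size_3

print(f"64-bit weights: {size_64 / 1e9:.1f} GB")
print(f"3-bit weights:  {size_3 / 1e9:.3f} GB")
print(f"reduction:      {reduction:.1f}x")
```

Weights are more commonly stored in 16 or 32 bits than 64, so against those baselines the same arithmetic gives roughly 5x or 11x.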

[–] [email protected] 0 points 1 week ago (3 children)

both those are related to information theory, but there are other things I legally can’t mention. shrug.

hahahaha fuck off with this. no, the horseshit you’re fetishizing doesn’t fix LLMs. here’s what quantization gets you:

  • the LLM runs on shittier hardware
  • the LLM works worse too
  • that last one’s kinda bad when the technology already works like shit
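
A toy illustration of the rounding cost behind the list above, assuming a plain uniform symmetric quantizer and randomly generated stand-in "weights". This is not how any particular LLM toolkit quantizes, and it only measures how far the restored numbers drift from the originals, not what that does to model output.

```python
import random

def quantize(values, bits):
    """Uniform symmetric quantization to signed ints in [-qmax, qmax]."""
    qmax = 2 ** (bits - 1) - 1
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    ints = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return ints, scale

def dequantize(ints, scale):
    """Map the stored ints back to approximate floats."""
    return [i * scale for i in ints]

def mean_abs_error(values, bits):
    """Average distance between the original and round-tripped values."""
    ints, scale = quantize(values, bits)
    restored = dequantize(ints, scale)
    return sum(abs(a - b) for a, b in zip(values, restored)) / len(values)

random.seed(0)
weights = [random.gauss(0.0, 1.0) for _ in range(10_000)]  # made-up data

for bits in (8, 4, 3):
    print(f"{bits} bits: mean abs error {mean_abs_error(weights, bits):.4f}")
```

Fewer bits means fewer representable levels, so the round-trip error grows as the bit width shrinks.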

anyway speaking of basic information theory:

but the research showing that 3 bits is as good as 64 is intuitive once you tie the original inspiration for some of the AI designs.

lol

[–] [email protected] 0 points 1 week ago (1 children)

It's actually super easy to increase the accuracy of LLMs.

import pytorch # or ollama or however you fucking dorks use this nonsense
from decimal import Decimal

I left out all the other details because it's pretty intuitive why it works if you understand why floats have precision issues.
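
For anyone who wants the float precision issue spelled out, a minimal standard-library sketch. It only shows that binary floats can't represent 0.1 exactly while `Decimal` built from strings can; it makes no claim about LLM accuracy.

```python
from decimal import Decimal

# Binary floating point: 0.1 and 0.2 are stored as nearby approximations,
# so their sum is not exactly the float closest to 0.3.
float_sum = 0.1 + 0.2

# Decimal built from strings keeps the base-10 digits exactly.
decimal_sum = Decimal("0.1") + Decimal("0.2")

print(float_sum == 0.3)               # False
print(decimal_sum == Decimal("0.3"))  # True
```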

[–] [email protected] 0 points 1 week ago

decimal is a severely underappreciated library

[–] [email protected] 0 points 1 week ago (2 children)

I have seen these 3-bit AI papers on Hacker News a few times. And the takeaway apparently is: the current models are pretty shitty at what we want them to do, and we can reach a similar (but slightly worse) level of shittiness with 3 bits.

But that doesn't say anything about how both technologies could progress in the future. I guess you can compensate for having only three bits to pass between nodes by just having more nodes. But that doesn't really seem helpful, neither for storage nor compute.

Anyways yeah it always strikes me as a kind of trend that maybe has an application in a very specific niche but is likely bullshit if applied to the general case

[–] [email protected] 0 points 1 week ago

Far as I can tell, the only real benefit here is significant energy savings, which would take LLMs from "useless waste of a shitload of power" to "useless waste of power".

[–] [email protected] 0 points 1 week ago (1 children)

If anything that sounds like an indictment? Like, the current models are so incredibly fucking bad that we could achieve the same with three bits and a ham sandwich

[–] [email protected] 0 points 1 week ago

Oh it definitely says something about the current models for sure

[–] [email protected] 0 points 1 week ago* (last edited 1 week ago)

Honestly, the research showing that a schlong that's 3mm wide is just as satisfying as one that's 64 is intuitive once you tie the original inspiration for some of the sex positions.

[–] [email protected] 0 points 1 week ago (1 children)

the fruit can’t be rotten, you must be picking it wrong

[–] [email protected] 0 points 1 week ago

“look, Mme Karen, this is definitely not a rotten tomato. it can’t be a rotten tomato, we don’t sell rotten tomatoes. you can see here on the menu that we don’t have rotten tomatoes on offer. and see here, on your receipt, where it says quinoa salad? absolutely not rotten tomatoes!” explains the manager fervently, avoiding a tableward glance at the pungent red blob with as much will as they can muster

[–] [email protected] 0 points 1 week ago* (last edited 1 week ago) (1 children)

So many low-hanging fruits. Unbelievable fruits. You wouldn’t believe how low they’re hanging.

[–] [email protected] 0 points 1 week ago* (last edited 1 week ago) (1 children)

They are tremendous fruits. Many people are saying this. Fruits.

[–] [email protected] 0 points 1 week ago

and we have concepts for even lower hanging fruits. beautiful concepts.