Technology

38699 readers

234 users here now

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:

This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

founded 3 years ago

MODERATORS

[email protected]

437

Ask ChatGPT to pick a number between 1 and 100 (jlai.lu)

submitted 1 year ago by [email protected] to c/[email protected]

148 comments fedilink hide all child comments

top 50 comments

sorted by: hot top controversial new old

[–] [email protected] 5 points 1 year ago

I spent an afternoon once playing Infinite Craft, which uses some sort of LLM behind the scenes to do it's combinations.

At one point I got 007, and found 007+007 = 0014.

The maths gets wild though, and because it's been trained on text, it has no idea when it comes to combinations of numbers it hasn't seen before. I spent ages trying to get it to 69420 and just couldn't, although I could get 42069.

[–] [email protected] 8 points 1 year ago* (last edited 1 year ago) (3 children)

I'm curious, is there actually so many 42's in the system? (more than 69 sounds unlikely)

What if the LLM is getting tripped up because 42 is always referred to as the answer to "the Ultimate Question of Life, the Universe, and Everything".

So you ask it a question like give a number between 1-100, it answers 42 because that's the answer to "Everything", according to it's training data.

Something similar happened to Gemini. Google discouraged Gemini from giving unsafe advice because it's unethical. Then Gemini refused to answer questions about C++ because it's considered "unsafe" (referring to memory management). But Gemini thinks C++ is "unsafe" (the normal meaning), therefore it's unethical. It's like those jailbreak tricks but from its own training set.

[–] [email protected] 3 points 1 year ago

I certainly hope that’s what happening or maybe it is actually the answer.

[–] [email protected] 8 points 1 year ago

I’m curious, is there actually so many 42’s in the system?

Sort of, it's not actually picking a random number. It does not know what "random" means. It is analyzing the number of times the question "pick a random number" was asked and what the most common responses to that question looked like.

[–] [email protected] 2 points 1 year ago

I'm curious, is there actually so many 42's in the system? (more than 69 sounds unlikely)

From hitchhiker's guide to the galaxy?

[–] [email protected] 21 points 1 year ago (2 children)

LMs aren't thinking, aren't inventing, they are predicting what is supposed to be answered next, so it's expected that they will produce the same results every time

[–] [email protected] 4 points 1 year ago (1 children)

Except it clearly doesn't produce the same result every time. You're not making a good case for whatever you're trying to say.

[–] [email protected] 5 points 1 year ago

They add some fuzziness to it so it doesn't give the exact same result. Say one gets a score of 90, another 85, and other 80. The 90 will be picked more often, but they sometimes let it pick the 85, or even the 80. It's perfectly expected, and you can see that result here with 42 being very common, but then a few others being fairly common, and most being extremely uncommon.

[–] [email protected] 12 points 1 year ago* (last edited 1 year ago)

This graph actually shows a little more about what's happening with the randomness or "temperature" of the LLM.
It's actually predicting the probability of every word (token) it knows of coming next, all at once.
The temperature then says how random it should be when picking from that list of probable next words. A temperature of 0 means it always picks the most likely next word, which in this case ends up being 42.
As the temperature increases, it gets more random (but you can see it still isn't a perfect random distribution with a higher temperature value)

load more comments