this post was submitted on 28 Jun 2024
934 points (98.7% liked)
Science Memes
Which is what a parrot does.
Yeah this is the exact criticism. They recombine language pieces without really doing language. The end result looks like language, but it lacks any of the important characteristics of language such as meaning and intention.
If I say "Two plus two is four" I am communicating my belief about mathematics.
If an LLM emits "two plus two is four", it is outputting a stochastically selected series of tokens linked by probabilities derived from training data. Whether the statement is true or false is accidental.
Hence, stochastic parrot.
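To make "stochastically selected series of tokens" concrete, here is a minimal sketch of sampling a next token from a probability table. The table, its probabilities, and the function name are all made up for illustration; a real LLM computes these probabilities with a neural network rather than looking them up.

```python
import random

# Hypothetical toy table: context -> next-token probabilities.
# The numbers are invented for illustration, not taken from any real model.
NEXT_TOKEN_PROBS = {
    ("two", "plus", "two", "is"): {"four": 0.97, "five": 0.02, "fish": 0.01},
}


def sample_next_token(context, rng):
    """Pick one next token at random, weighted by its probability."""
    dist = NEXT_TOKEN_PROBS[tuple(context)]
    tokens = list(dist)
    weights = [dist[token] for token in tokens]
    return rng.choices(tokens, weights=weights, k=1)[0]


rng = random.Random(0)  # fixed seed so the run is repeatable
context = ["two", "plus", "two", "is"]
samples = [sample_next_token(context, rng) for _ in range(1000)]
counts = {token: samples.count(token) for token in set(samples)}
print(counts)
```

Most samples come out as "four" because that token carries most of the probability mass, but nothing in the sampling step checks whether the result is true.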
If I train an LLM to do math, for the training data I generate a+b=c statements, never showing it the same one twice. It would be pointless for it to "memorize" every single question and answer it gets, since it would never see that question again. The only way it would be able to generate correct answers would be if it gained a concept of what numbers are, and how the add operation operates on them to create a new number.
Rather than memorizing and parroting it would have to actually understand it in order to generate responses.
It's called generalization. It's why large amounts of data are required (if you show the same data again and again, then memorizing becomes a viable strategy).
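A rough sketch of that setup, assuming nothing about how any real model is trained: generate distinct a+b=c statements, hold some out, and check how a pure memorizer (a lookup table) does on the held-out ones. The function names and sizes here are hypothetical.

```python
import random


def make_addition_examples(n, max_operand, rng):
    """Return n distinct 'a+b=c' strings with operands in [0, max_operand]."""
    side = max_operand + 1
    if n > side * side:
        raise ValueError("not enough distinct (a, b) pairs")
    examples = []
    # Sampling distinct indices guarantees no (a, b) pair repeats.
    for index in rng.sample(range(side * side), n):
        a, b = divmod(index, side)
        examples.append(f"{a}+{b}={a + b}")
    return examples


def split_examples(examples, n_held_out):
    """Split into a training part and a held-out part with no overlap."""
    cut = len(examples) - n_held_out
    return examples[:cut], examples[cut:]


rng = random.Random(0)
examples = make_addition_examples(1000, 999, rng)
train, held_out = split_examples(examples, 100)

# A pure memorizer: maps each question seen in training to its answer.
memory = dict(example.split("=") for example in train)

# Count held-out questions the memorizer can answer from its table.
memorizer_hits = sum(1 for ex in held_out if ex.split("=")[0] in memory)
print(memorizer_hits)
```

Since no question repeats, the lookup table answers none of the held-out questions, so any correct answers there would have to come from something other than memorization. Whether an actual LLM gets them right is exactly what's being argued about in this thread.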
Seems like a pointless distinction. You were told it, so you believe it to be the case? Why can't we say the LLM outputs what it believes is the correct answer? You're both just making some statement based on your prior experiences, which may or may not be true.
If you fine-tune an LLM on math equations, odds are it won't actually learn how to reliably solve novel problems, just the same as it won't become a subject matter expert on any topic. But it's a lot harder to write simple math that "looks, but is not, correct" than it is to waffle vaguely about a topic. The idea of an LLM creating a robust model of the semantics of the text it's trained on is, at face value, plausible; it just doesn't seem to actually happen in practice.
Prompt:
ChatGPT:
It's trained to generate what is most plausible, but with math, the only plausible response is the correct answer (assuming it has been trained on data where that has been the case)
ChatGPT uses auxiliary models to perform certain tasks like basic math and programming. Your explanation about plausibility is simply wrong.
It has access to a Python interpreter and can use that to do math, but it shows you that this is happening, and it did not when I asked it.
I asked it to do another operation, this time specifying I wanted it to use an external tool, and it did.
You have access to a dictionary; that doesn't prove you're incapable of spelling simple words on your own. Like, goddamn, people, what's with the hate boners for AI around here?
That's not what I meant.
??? You just don't understand the difference between an LLM and a chat application using many different tools.