this post was submitted on 05 Feb 2025
218 points (81.9% liked)

Technology

61632 readers
3425 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 
(page 2) 50 comments
sorted by: hot top controversial new old
[–] [email protected] 1 points 17 hours ago

I know right? It's not a fruit it's a vegetable!

[–] [email protected] 6 points 18 hours ago (2 children)

This is literally just a tokenization artifact. If I asked you how many r’s are in /0x5273/0x7183 you’d be confused too.

load more comments (2 replies)
[–] [email protected] 16 points 18 hours ago (7 children)

This is a bad example.. If I ask a friend "is strawberry spelled with one or two r's"they would think I'm asking about the last part of the word.

The question seems to be specifically made to trip up LLMs. I've never heard anyone ask how many of a certain letter is in a word. I've heard people ask how you spell a word and if it's with one or two of a specific letter though.

If you think of LLMs as something with actual intelligence you're going to be very unimpressed.. It's just a model to predict the next word.

[–] [email protected] 26 points 18 hours ago (2 children)

If you think of LLMs as something with actual intelligence you're going to be very unimpressed.. It's just a model to predict the next word.

This is exactly the problem, though. They don’t have “intelligence” or any actual reasoning, yet they are constantly being used in situations that require reasoning.

[–] [email protected] 5 points 18 hours ago (2 children)

Maybe if you focus on pro- or anti-AI sources, but if you talk to actual professionals or hobbyists solving actual problems, you'll see very different applications. If you go into it looking for problems, you'll find them, likewise if you go into it for use cases, you'll find them.

load more comments (2 replies)
load more comments (1 replies)
load more comments (6 replies)
[–] [email protected] 5 points 19 hours ago

I asked mistral/brave AI and got this response:

How Many Rs in Strawberry

The word "strawberry" contains three "r"s. This simple question has highlighted a limitation in large language models (LLMs), such as GPT-4 and Claude, which often incorrectly count the number of "r"s as two. The error stems from the way these models process text through a process called tokenization, where text is broken down into smaller units called tokens. These tokens do not always correspond directly to individual letters, leading to errors in counting specific letters within words.

[–] [email protected] 3 points 19 hours ago

I can already see it...

Ad: CAN YOU SOLVE THIS IMPOSSIBLE RIDDLE THAT AI CAN'T SOLVE?!

With OP's image. And then it will have the following once you solve it: "congratz, send us your personal details and you'll be added to the hall of fame at CERN Headquarters"

[–] [email protected] 0 points 19 hours ago

From a linguistic perspective, this is why I am impressed by (or at least, astonished by) LLMs!

[–] [email protected] 33 points 19 hours ago (2 children)

I think I have seen this exact post word for word fifty times in the last year.

[–] [email protected] 16 points 19 hours ago (2 children)

Has the number of "r"s changed over that time?

[–] [email protected] 2 points 18 hours ago (4 children)
load more comments (4 replies)
load more comments (1 replies)
load more comments (1 replies)
[–] [email protected] 0 points 20 hours ago (3 children)

The terrifying thing is everyone criticising the LLM as being poor, however it excelled at the task.

The question asked was how many R in strawbery and it answered. 2.

It also detected the typo and offered the correct spelling.

What’s the issue I’m missing?

[–] [email protected] 3 points 19 hours ago (1 children)

There's also a "r" in the first half of the word, "straw", so it was completely skipping over that r and just focusing on the r's in the word "berry"

[–] [email protected] 1 points 18 hours ago

It wasn't focusing on anything. It was generating text per its training data. There's no logical thought process whatsoever.

[–] [email protected] 5 points 19 hours ago

Uh oh, you’ve blown your cover, robot sir.

[–] [email protected] 20 points 20 hours ago (1 children)

The issue that you are missing is that the AI answered that there is 1 'r' in 'strawbery' even though there are 2 'r's in the misspelled word. And the AI corrected the user with the correct spelling of the word 'strawberry' only to tell the user that there are 2 'r's in that word even though there are 3.

[–] [email protected] -1 points 19 hours ago (1 children)

Sure, but for what purpose would you ever ask about the total number of a specific letter in a word? This isn't the gotcha that so many think it is. The LLM answers like it does because it makes perfect sense for someone to ask if a word is spelled with a single or double "r".

[–] [email protected] 0 points 18 hours ago (1 children)

It makes perfect sense if you do mental acrobatics to explain why a wrong answer is actually correct.

load more comments (1 replies)
load more comments
view more: ‹ prev next ›