this post was submitted on 27 Sep 2024
785 points (98.6% liked)

Technology

58689 readers
4032 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Anyone who has been surfing the web for a while is probably used to clicking through a CAPTCHA grid of street images, identifying everyday objects to prove that they're a human and not an automated bot. Now, though, new research claims that locally run bots using specially trained image-recognition models can match human-level performance in this style of CAPTCHA, achieving a 100 percent success rate despite being decidedly not human.

ETH Zurich PhD student Andreas Plesner and his colleagues' new research, available as a pre-print paper, focuses on Google's ReCAPTCHA v2, which challenges users to identify which street images in a grid contain items like bicycles, crosswalks, mountains, stairs, or traffic lights. Google began phasing that system out years ago in favor of an "invisible" reCAPTCHA v3 that analyzes user interactions rather than offering an explicit challenge.

Despite this, the older reCAPTCHA v2 is still used by millions of websites. And even sites that use the updated reCAPTCHA v3 will sometimes use reCAPTCHA v2 as a fallback when the updated system gives a user a low "human" confidence rating.

(page 3) 50 comments
sorted by: hot top controversial new old
[–] [email protected] 3 points 2 weeks ago (1 children)

Our long international nightmare is finally over!

load more comments (1 replies)
[–] [email protected] 18 points 2 weeks ago (4 children)

I can see a future where the Internet is completely run by bots and AI to the point where no human actually uses the Internet anymore.

It's like an island that gets overrun with rats - there are just too many to deal with so you leave.

[–] [email protected] 6 points 2 weeks ago (2 children)

I'm already doing that now. If Lemmy starts showing signs of fuckery I'm out. I'll switch back to magazines.

load more comments (2 replies)
load more comments (3 replies)
[–] [email protected] 17 points 2 weeks ago

Cool, so can Google shut it down now?

[–] [email protected] 29 points 2 weeks ago (3 children)

Buster is awesome to get past recaptcha. I use it with my own Speech to Text API key since its free from Google. Using Google to beat Google.

https://github.com/dessant/buster

[–] [email protected] 6 points 2 weeks ago

It's so funny that this exists. I'm going to check it out!!

load more comments (2 replies)
[–] [email protected] 77 points 2 weeks ago (4 children)

And yet I can't beat the CAPTCHAs because reCAPTCHA doesn't like VPNs lol

[–] [email protected] 26 points 2 weeks ago

Captcha these days isn't even really a CAPTCHA in the traditional sense since most of the work it does is based on filtering of IP and browser fingerprinting, with a certain level of gamification because the goal is not just to keep out the people they fight against, but to waste their time, would work great if it didn't waste normal people's time, while real bad actors have easy ways to get around it.

load more comments (3 replies)
[–] [email protected] 42 points 2 weeks ago (2 children)

Well yeah, I'd hope so, that's the entire point.

Catcha's data collection always was with the intent for training ai on these skills. That's "the point" of them.

It's reasonable to expect that the older version of captchas can now be beaten by modern ai, because they're often literally trained on that exact data to beat it.

Captcha effectively is free to use on websites as a tool because the data collection is the "payment", they then license that data out to people like OpenAI to train with for stuff like image recognition.

It's why ai is progressing so fast, captchas are one of humanity's long term collected data silos that are very full now.

We are going to have to keep progressing the complexity of catches as it will be the only way to catch modern AIs, and in turn it will collect more data to improve it.

[–] [email protected] 3 points 2 weeks ago* (last edited 2 weeks ago) (4 children)

We are going to have to keep progressing the complexity of catches as it will be the only way to catch modern AIs, and in turn it will collect more data to improve it.

I wanted to use 4chan alot before I came here, but FUCK that slider capcha. I bailed after the first time I didn't pass.

load more comments (4 replies)
[–] [email protected] 12 points 2 weeks ago

Yeah, my understanding is that these capchas were made to harvest data to use for AI/Autopilot driven cars. That's why they are always having you identify motorcycles, bycicles, crosswalks, stoplights, busses, etc. It's all stuff that automatic driving cars have had a hard time identifying.

[–] [email protected] 35 points 2 weeks ago (4 children)

When it's asking for motorcycles but it's clearly a scooter

[–] [email protected] 18 points 2 weeks ago (2 children)

That tip of a handle bar that makes you wonder if that square counts or not.

load more comments (2 replies)
[–] [email protected] 13 points 2 weeks ago (1 children)

I had one with one of those Motorcycles with the long handles, apparently they aren't part of the bike, but the dudes foot holding it up is.

load more comments (1 replies)
[–] [email protected] 26 points 2 weeks ago (2 children)

Or, like, "there's the bottom 10% of a traffic light in this one. Do I click that box? Ia that supposed to count?"

[–] [email protected] 8 points 2 weeks ago (14 children)

What they are doing is comparing your answer and seeing if it is consistent with how it has been answered previously. They realize that not everyone is going to give the exact same answer, so as long as you answer it in a way that enough other people have answered it, it should let you in.

I'll usually go with the minimum number of clicks that I think will get me through, since I'm lazy and it'll also at times slow down how fast you can click which is annoying.

I'll also answer them wrong if I think it's a mistake that enough other people will make. "Yes... that RV over there is a bus..."

load more comments (14 replies)
[–] [email protected] 6 points 2 weeks ago (1 children)

Does the backside of a traffic light even count? What about these strange traffic lights that have more boarder than light?

[–] [email protected] 7 points 2 weeks ago

How about "do they want just the bulbs or the pole holding it up?"

load more comments (1 replies)
[–] [email protected] 12 points 2 weeks ago (4 children)

I never get the first one and rarely the second one. If it says to click all the squares with motorcycles and it’s just the one big picture, am I supposed to click stuff like the tire and mirrors? I always do and never get it right. Then most of the time they ask me to identify motorcycles, they show me motor scooters and what am I supposed to do then? I think I just need to get one of these bots to do it for me.

[–] [email protected] 2 points 2 weeks ago (6 children)

A motor scooter is a motorcycle in the eyes of the law.

load more comments (6 replies)
[–] [email protected] 4 points 2 weeks ago (1 children)

Fwiw they aren't really asking about the motorcycle. I mean they are but they are washing your mouse movements and how fast you click through the images. It's okay to get a few images wrong.

[–] [email protected] 8 points 2 weeks ago

Not quite.

It's mostly wisdom of the crowd, as it always has been.

As long as you mostly click the same squares most other people click, you pass.

You often at random get 2-3 images because 2 of them are actual checks, but the third is a new image that you auto pass and they're using it to gather data on what the average clicks are on it.

load more comments (2 replies)
[–] [email protected] 18 points 2 weeks ago

My score is lower.

[–] [email protected] 2 points 2 weeks ago

Unless this was something people could use i dont rly see it becoming much of a problem. Most people dont even use adblockers

[–] [email protected] 1 points 2 weeks ago (1 children)

Ai is ruining the internet.

[–] [email protected] 6 points 2 weeks ago

Corporations have already done it years ago

[–] [email protected] 9 points 2 weeks ago* (last edited 2 weeks ago) (1 children)

Thank God this means i can stop wondering if i should click on the... the 13 pixels from the fucking bike in that one corner square or wondering if i should count the scooter as a motorcycle fuck i am so tired of that shit

[–] [email protected] 2 points 2 weeks ago

Complete the obligatory "is this a staircase or street crossing" round only to be roundhouse kicked back to the beginning.

[–] [email protected] 13 points 2 weeks ago (1 children)

I mean, we literally train them by completing the CAPTCHAs. Why do you think you were picking things like bikes, traffic lights, cars, and busses? The only question now is what's next...

[–] [email protected] 5 points 2 weeks ago (2 children)

they embed dark souls into the browser

[–] [email protected] 3 points 2 weeks ago

In order to pay your utility bill, you have to beat the Undertale Sans fight in Genocide mode

load more comments (1 replies)
[–] [email protected] 7 points 2 weeks ago (2 children)

Great, so now can I get an add-on to my browser that skips these?

[–] [email protected] 2 points 2 weeks ago (1 children)

In use an add-on that does 90% of these for me already on Firefox. I would tell you what it's called but I'm not at my PC.

Which (on a side note) I'd totally go downstairs and check for you, but I just sprained my ankle real bad, and am dreading stairs. Sorry :(

[–] [email protected] 3 points 2 weeks ago

Sorry to hear about your ankle. When you're able to, I'd also like to know what the add-on is

[–] [email protected] 7 points 2 weeks ago* (last edited 2 weeks ago)

As someone who can not decide if 3 pixels of a motorcycle counts as a correct square, I need this add on.

load more comments
view more: ‹ prev next ›