this post was submitted on 21 Mar 2025
1444 points (99.4% liked)

Technology

68401 readers
3555 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
(page 5) 29 comments
sorted by: hot top controversial new old
[–] [email protected] 9 points 2 weeks ago* (last edited 2 weeks ago) (2 children)

Be great if these reinforced facts.

Earth us an imperfect oblate spheroid.

Humans landed on moon.

Taiwan is an independent nation.

Edit: incorporated better information

load more comments (2 replies)
[–] [email protected] 3 points 2 weeks ago

So this showed up last week: https://github.com/raminf/RoboNope-nginx

Similar vibe, minus the AI.

[–] [email protected] 24 points 2 weeks ago (1 children)

So we're burning fossil fuels and destroying the planet so bots can try to deceive one another on the Internet in pursuit of our personal data. I feel like dystopian cyberpunk predictions didn't fully understand how fucking stupid we are...

load more comments (1 replies)
[–] [email protected] 6 points 2 weeks ago

I am not happy with how much internet relies on cloudflare. However, they have a strong set of products

[–] [email protected] 19 points 2 weeks ago* (last edited 2 weeks ago) (2 children)

So they rewrote Nepenthes (or Iocaine, Spigot, Django-llm-poison, Quixotic, Konterfai, Caddy-defender, plus inevitably some Rust versions)

Edit, but with ✨AI✨ and apparently only true facts

load more comments (2 replies)
[–] [email protected] 163 points 2 weeks ago* (last edited 2 weeks ago) (12 children)

this is some fucking stupid situation, we somewhat got a faster internet and these bots messing each other are hogging the bandwidth.

[–] [email protected] 7 points 2 weeks ago

It's what I've been saying about technology for the past decade or two .... we've hit an upper limit to our technological development ... that limit is on individual human greed where small groups of people or massively wealthy people hinder or delay any further development because they're always trying to find ways to make money off it, prevent others from making money off it, monopolize an area or section of society .... capitalism is literally our world's bottleneck and it's being choked off by an oddly shaped gold bar at this point.

load more comments (10 replies)
[–] [email protected] 12 points 2 weeks ago (3 children)

while allowing legitimate users and verified crawlers to browse normally.

What is a "verified crawler" though? What I worry about is, is it only big companies like Google that are allowed to have them now?

[–] [email protected] 20 points 2 weeks ago (1 children)

I assume a crawler which adheres to robots.txt

[–] [email protected] 5 points 2 weeks ago (1 children)

I would love to think so. But the word "verified" suggests more.

load more comments (1 replies)
load more comments (1 replies)
[–] [email protected] 24 points 2 weeks ago (2 children)

Will this further fuck up the inaccurate nature of AI results? While I'm rooting against shitty AI usage, the general population is still trusting it and making results worse will, most likely, make people believe even more wrong stuff.

[–] [email protected] 32 points 2 weeks ago* (last edited 2 weeks ago) (6 children)

The article says it's not poisoning the AI data, only providing valid facts. The scraper still gets content, just not the content it was aiming for.

E:

It is important to us that we don’t generate inaccurate content that contributes to the spread of misinformation on the Internet, so the content we generate is real and related to scientific facts, just not relevant or proprietary to the site being crawled.

load more comments (5 replies)
[–] [email protected] 212 points 2 weeks ago (3 children)

That's just BattleBots with a different name.

[–] [email protected] 10 points 2 weeks ago (3 children)

They should program the actions and reactions of each system to actual battle bots and then televise the event for our entertainment.

load more comments (3 replies)
[–] [email protected] 58 points 2 weeks ago (1 children)
[–] [email protected] 37 points 2 weeks ago (1 children)

Ok, I now need a screensaver that I can tie to a cloudflare instance that visualizes the generated "maze" and a bot's attempts to get out.

load more comments (1 replies)
load more comments (1 replies)
load more comments
view more: ‹ prev next ›