this post was submitted on 19 Aug 2024

530 points (98.4% liked)

Technology

70248 readers

3576 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

L4s@hackingne.ws

530

Google is no longer asking — feed the AI or you’re not in search results (pivot-to-ai.com)

submitted 9 months ago by homesweethomeMrL@lemmy.world to c/technology@lemmy.world

100 comments fedilink hide all child comments

cross-posted from: https://awful.systems/post/2178630

(page 2) 50 comments

sorted by: hot top controversial new old

[–] daniskarma@lemmy.dbzer0.com 20 points 9 months ago (2 children)

We do really need to figure how to make some kind of decentralized search engine.

[–] douglasg14b@lemmy.world 9 points 9 months ago* (last edited 9 months ago)

I hope it happens one day, but that's an almost insurmountable task given the scale.

Take the entirety of the fediverse, and it's entire history, and you're probably talking a days worth of search engine indexing compute & storage.

The scale is large and the fediverse is incredibly small. Keeping my fingers crossed, but definitely not holding my breath.

In the meantime, I'll use Kagi.

[–] ColinHayhurst@lemmy.world 15 points 9 months ago

Some discussion on that here: https://lemmy.world/comment/11859761

[–] Snowcano@startrek.website 22 points 9 months ago (1 children)

So how do I actually opt out? My website is just some personal hobby stuff on wordpress that only friends and family look at, I don’t need seo.

[–] ColinHayhurst@lemmy.world 30 points 9 months ago* (last edited 9 months ago) (3 children)

You should put these entries into your robots.txt file.

To block the Google search crawler use for all of your site:

User-agent: Googlebot

Disallow: /

To block the Google AI crawler use:

User-agent: Google-Advanced

Disallow: /

load more comments (3 replies)

[–] Subverb@lemmy.world 58 points 9 months ago (1 children)

We're at a point where not only should the Internet be classified as a utility, so should Search.

[–] Gsus4@mander.xyz 26 points 9 months ago* (last edited 9 months ago)

Yeah, it's not just e.g. water that is the utility, pipes and pumping stations are part of it. Otherwise you have water...uh...somewhere, go get it yourself.

[–] charade_you_are@sh.itjust.works -2 points 9 months ago

I'm not sure of the advantages of showing up in Google search results. It seems like something that I wouldn't want to happen anyway.

[–] RememberTheApollo_@lemmy.world 4 points 9 months ago

Google: "Making AI helpful for everyone..." (..mostly us!)

[–] Damage@feddit.it 3 points 9 months ago (1 children)

Bing to finally overtake Google? Inconceivable!

[–] SkyNTP@lemmy.ml 47 points 9 months ago

Oh look, more anticompetitive shenanigans.

Break Google up. Bring the full force of antitrust down on them.

Anything else is an unmitigated disaster waiting to happen.

[–] Kethal@lemmy.world 5 points 9 months ago* (last edited 9 months ago)

Google is genuinely bad now. I switched to Ecosia which is just Bing with a simpler front end and they use their profits to plant trees. I don't think Ecosia is particularly special though. Duck Duck Go, Bing whatever, they're all better than Google.

Whenever I set up a new computer then search for something, I'm always surprised at first seeing the awful layout and quality of the search results before I realize that I haven't changed the default search from Google. It's awful now. Seriously, how are people using it?

My new favorite way to search is perplexity.ai. It's an AI search tool that summarizes the loads of crap out there so you don't need to read through the junk that people write. It provides sources, unlike using ChatGPT, which is incredibly valuable. All AIs make shit up, so having links to double check it is a must. Unlike Bing Chat, or whatever Microsoft calls it this week, you can ask follow up questions to home in on what you want.

[–] TrumpetX@programming.dev 21 points 9 months ago (4 children)

I've been really happy with Kagi since switching.

[–] daniskarma@lemmy.dbzer0.com 2 points 9 months ago (1 children)

Correct me if I'm wrong but doesn't Kagi still uses google/bing results?

load more comments (1 replies)

[–] Tywele@lemmy.dbzer0.com 2 points 9 months ago (1 children)

Same. It's amazing. I really like the feature where you can prioritise or deprioritise search results from certain websites.

load more comments (1 replies)

[–] ZephyrXero@lemmy.world 2 points 9 months ago

It's definitely better...but. Thanks to Google SEO the internet it's bringing you results from is still filled with shit

[–] WhatsHerBucket@lemmy.world 8 points 9 months ago

Same! I swore I wouldn’t pay for a search engine, but I feel like it’s absolutely worth it, considering the current state of things.

[+] parpol@programming.dev 247 points 9 months ago (5 children)

[deleted]

[–] UprisingVoltage@feddit.it 14 points 9 months ago

Unfortunately, the vast majority of people do not give a single fuck and they will use whatever is preinstalled on their device

[–] Lost_My_Mind@lemmy.world 41 points 9 months ago (2 children)

Part 5 is where I don't see this actually going.

Look at twitter. Now look at mastodon. Tell me which one is more shitty. Now tell me which one has something like 85% of the market, and which one most people haven't heard of.

Just because something it better, doesn't mean people use it. You can fit all of Lemmy in the world in one of the larger NBA size arenas. You can't even fit twitters total user base into some smaller CITIES.

[–] phoenixz@lemmy.ca 1 points 9 months ago (2 children)

Twitter will be dead within 1-2 years, Elon will make sure of that

[–] Lost_My_Mind@lemmy.world 8 points 9 months ago (2 children)

God I hope you're right, but doubt you are.

[–] Womble@lemmy.world 5 points 9 months ago (1 children)

At least in the UK there is now lots of mainstream discussion of "Is it time for people to leave twitter as it's too much of a cesspool". Granted they usually then mention threads or bluesky as an alternative not mastodon, but it is definitely possible for social media companies to die out. Once people start to leave in large numbers it can become a mass exodus (see digg and myspace).

load more comments (1 replies)

[–] hobovision@lemm.ee 14 points 9 months ago (1 children)

He's already owned it for nearly two years. I'd definitely take the over on that bet. I just don't see what Twitter could possibly do that they haven't done already to kill it?

load more comments (1 replies)

[–] Zurgo@lemmy.dbzer0.com 25 points 9 months ago

I'm pretty pessimistic about this:

Say no
Google still scrapes your site to train their AI
People don't care that its wrong, still use Google instead of other search engines

[–] MagicShel@programming.dev 106 points 9 months ago (2 children)

Google results are actually already pretty terrible. They just have tremendous inertia.

[–] doctortran@lemm.ee 7 points 9 months ago (1 children)

We all keep saying this but can anybody point me to which one is better?

I invariably end up having to go back to them because the other search engines all have their own problems.

The issue is the internet is polluted with SEO and all the useful things that used to be spread out are now condensed onto places like Reddit, or places that aren't even being indexed.

[–] MagicShel@programming.dev 9 points 9 months ago (2 children)

Supposedly there's a paid one that is good. I haven't tried. The thing is Google is completely enshittified. They don't have to care about you or the sites you search. So my theory is Bing is better because they are hungrier and anything that takes away market share from Google is good—but I'm fully aware that Microsoft was just as shitty as Google and will be again if they get back on top.

Everything else I know of is either just an alternate front end for one of them or an aggregator of both. So you're right, there's precious little alternative to Google. But it's almost bad enough I'm ready for the return of web rings of good sites vouching for each other.

[–] Sparky@lemmy.blahaj.zone 4 points 9 months ago (3 children)

I assume you're talking about kagi. I pay for their $5/month subscription and it's great

load more comments (3 replies)

[–] bitwaba@lemmy.world 0 points 9 months ago (4 children)

my theory is Bing is better because they are hungrier and anything that takes away market share from Google is good

If you think Microsoft is in the business of innovation and healthy competition, you're wrong.

https://www.wired.com/2011/02/bing-copies-google

load more comments (3 replies)

[–] APassenger@lemmy.world 23 points 9 months ago (1 children)

I stopped using them months ago. I only notice when I'm looking for places (e.g., restaurants, barbers).

I'm not unhappy but may still shop around.

[–] pixel@pawb.social 6 points 9 months ago

yeah, I appreciate the push towards more privacy-centric search engines but as a result searches that are relevant to me geographically on places like startpage are next to useless. I understand why but I wish that local results were a bit better on the alternatives.

load more comments (1 replies)

[–] MadMadBunny@lemmy.ca 2 points 9 months ago

This makes me mad…

[–] woelkchen@lemmy.world 0 points 9 months ago (2 children)

As I understand it, this is only about using search results for summaries. If it's just that and links to the source, I think it's OK. What would be absolutely unacceptable is to use the web in general as training data for text and image generation (=write me a story about topic XY).

[–] elrik@lemmy.world 14 points 9 months ago (1 children)

If it's just that and links to the source, I think it's OK.

No one will click on the source, which means the only visitor to your site is Googlebot.

What would be absolutely unacceptable is to use the web in general as training data for text and image generation.

This has already happened and continues to happen.

[–] woelkchen@lemmy.world 7 points 9 months ago (2 children)

No one will click on the source, which means the only visitor to your site is Googlebot.

That was the argument with the text snippets from news sources. Publishers successfully lobbied for laws to be passed in many countries that required search engine operators to pay fees. It backfired when Google removed the snippets from news sources that demanded fees from Google. Their visitors dropped by a massive amount, 90% or so, because those results were less attractive to Google users to click on than the nicer results with a snippet and a thumbnail. So "No one will click on the source" has already been disproven 10 or so years ago when the snippet issue was current. All those publishers have entered a free of charge licensing agreement with Google and the laws are still in place. So Google is fine, upstart search engines are not because those cannot pressure the publishers into free deals.

This has already happened and continues to happen.

With Gemini?

[–] elrik@lemmy.world 2 points 9 months ago (1 children)

The context is not the same. A snippet is incomplete and often lacking important details. It's minimally tailored to your query unlike a response generated by an LLM. The obvious extension to this is conversational search, where clarification and additional detail still doesn't require you to click on any sources; you simply ask follow up questions.

With Gemini?

Yes. How do you think the Gemini model understands language in the first place?

[–] woelkchen@lemmy.world 0 points 9 months ago

The context is not the same.

It's not the same but it's similar enough when, as the article states, it is solely about short summaries. The article may be wrong, Google may be outright lying, maybe, maybe, maybe.

Google, as by far the web's largest ad provider, has a business incentive to direct users towards the web sites, so the website operators have to pay Google money. Maybe I'm missing something but I just don't see the business sense in Google not doing that and so far I don't see anything approximating convincing arguments.

Yes. How do you think the Gemini model understands language in the first place?

Licensed and public domain content, of which there is plenty, maybe even content specifically created by Google to train the data. "the Gemini model understands language" in itself hardly is proof of any wrongdoing. I don't claim to have perfect knowledge or memory, so it's certainly possible that I missed more specific evidence but "the Gemini model understands language" by itself definitively is not.

[–] melroy@kbin.melroy.org 3 points 9 months ago (1 children)

that latter will be the case rather sooner than later I'm afraid. It's just a matter of time with Google.

[–] woelkchen@lemmy.world 1 points 9 months ago (1 children)

that latter will be the case rather sooner than later I’m afraid. It’s just a matter of time with Google.

If that will actually be the case and passes legal challenges, basically all copyright can be abolished which would definitively have some upsides but also downsides. All those video game ROM decompilation projects would be suddenly in the clear, as those are new source code computer-generated from copyrighted binary code, so not really different from a AI generated image based on a copyrighted image used as training data. We could also ask Gemini write a full-length retelling of Harry Potter and just search, replace all trademarked names, and sell that shit. Evil companies could train an AI on GNU/Linux source codes and tell it to write an operating system. Clearly derived work from GPL code but without any copyright to speak of, all that generated code could be legally closed. I don't like that.

load more comments (1 replies)

load more comments