this post was submitted on 28 Feb 2024
341 points (98.0% liked)

Technology

58906 readers
3715 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to one another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below; to ask if your bot can be added, please contact us.
  9. Check for duplicates before posting; duplicates may be removed.

Approved Bots


founded 1 year ago
(page 2) 16 comments
[–] [email protected] 44 points 8 months ago (10 children)

I'm assuming this just relates to WordPress.com rather than the open-source WordPress.org but it's still a bummer. I've worked with the open source platform for over a dozen years and have started to kinda loathe what it's turned into but I'm not sure I'm yet at the point where I'm ready to migrate a bunch of sites to something else. This could be that push if they keep going down this road.

God, am I getting too old for this shit? I'm a pretty technical person but this AI nonsense is just relentless. I'm not philosophically against the idea of AI; like any tool, it has the potential to better the world. But every tech company and their dog are going all in on using it for commercial bullshit that seems to provide very little value to society. Even fucking Mozilla is going in that direction.

[–] [email protected] 66 points 8 months ago (7 children)

I work in marketing, and every client I work with who has a WordPress website is using AI to write a lot of their content. This is going to lead to circularly trained AI for sure.

[–] [email protected] 23 points 8 months ago (3 children)

Are you sure your clients aren't AI also?

[–] [email protected] 12 points 8 months ago

Well, time to delete my WordPress account then. Gonna be a lot of content I gotta archive before then. ;-;

[–] [email protected] 6 points 8 months ago* (last edited 8 months ago)

I wish I had content and data to sell :(

Oh, wait, I do. But companies are already selling it :(

[–] [email protected] 40 points 8 months ago (3 children)

Bro...tumblr is full of some WEIRD FUCKIN SHIT YO

[–] [email protected] 21 points 8 months ago (1 children)

Hey now, don’t kink shame the weirdos

[–] [email protected] 20 points 8 months ago (1 children)

I know because I was one of those weirdos lol

[–] [email protected] 5 points 8 months ago (1 children)

Got 'em.

Sad they're doing this with Tumblr though. It was fun but I just deleted my 10+ year old account.

[–] [email protected] 3 points 8 months ago (1 children)

haha it's been about that long since I even logged into mine.

[–] [email protected] 25 points 8 months ago (1 children)

Shit like this should be opt-in by default. But no. Instead of respecting the users, they count on ignorance, forgetfulness, and obfuscation for this kind of fuckery.

[–] [email protected] -1 points 8 months ago
[–] [email protected] 18 points 8 months ago (1 children)

Not only am I really glad to not be on Tumblr, but this further shows I shouldn't use WordPress for my website, even though there is an open-source version.

[–] [email protected] 81 points 8 months ago (1 children)

Can we get a list of companies NOT doing this? I'd assume it's going to be much shorter.

[–] [email protected] 41 points 8 months ago (2 children)

All these AI and machine learning companies are taking content directly from websites and ignoring robots.txt files.

If your content is able to be crawled, even without being listed on search engines, I don't think it really matters.
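For anyone unfamiliar with how robots.txt is supposed to work: it's purely advisory, and the check happens on the crawler's side, which is why ignoring it is trivial. Here's a rough sketch of what a well-behaved crawler would do, using Python's standard `urllib.robotparser`. The robots.txt contents, the `example.com` URL, and the `is_allowed` helper are made up for illustration; `GPTBot` is used only as an example user agent.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block one AI crawler, allow everyone else.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url.

    This is the check a compliant crawler runs on itself before
    requesting a page. Nothing on the server side enforces the result.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://example.com/blog/post-1"  # placeholder URL
    print("GPTBot allowed:", is_allowed(ROBOTS_TXT, "GPTBot", url))
    print("SomeOtherBot allowed:", is_allowed(ROBOTS_TXT, "SomeOtherBot", url))
```

A crawler that never runs a check like this can still fetch the page, so actually keeping content out of a crawl takes something server-side, such as authentication or blocking by user agent or IP.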

[–] [email protected] 8 points 8 months ago

This is the best summary I could come up with:


To complicate matters even further, advertising content that isn’t even owned by Automattic, including ads from an old Apple Music campaign, has also reportedly made its way into the training data set.

The plans at Automattic have been so controversial internally, that a product manager has even started pulling his own photos off Tumblr to make sure they’re not used to train AI, according to 404.

Generative AI has become a big business ever since OpenAI first launched ChatGPT in late 2022 and text-prompt image creators soon followed from a number of companies.

But major publishers have complained, with some even filing lawsuits, alleging that much of the data used to train these systems was either pirated or doesn’t constitute “fair use” under existing copyright regimes.

In response to emailed questions on Tuesday, Automattic directed Gizmodo to a new post that more or less confirmed 404 Media’s reporting, while trying to sell the move to consumers as an opportunity to “give you more control over the content you’ve created.”

“We also plan to take that a step further and regularly update any partners about people who newly opt-out and ask that their content be removed from past sources and future training.”


The original article contains 536 words, the summary contains 201 words. Saved 62%. I'm a bot and I'm open source!
