this post was submitted on 17 Feb 2024
1058 points (98.8% liked)

Technology

69494 readers
4313 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
(page 3) 50 comments
sorted by: hot top controversial new old
[–] [email protected] 113 points 1 year ago (3 children)

This is what the 3rd party access to API was really all about.

When API access was allowed , all reddit content was effectively free: They needed to ban 3rd party apps so they could sell the accumulated content. I expect using content to train AI also factors into it.

load more comments (3 replies)
[–] [email protected] 58 points 1 year ago (1 children)

Considering how much of Reddit is already bots, I'm sure this will end fantastically.

load more comments (1 replies)
[–] [email protected] 23 points 1 year ago

I wish there was a license for content like the GPL, that states if you use this content to train generative AI, the model must be open source. Not sure that would legally be enforceable though (due to fair-use).

[–] [email protected] 26 points 1 year ago (5 children)

Good thing I scrubbed all of my posts and comments that I could. Fuck that site, straight up and down.

[–] [email protected] 21 points 1 year ago (3 children)

You really think they don't have your original comments stored?

[–] [email protected] 28 points 1 year ago (3 children)

It's literally been proven that they do. A guy here on Lemmy was a very common poster on some tech support subreddit. He used one of those account scrubbers and deleted his account. He went back to look a few weeks later and all his comments were back.

load more comments (3 replies)
load more comments (2 replies)
load more comments (4 replies)
[–] [email protected] 27 points 1 year ago (4 children)

Damn it. I haven't deleted my account due to how many people I've supported and helped, I stopped using it while ago. It seems I'll have to.

[–] [email protected] 18 points 1 year ago (1 children)

I wouldn't bother. They'll just mark all your stuff DELETED=1 and feed it to their AI anyway.

load more comments (1 replies)
load more comments (3 replies)
[–] [email protected] 24 points 1 year ago

In before poisoning your comments on Reddit turns into the new protest.

[–] [email protected] 12 points 1 year ago (3 children)

Good thing I had multiple bots overwrite my content before I deleted it all. Not that someone couldn't recover it, I'm not naive. But the AI bots should miss me.

[–] [email protected] 5 points 1 year ago (1 children)

Any suggestion on the best way to do that?

[–] [email protected] 3 points 1 year ago (1 children)

There is a Plugin RES (Reddit Enhancement Suite) for Firefox, which could be run on the classic frontend of Reddit to delete everything you posted. https://www.alphr.com/how-to-delete-all-reddit-posts/

load more comments (1 replies)
load more comments (2 replies)
[–] [email protected] 9 points 1 year ago

"Its content", sure.

[–] [email protected] 10 points 1 year ago* (last edited 1 year ago) (5 children)

I feel like AI companies have been scraping Reddit for their datasets already since the beginning and without permission. In fact, unless there's been a regulation change that i'm not aware of, i'm not sure why they would have Reddit "sign away" the data when they can just scrape it.

Also dubious if the current form of AI has a future. They seem like they should revolutionize every sector when you look at their capacities, but in practice their applications might be more limited than we thought?

Anyway, if Reddit does go public i will be deleting my account within the hour. The only reason i haven't yet is that i've been a moderator of the same subreddit for eight years and it's the only thing that's been consistent in my life in that time, i'm kind of attached. The reason i will is i didn't sign up to create value for shareholders, i signed up to create value for a community.

[–] [email protected] 7 points 1 year ago (1 children)

You need to go ahead and delete your account and give up the ghost on modding whatever sub you are referring to. I’m tired of these types of posts where you are both beholden to Reddit and also not. Pick a dang side.

load more comments (1 replies)
[–] [email protected] 3 points 1 year ago* (last edited 1 month ago)

deleted by creator

load more comments (3 replies)
[–] [email protected] 12 points 1 year ago* (last edited 1 year ago)

I am not sure on what I'm going to say, but I think that LLMs are a technological dead end. They might get some use now, but eventually the industry will shift towards better models for machine text generation. And, if those models rely on a tiny corpus of hand-reviewed data, instead of shoving down as much text as possible into the model (the first "L" in "LLM" is "large"), then Reddit posts/comments will become outright useless.

In other words: Reddit is degrading further the trust of its userbase, and it might not even get much in return.

[–] [email protected] 82 points 1 year ago (3 children)

Considering some of the very wrong and upvoted domain specific knowledge I've seen on Reddit over the years I'm not sure the training data is going to be useful for much beyond what every other model can do.

[–] [email protected] 3 points 1 year ago (1 children)

I can only assume they are training some specific model for something appearing more human like.

As useless as that will be considering how fucking wildly different we type

load more comments (1 replies)
[–] [email protected] 16 points 1 year ago (1 children)

lol subreddits with troll names like trees vs marijuana enthusiasts. Good fun. John cena has one also but can’t recall which subreddit is actually about John cena though.

[–] [email protected] 14 points 1 year ago

Potato salad

[–] [email protected] 46 points 1 year ago (2 children)

The legal advice in /r/legaladvice was some of the worst garbage I've ever seen. I have zero doubt numerous had bad outcomes, at best wasting money and time, at worst spending years in jail because of things that sub told them to say and do. Zero doubt.

[–] [email protected] 7 points 1 year ago (1 children)

But almost every answer is the same. "You need to speak to an attorney".

load more comments (1 replies)
[–] [email protected] 20 points 1 year ago

That sub was mostly cops just repeating their own bad interpretation of the law. Terrible.

[–] [email protected] 7 points 1 year ago

Is this why the privacy policy was updated?

load more comments
view more: ‹ prev next ›