this post was submitted on 19 Feb 2024
517 points (98.9% liked)

Technology

58739 readers
4176 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Reddit user content being sold to AI company in $60M/year deal::It’s being reported that a deal has been struck to allow an unnamed large AI company to use Reddit user...

top 50 comments
sorted by: hot top controversial new old
[–] [email protected] 6 points 8 months ago

This is going to backfire when the content they are selling is used by AI to make bots to make the content that gets sold to make the AI to make bots to make the content.

[–] [email protected] -4 points 8 months ago* (last edited 8 months ago) (1 children)

$60 Million or $60,000? Sometimes people use MM for Million and M for 'Mille' aka thousand. Other times people use M for Million and k for Thousand. Not a great article if they can't clarify that.

[–] [email protected] 4 points 8 months ago

I've never seen Mille used in reference to money. Only in advertising (eg CPM = cost per mille = cost per thousand ad impressions)

But to answer your question, the original Bloomberg article says 60 million.

[–] [email protected] 10 points 8 months ago (1 children)

Shower thought: what if a large number of people made lots of posts and comments on reddit using only AI generated content?

[–] [email protected] 13 points 8 months ago* (last edited 8 months ago) (1 children)

Considering the spam problem, in a way, it sort of is already happening.

It's possible that par tof the API changes might have been to curb off that kind of behaviour before people decided to go and do just that too, or stop them using bots to wipe their profiles out.

[–] [email protected] 4 points 8 months ago* (last edited 8 months ago)

Honestly, you just need to convince people to go through their comments and break any chains with nonsense. I bet that they are training conversational abilities (I mean what other good is the data set, it's not like redditors are experts, or when there is that the experts get upvoted at all.)

[–] [email protected] 7 points 8 months ago* (last edited 8 months ago)

The annoying part is that the only use of "AI" I have so far, is "translating reddit post titles to understandable English". Once they train their "AI" on whatever is there, I probably won't be able to understand the "translation" anymore... Sucks. 😬

[–] [email protected] 34 points 8 months ago (1 children)

Won't be long long before reddit is selling 90% AI generated content passing for human generated content!

[–] [email protected] 6 points 8 months ago

Feels like they're already there.

[–] [email protected] 58 points 8 months ago (3 children)

Putting aside pretty much everything else about this announcement: That’s… shockingly cheap.

[–] [email protected] 23 points 8 months ago

Probably because it was harvested long before they locked API. I suspect it's not a purchase but a way to legitimize the datasets already in the works since Reddit said they are now trading them. And our favorite CEO struggles to turn any profits, so he hardly had any leverage to ask for more.

[–] [email protected] 9 points 8 months ago (2 children)

1m for every IQ point of the average Reddit user

[–] [email protected] 18 points 8 months ago (2 children)

lol dude most of us were over there for years before jumping ship and coming here

Wait

Fuck

[–] [email protected] 1 points 8 months ago (1 children)

Before I deleted my account I removed all posts and changed all my comments to a complaint about the enshittification under way. Two accounts, 13 and 11 years old, it was a lot of work.

[–] [email protected] 1 points 8 months ago

This is awesome. Like chucking some frozen prawns under the floor boards when your landlord kicks you out.

[–] [email protected] 14 points 8 months ago

Shhh, let's just pretend the average IQ over there dropped when we left.

[–] [email protected] 12 points 8 months ago* (last edited 8 months ago)

It's mostly data that's publically available. It's more of a gamble I think, it's only worth anything if the government decides you need to pay for the data you use in training.

[–] [email protected] 33 points 8 months ago* (last edited 8 months ago) (2 children)

Those AI companies should love fediverse then. I mean, all data here is basically open for anyone to grab. Heck, they don't even need to grab the data, just run their own instance and the federation data will flood in on its own.

[–] [email protected] 1 points 8 months ago (1 children)

This was my thought exactly. Shouldn’t there be a “no_ai.txt” on the servers somehow?

[–] [email protected] 4 points 8 months ago* (last edited 8 months ago)

That would be about as effective as robots.txt, unfortunately.

[–] [email protected] 10 points 8 months ago

Oh, don’t give them ideas please!

load more comments
view more: next ›