Technology

Make illegally trained LLMs public domain as punishment (www.theregister.com)

submitted 1 day ago by [email protected] to c/[email protected]

It's all made from our data, anyway, so it should be ours to use as we want

[–] [email protected] 22 points 19 hours ago* (last edited 6 hours ago) (1 children)

"Given they were trained on our data, it makes sense that it should be public commons – that way we all benefit from the processing of our data"

I wonder how many people besides the author of this article are upset solely about the profit-from-copyright-infringement aspect of automated plagiarism and bullshit generation, and thus would be satisfied by the models being made more widely available.

The inherent plagiarism aspect of LLMs seems far more offensive to me than the copyright infringement, but both of those problems pale in comparison to the effects on humanity of masses of people relying on bullshit generators with outputs that are convincingly-plausible-yet-totally-wrong (and/or subtly wrong) far more often than anyone notices.

I liked the author's earlier very-unlikely-to-be-met-demand activism last year better:

"I just sent @OpenAI a cease and desist demanding they delete their GPT 3.5 and GPT 4 models in their entirety and remove all of my personal data from their training data sets before re-training in order to prevent #ChatGPT telling people I am dead."

...which at least yielded the amusingly misleading headline "OpenAI ordered to delete ChatGPT over false death claims" (it's technically true: a court didn't order it, but a guy who goes by the name "That One Privacy Guy" while blogging on LinkedIn did).

[–] [email protected] 53 points 19 hours ago (1 children)

It's not punishment. LLMs do not belong to them; they belong to all of humanity. Tear down the enclosing fences.

This is our common heritage, not OpenAI's private property.

[–] [email protected] 0 points 20 hours ago

Yes!

[–] [email protected] 2 points 21 hours ago* (last edited 21 hours ago)

Only if they were trained on public material.

[–] [email protected] 5 points 21 hours ago (1 children)

Delete them. Wipe their databases. Make the companies start from scratch with new, ethically acquired training data.

[–] [email protected] 2 points 19 hours ago (3 children)

Mmm, yes, so all that electricity is pure waste.

[–] [email protected] 61 points 21 hours ago (3 children)

A similar argument can be made for nationalizing corporations that break various laws, betray public trust, etc.

I'm not commenting on the virtues of such an approach, but I think it is fair to say that it is unrealistic, especially for countries like the US which fetishize profit at any cost.

[–] [email protected] 8 points 19 hours ago

Yes, mining companies should all be nationalised for digging up the country's ground and putting carbon in the country's air.

[–] [email protected] 1 points 22 hours ago

Doesn't seem like this helps out all the writers / artists that the LLM stole from.
