this post was submitted on 30 May 2024
303 points (95.8% liked)
Fediverse
28713 readers
548 users here now
A community to talk about the Fediverse and all it's related services using ActivityPub (Mastodon, Lemmy, KBin, etc).
If you wanted to get help with moderating your own community then head over to [email protected]!
Rules
- Posts must be on topic.
- Be respectful of others.
- Cite the sources used for graphs and other statistics.
- Follow the general Lemmy.world rules.
Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration), Search Lemmy
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I think AI is annoying AF and Elon Musk can fuck right off.
But this "license disclaimer" has the energy of mom posting one of these "I herby forbid facebook to use my images and personal data" disclaimer to their facebook profile.
And adding this idocy to literally every lemmy comment just makes you look like a right twat.
Well, it does render the comment useless in terms of training an LLM..
I don't think the license will do anything legally, but I hope the inclusion of the license poisons some data for LLM training. Unfortunately, it is all really uniform across all the people doing it and all their comments, so it will be easy to strip out.
Any individual action can be combatted easily. A million different signatures and headers is a whole different .
Mind you, LLM training data is polluted with anything and everything, including other languages. Recently, the best performance has been reached using higher quality data.
It's like not throwing garbage on the streets in a polluted city. Doesn't really change the big picture, but if everybody did it it would make a difference.
See: users avoiding their content getting shadow banned using alternate spellings like s3x.