this post was submitted on 13 Jul 2024
364 points (100.0% liked)

196

16238 readers
1782 users here now

Be sure to follow the rule before you head out.

Rule: You must post before you leave.

^other^ ^rules^

founded 1 year ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[โ€“] [email protected] 2 points 2 months ago (1 children)

I mean, yes?

That's very pithy, but the material used as training data was probably produced by artists attempting to create art using tools (ai and otherwise), as well as more mundane data designed and produced by humans with no ai tools and some produced by humans with almost exclusively ai tools.

[โ€“] [email protected] 2 points 2 months ago

You probably live in a different world than I do.

Don't chicken/egg this. All of the training data was man-made at some point. Until the first LLMs started outputting based on it.

Secondly, the amount of human-produced content and LLM-produced content that's in the training data is incomparable. And will continue to be so. Otherwise the models break.