OTOH people are better at filtering out, or at least recognising gibberish than LLMs. At least for now.
You are right about the fediverse being used for training content though.
I'm curious about the levels of bot posting compared to xitter etc. A low rate here would make it even more attractive to prevent model collapse.
My point is that if we turn up our gibberish dial now then at least our llms will be learning the wrong thing & we have some control.
There is still a lot of understanding that we do automatically that an llm will never do. I still 4eckon I can spot gibberish better than an llm & I would like to keep it that way
Or we just give up. As you can see I have mostly given up.