103
NVIDIA: Copyrighted Books Are Just Statistical Correlations to Our AI Models * TorrentFreak
(torrentfreak.com)
1. Posts must be related to the discussion of digital piracy
2. Don't request invites, trade, sell, or self-promote
3. Don't request or link to specific pirated titles, including DMs
4. Don't submit low-quality posts, be entitled, or harass others
📜 c/Piracy Wiki (Community Edition):
💰 Please help cover server costs.
Ko-fi | Liberapay |
Aren’t MP3s just a statistical correlation?
Besides, you really don’t need to zoom in on “but muh license agreement” to roast these AI turds.
They’re very clear: We’re gonna put creatives out of work, we’re gonna sell a unified product to replace them, and we’re gonna use their own labor to build their replacements.
That’s anticompetitive.
Nail em on that instead of trying to thread the needle on reining in the tech lords without damaging e.g. linguistic analysis researchers.
The tools exist for creatives to use.
Yes, but: it's short sighted, and wrong. Until we have a sea change in the LLM/AGI space, "creatives" will be needed for seed data. LLMs that are recursively trained on their own output degrade and produce worse output over time.
The "yes" part is that companies looking to replace paying people for their work, but still hoping that Creative Commons types are still posting online for free harvesting.