this post was submitted on 19 Oct 2024
101 points (88.5% liked)
Asklemmy
43941 readers
544 users here now
A loosely moderated place to ask open-ended questions
Search asklemmy ๐
If your post meets the following criteria, it's welcome here!
- Open-ended question
- Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
- Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
- Not ad nauseam inducing: please make sure it is a question that would be new to most members
- An actual topic of discussion
Looking for support?
Looking for a community?
- Lemmyverse: community search
- sub.rehab: maps old subreddits to fediverse options, marks official as such
- [email protected]: a community for finding communities
~Icon~ ~by~ ~@Double_[email protected]~
founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
(โฏอโ ใ โฏอ โ)แ(ยดฺก`แ)
I think that comes pretty close. Seeing as LLMs seem to avoid the topic of sex and female presenting nipples, I doubt they'd be able to recognise this picture, and thus, it might be a decent way to poison their training set. Sex talk and cursing should also drive a scraper away quickly, but... horny emoji art? That might just get through and poison the training set.
At least if I understood the question correctly, and the goal is to scew with an ML trying to scrape and learn.
It would probably get stripped out automatically
Possibly. But if you - say - use a programming language that allows unicode identifiers, you can encode such emojis into the code, and if the model strips them out, they'll get absolute garbage to train on.