AI bots on Reddit reaching the front page? I absolutely believe it : reddit

this post was submitted on 18 Dec 2024

635 points (99.2% liked)

AI bots on Reddit reaching the front page? I absolutely believe it (slrpnk.net)

submitted 4 days ago by [email protected] to c/[email protected]

Will Manidis is the CEO of AI-driven healthcare startup ScienceIO

(page 2) 43 comments

[–] [email protected] 2 points 4 days ago* (last edited 4 days ago)

Maybe they're using the subreddit to try to train morality into the model?

[–] [email protected] 20 points 4 days ago

Most people who have worked in customer service would believe every word because they have seen the absurdity of real people.

[–] [email protected] 17 points 4 days ago (2 children)

It’s stupidly easy to make up stuff on AITA and get upvotes/comments. I made up one just for fun and was surprised at how popular it got. Well, now not so much, but back when I did.

If you know the audience and what gets them upset, you’ve got easy karma farming.

[–] [email protected] 1 points 4 days ago

Two weeks ago someone on one of those story subs, I think it was amioverreacting, was milking karma by posting updates. They made 5 posts about the whole thing and even started selling merch to profit in real life, until they took the last post down.

[–] [email protected] 7 points 4 days ago (2 children)

It's like reality TV & soap operas in text form. You can somewhat easily spot the AI posts though, which are plentiful now. They all tend to have the same "professional" writing style and a high tendency to add mid sentence "quotes" and em dashes (—) which you need a numpad combo to actually write out manually - a casual write-up would just use the - symbols, if at all. LLMs also make a lot of logic errors that may pop up. Example from one of the currently highly upvoted posts:

He pulled out what looked like a box from a special jewelry store. My heart raced with excitement as I assumed it was a lovely bracelet or a special memento for our wedding day. But when he opened the box, I was absolutely stunned. Inside was a key to a house he supposedly bought for us. I was taken aback because I had no idea he was even looking for real estate. My first reaction was one of shock and confusion, as I thought it was a huge decision that we should have discussed together.

As I processed the moment, I realized the house wasn’t just any house—it was a fixer-upper on the outskirts of town. Now, I get that it can be a great investment, but this particular house needed a ton of work. I’m talking major renovations and repairs, and I honestly had no desire to live there.

Aside from the weird writing (Oh jolly! Expensive gifts! How exciting!), this lady somehow realized & identified this house, location and its state just by looking at some random key in that moment. Bonus frustration if you read through the commenters who eat all of this shit up, assuming they aren't also bots.
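
The tells described above (em dashes and mid-sentence "quotes") could be turned into a rough count. This is only a sketch: the helper name `style_tells`, the features it counts and the 40-character limit on a quoted phrase are assumptions made for illustration, and plenty of human writers would trip it too.

```python
import re

EM_DASH = "\u2014"  # the em dash character, written as an escape


def style_tells(text: str) -> dict:
    """Count the surface features mentioned in the comment above.

    Hypothetical helper: the feature set is illustrative, not a real detector.
    """
    words = max(len(text.split()), 1)  # avoid dividing by zero on empty input
    em_dashes = text.count(EM_DASH)
    # Short phrases wrapped in straight or curly double quotes.
    quoted = len(re.findall(r'["\u201c][^"\u201c\u201d]{1,40}["\u201d]', text))
    return {
        "em_dashes": em_dashes,
        "quoted_phrases": quoted,
        "em_dashes_per_100_words": round(100 * em_dashes / words, 2),
    }
```

Calling `style_tells(post_body)` on a few posts only gives numbers to compare. It says nothing about logic errors like the house key example, which still need a human reader.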

[–] [email protected] 1 points 4 days ago (1 children)

Now that you mention it, I might be the only non-AI using em dashes on the internet (I have a program that joins two hyphens into an em dash).

[–] [email protected] 1 points 4 days ago

Apparently Lemmy, and Reddit (I can't test either one), actually render it that way too. Not sure how many people know about that though.

[–] [email protected] 5 points 4 days ago (3 children)

Interesting observation about the em dash. I never thought about it that hard, but Reddit's text editor (as well as Lemmy's, at least on the default UI) automatically concatenates a double dash into an en dash, rather than an em dash.

I use em dashes (well, en dashes, as above) in my writing all the time, because I am a nerd.

For anyone who cares, an en dash is the same width as an N in typical typography, and looks like this: –

An em dash is, to no one's surprise, the same width as an M. It looks like this: —

(For what it's worth, Lemmy does not concatenate a triple dash into an em dash. It turns it into a horizontal rule instead.)

[–] [email protected] 3 points 4 days ago

I use the poor man's emdash (two hyphens in a row) here and there as well. I guess I never noticed Reddit auto-formats them. I have been accused of being an AI on a few occasions. I guess this is a contributing factor to why that is.

Funny how Reddit technically formats it into the wrong glyph, though. Not like anyone but the most insufferable of pedants would notice and care, of course. I find it merely mildly amusing.

[–] [email protected] 5 points 4 days ago (1 children)

Does not look like your Lemmy editor is converting them, but Lemmy's frontend itself, because on mbin I just see two regular dashes. Kind of a weird implementation.

[–] [email protected] 4 points 4 days ago (1 children)

That's probably because the posts are stored as plain text, and any markdown within them is just rendered at display time. This is presumably also how you can view any post or comment's original source. So, here you go:

Double --

En – (alt 0150)

Em — (alt 0151)

And for good measure, a triple:

---

Actually, I notice if you include a triple that's not on a line by itself it does render it as an em dash rather than en, like so: ---
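
To make the behaviour discussed here concrete, this is a small sketch of display-time rendering: the stored source stays untouched and the dashes are only swapped when the text is shown. It is not the Lemmy or mbin frontend's actual code. The function name `render_dashes` and the simplified horizontal rule check are assumptions for illustration.

```python
import re

EN_DASH = "\u2013"
EM_DASH = "\u2014"


def render_dashes(source: str) -> str:
    """Mimic the display-time behaviour described in this thread.

    Assumed rules: "--" becomes an en dash, an inline "---" becomes an
    em dash, and a line made only of hyphens becomes a horizontal rule.
    """
    rendered = []
    for line in source.split("\n"):
        if re.fullmatch(r"\s*-{3,}\s*", line):
            rendered.append("<hr>")  # three or more hyphens alone on a line
            continue
        # The lookarounds match runs of exactly three or exactly two hyphens.
        line = re.sub(r"(?<!-)---(?!-)", EM_DASH, line)
        line = re.sub(r"(?<!-)--(?!-)", EN_DASH, line)
        rendered.append(line)
    return "\n".join(rendered)
```

Because the conversion happens on the way out, a frontend that skips this step would show the two plain hyphens exactly as they were typed, which matches what was reported for mbin.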

[–] [email protected] 2 points 4 days ago (1 children)

You're right, it means you don't have to save two versions or somehow convert it back into a source format instead. The triple renders as a line below on mbin. I don't remember what they're called.

[–] [email protected] 1 points 4 days ago

That's a horizontal rule.

Refreshingly, the frontend just converts it to a plain old HTML tag and doesn't try to reinvent the wheel, either.

[–] [email protected] 44 points 4 days ago (3 children)

Dead internet theory

[–] [email protected] 3 points 4 days ago (1 children)

At least I know to blame Claude.

[–] [email protected] 2 points 4 days ago

Claude classique

[–] [email protected] 131 points 4 days ago (9 children)

(Already said this before, but let me reiterate:)

Typical AITA post:

Title: AITAH for calling out my [Friend/Husband/Wife/Mom/Dad/Son/Daughter/X-In-Law] after [He/She] did [Undeniably something outrageous that anyone with an IQ above 80 should know it's unacceptable to do]?

Body of post:

[5-15 paragraph infodumping that no sane person would read]

I told my friend this and they said I’m an asshole. AITAH?

Comments:

Comment 1: NTA, you are absolutely right, you should [Divorce/Go No-Contact/Disown/Unfriend, the person] IMMEDIATELY. Don’t walk away, RUNNN!!!

Comment 2: NTA, call the police! That’s totally unacceptable!

And sometimes you get someone calling out OP… 3: Wait, didn’t OP also claim to be [Totally different age and gender and race] a few months ago? Here’s the post: [Link]

🙄 C’mon, who even thinks any of this is real…

[–] [email protected] 14 points 4 days ago* (last edited 4 days ago)

Man, sometimes when I finish grabbing something I needed from Reddit, I hit the frontpage (always logged out) just out of morbid curiosity.
Every single time that r/AmIOverreacting sub is there with the most obvious "no, you're not" situation ever.

I never once saw that sub show up before the exodus. AI or not, I refuse to believe any frontpage posts from that sub are anything other than made up bullshit.

[–] [email protected] 9 points 4 days ago

If it's well-written enough to be entertaining, it doesn't even matter whether it's real or not. Something like it almost certainly happened to someone at some point.

[–] [email protected] 1 points 4 days ago

Look at that, the detection heuristics all laid out nice and neatly. The only issue is that Reddit doesn't want to detect bots because they are likely using them. Reddit at one point was using a form of bot protection but it wasn't for posts; instead, it was for ad fraud.

[–] [email protected] 26 points 4 days ago (3 children)

Needs to feature both a wedding and a pregnancy and you've nailed it

[–] [email protected] 21 points 4 days ago (2 children)

Way too many...

I was born before the Internet. The Internet is always lumped into the "entertainment" part of my brain. A lot of people that have grown up knowing only the Internet think the Internet is much more "real". It's a problem.

[–] [email protected] 7 points 4 days ago

I genuinely miss the 90s. I mean, yeah, early forms of internet and computers existed, but not everyone had a camera, and not everyone got absolutely bukkaked with disinformation. Not that I think everything is bad about the tech in and of itself, but how we use it nowadays is just so exhausting.

[–] [email protected] 12 points 4 days ago* (last edited 4 days ago) (5 children)

I've come up with a system to categorize reality in different ways:

Category 1: Thoughts inside my brain formed by logic

Category 2: Things I can directly observe via vision, hearing, or other direct sensory input

Category 3: IRL Other people's words, stories, anecdotes, in face to face conversations

Category 4: Accredited News Media, Television, Newspaper, Radio (Including Amateur Radio Conversations), Telegrams, etc...

Category 5: The General Internet

The higher the category number, the more distant that information is, and therefore the more suspicious I am.

I mean like, if a user on Reddit (or any internet forum or social media for that matter) told me X is a valid treatment for X disease without like real evidence, I'm gonna laugh in their face (well not their face, since it's a forum, but you get the idea).

[–] [email protected] 5 points 4 days ago

ESH

[–] [email protected] 5 points 4 days ago (1 children)

Why not? r/AmITheAsshole is about entertainment, not truth. It would be an indictment of AI if it couldn't replicate a short, funny story.

[–] [email protected] 5 points 4 days ago

If your statement was true, then it should be disclosed in a visible manner, which it isn't.

[–] [email protected] 16 points 4 days ago (2 children)

Is Reddit still feeding Google's LLM or was it just a one-time thing? Meaning, will the newest LLM-generated posts feed LLMs to generate more posts?

[–] [email protected] 5 points 4 days ago

These days the LLMs feed the LLMs, so you end up modeling models unless you exclude all public data from the last decade. You have to assume any public user-generated data is tainted when used for training.

[–] [email protected] 22 points 4 days ago* (last edited 4 days ago) (1 children)

The truly valuable data is the stuff that was created prior to LLMs; anything after this is tainted by slop. Any verifiable human data would be worth more, which is why they are simultaneously trying to erode any and all privacy.

[–] [email protected] 0 points 4 days ago (2 children)

I'm not sure about that. It implies that only humans are able to produce high-quality output. But that seems wrong to me.

First of all, not everything that humans produce has high quality; rather, the opposite.
Second, with the development of AI, I think it will very well be possible for AI to generate good-quality output in the future.

[–] [email protected] 87 points 4 days ago (8 children)

Oh boy, identity mechanics to stamp out the last vestiges of privacy.

[–] [email protected] 6 points 4 days ago (1 children)

I wonder where people in the future will get their information from. What trustworthy sources of information are there? If the internet is overrun with bots, then you can't really trust anything you read there, as it could all be propaganda. What else to do, though, to get your news?

[–] [email protected] 6 points 4 days ago

That's the killer app right there: the complete inability for the common person to distinguish between true and false. That's what they're going for.

[–] [email protected] 3 points 4 days ago (2 children)

Yeah, a real problem solver would probably be to remove the incentive for someone to do this.

It would probably be far less likely for someone to do that on Lemmy, as there is no karma and you don't get paid for upvotes or something. (Still there are incentives, like creating credibility, celebrity accounts, maybe influencing public opinion, self-pleasure from seeing upvotes to "your" posts/comments etc., but they aren't such potent incentives as directly monetary incentives.)

[–] [email protected] 7 points 4 days ago (2 children)

you could try to cook up some kind of trust chain, without totally abandoning privacy.

Have government-certified agencies mint a master key tied to your ID. You only get one, with a trust rating tied to it.

With that master key you can generate an unlimited number of sub-IDs that don't identify you but show your trust rating (fuzzed).

Have a cross-network reporting system that can lower that rating for abuses like botting.

idk, I'm just spitballing
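
One way the sub-ID part of this idea could look, as a minimal sketch and nothing more: each sub-ID is derived from the master key with HMAC-SHA256, and the trust rating is rounded into coarse buckets before it is shown. The names `derive_sub_id` and `fuzz_rating`, the 0 to 100 rating scale and the bucket size are all assumptions for illustration. The sketch also leaves out the hard part, which is letting a site verify the rating and report abuse without learning who holds the key.

```python
import hashlib
import hmac
import secrets


def derive_sub_id(master_key: bytes, context: str) -> str:
    """Derive a per-site pseudonym from the master key (hypothetical scheme).

    Without the master key, two sub-IDs cannot be linked to each other
    or to the key holder, as long as the key stays secret.
    """
    return hmac.new(master_key, context.encode("utf-8"), hashlib.sha256).hexdigest()


def fuzz_rating(rating: int, bucket: int = 10) -> int:
    """Round an assumed 0-100 trust rating down to a coarse bucket so the
    exact value cannot serve as a fingerprint."""
    clamped = max(0, min(100, rating))
    return (clamped // bucket) * bucket


# Stand-in for the agency-issued key; a real one would not be generated locally.
master_key = secrets.token_bytes(32)
forum_id = derive_sub_id(master_key, "forum.example")
news_id = derive_sub_id(master_key, "news.example")
shown_rating = fuzz_rating(87)
```

Using a keyed hash means the same person always gets the same sub-ID on a given site, so a ban there sticks, while the IDs on different sites look unrelated.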

[–] [email protected] 26 points 4 days ago

Also doesn’t fix the problem at all, I can still just use AI to post to my main account

[–] [email protected] 5 points 4 days ago (2 children)

I dunno, part of me is ok with it. It's clear to me how bad things are going to get. So having certain platforms or spaces with some level of public identity validation seems like it might be ok....

[–] [email protected] 3 points 4 days ago* (last edited 4 days ago)

Especially when it's about gathering real information. When everything you read is written by an anonymous author, you'd have no chance to know whether it's true or false, except if it's a paper on theoretical maths of course.

[–] [email protected] 36 points 4 days ago (1 children)

They're pretty much declaring a war on VPNs also

[–] [email protected] 15 points 4 days ago (2 children)

Yep. More than half the time I can't access Reddit through Proton VPN.

[–] [email protected] 2 points 4 days ago (1 children)

https://addons.mozilla.org/en-US/firefox/addon/redlib/

[–] [email protected] 4 points 4 days ago* (last edited 4 days ago)

Meh. Thanks but fuck Reddit anyway.
