Instance Admins: Check Your Instance for Vote Manipulation Accounts [PSA]

Admiral Patrick@dubvee.org · edit-2 7 months ago

Instance Admins: Check Your Instance for Vote Manipulation Accounts [PSA]

kersploosh@sh.itjust.works · 8 months ago

After digging into it, we banned the two sh.itjust.works accounts mentioned in this post. A quick search of the database did not reveal any similar accounts, though that doesn’t mean they aren’t there.

Blaze (he/him)@feddit.org · 8 months ago

We have our own astroturfing bots, did we make it?

Lost_My_Mind@lemmy.world · 8 months ago

Make it harder to moderate? Sure!

Admiral Patrick@dubvee.org · edit-2 8 months ago

I hope this post doesn’t tank the monthly active users stats lol. Mostly that’s me hoping this problem isn’t as big as I fear.

Coelacanth@feddit.nu · 8 months ago

I believe “Russian Bot Farm Presence” is the preferred metric of social network relevance in the scientific community.

Admiral Patrick@dubvee.org · 8 months ago

Lol, that sounds like a Randall Munroe unit of measurement, and I love it. If there’s not already an xkcd for that, there should be.

abff08f4813c@j4vcdedmiokf56h3ho4t62mlku.srv.us · 8 months ago

What surprises me is that these seem to be all on other instances - including a few big ones like just.works - rather than someone spinning up their own instance to create unlimited accounts to downvote/spam/etc.

schizo@forum.uncomfortable.business · 7 months ago

Not really: if you’re astroturfing, you don’t do all your astroturfing from a single source because that makes it so obvious even a blind person could see it and sort it out.

You do it from all over the places, mixed in with as much real user traffic as you can, and then do it steadily and without being hugely bursty from a single location.

Humans are very good at pattern matching and recognition (which is why we’ve not all been eaten by tigers and leopards) and will absolutely spot the single source, or extremely high volume from a single source, or even just the looks-weird-should-investigate-more pattern you’d get from, for example, exactly what happened to cause this post.

TLDR: they’re doing this because they’re trying to evade humans and ML models by spreading the load around, making it not a single source, and also trying to mix it in with places that would also likely have substantial real human traffic because uh, that’s what you do if you’re hoping to not be caught.

A Basil Plant@lemmy.world · edit-2 8 months ago

My bachelor’s thesis was about comment amplifying/deamplifying on reddit using Graph Neural Networks (PyTorch-Geometric).

Essentially: there used to be commenters who would constantly agree / disagree with a particular sentiment, and these would be used to amplify / deamplify opinions, respectively. Using a set of metrics [1], I fed it into a Graph Neural Network (GNN) and it produced reasonably well results back in the day. Since Pytorch-Geomteric has been out, there’s been numerous advancements to GNN research as a whole, and I suspect it would be significantly more developed now.

Since upvotes are known to the instance administrator (for brevity, not getting into the fediverse aspect of this), and since their email addresses are known too, I believe that these two pieces of information can be accounted for in order to detect patterns. This would lead to much better results.

In the beginning, such a solution needs to look for patterns first and these patterns need to be flagged as true (bots) or false (users) by the instance administrator - maybe 200 manual flaggings. Afterwards, the GNN could possibly decide to act based on confidence of previous pattern matching.

This may be an interesting bachelor’s / master’s thesis (or a side project in general) for anyone looking for one. Of course, there’s a lot of nuances I’ve missed. Plus, I haven’t kept up with GNNs in a very long time, so that should be accounted for too.

Edit: perhaps IP addresses could be used too? That’s one way reddit would detect vote manipulation.

[1] account age, comment time, comment time difference with parent comment, sentiment agreement/disgareement with parent commenters, number of child comments after an hour, post karma, comment karma, number of comments, number of subreddits participated in, number of posts, and more I can’t remember.

Onno (VK6FLAB)@lemmy.radio · 8 months ago

As an end user, ie. not someone who either hosts an instance or has extra permissions, can we in anyway see who voted on a post or comment?

I’m asking because over the time I’ve been here, I’ve noticed that many, but not all, posts or comments attract a solitary down vote.

I see this type of thing all over the place. Sometimes it’s two down votes, indicating that it happens more than once.

I note that human behaviour might explain this to some extent, but the voting happens almost immediately, in the face of either no response, or positive interactions.

Feels a lot like the Reddit down vote bots.

Admiral Patrick@dubvee.org · 8 months ago

As a regular user, I don’t think there’s much you can do, unfortunately (though thank you for your willingness to help!). Sometimes you can look at a post/comment from Kbin to see the votes, but I think Mbin only shows the upvotes. Most former kbin instances, I believe, switched to mbin when development on kbin stalled.

The solitary downvotes are annoying for sure. “Some people, sigh” is just my response to that. I just ignore those.

Re: Downvote bots. I can’t say they’re necessarily bots, but my instance has scripts that flag accounts that exclusively give out downvotes and then bans them. That’s about the best I can do, at present, to counter those for my users.

Tanoh@lemmy.world · 8 months ago

Re: Downvote bots. I can’t say they’re necessarily bots, but my instance has scripts that flag accounts that exclusively give out downvotes and then bans them. That’s about the best I can do, at present, to counter those for my users.

It is usually not a good idea to specify what your exact metrics are for a ban. A bad actor could see that and then get around it by randomly upvoting something every now and then.

Admiral Patrick@dubvee.org · edit-2 8 months ago

True. But it uses a threshold ratio. They’d have to give out a proportional number of upvotes to “fool” it, and at that point, they’re an average Lemmy user lol. That script isn’t (currently) setup to detect targeted vote brigading, just ones that are only here to downvote stuff. I’ve got other scripts to detect that, but they just generate daily/weekly reports.

It takes time to detect them, but it does prevent most false positives that way (better to err on the side of caution and all that).

XNX@slrpnk.net · 8 months ago

How did you discover this? I wonder if private voting will make it too difficult to discover

Admiral Patrick@dubvee.org · edit-2 8 months ago

Try to summarize this as briefly as I can:

I was replying to a comment in a big news community about 5 months ago. It took me probably 2 minutes, at most, to compose my reply. By the time I submitted the comment (which triggered the vote counts to update in the app), the comment I was replying to had received ~17 downvotes. This wasn’t a controversial comment or post, mind you.

17 votes in under 2 minutes on a comment is a bit unusual, so I pulled up the vote viewer to see who all had downvoted it so quickly. Most of them were these random 8 character usernames like are shown in the post.

From there, I went to the DB to look at the timestamps on those votes, and they were all rapid-fire, back to back. (e.g. someone put the comment AP ID into a script and sent their bot swarm after it)

So that’s when I realized something fishy was happening and dug deeper. Looking at what was upvoted from those, however, revealed more than what they were downvoting. Have been keeping an eye out for those type of accounts since. They stopped registering for a while, but then they started coming up again within the last week or two.

I wonder if private voting will make it too difficult to discover

Depends how it’s implemented. If the random usernames that are supplied from the private votes are random for each vote, that would make it nearly impossible to catch (and would also clutter the person table on instances with junk, one-off entries). If the private voting accounts are static and always show up with the same identifier, I don’t think it would make it much more difficult than it is now with these random user accounts being used. The kicker would be that only the private version of the account would be actionable.

The only platform with private voting I know of right now is Piefed, and I’m not sure if the private voting usernames are random each time or static (I think they’re static and just not associated with your main profile). All that said, I’m not super clear on how private voting is implemented.

socsa@piefed.social · edit-2 8 months ago

You should out the users and topics they are engaging with.

Admiral Patrick@dubvee.org · edit-2 8 months ago

Ethically, I can’t (and won’t). I’m only comfortable and confident enough to share the list of sockpuppet accounts I’ve confirmed and provide the information necessary to detect them. I did list the topics I’m aware of (US news and politics), but I’m only able to see activity based on what my instance knows about. So they may be manipulating other communities, but if my instance doesn’t subscribe to them (or they’re by posters that have been banned), I have no way of seeing it.

That’s actually why I posted this. My visibility is limited, so once I identified the pattern, I’m passing that along to other admins for awareness.

socsa@piefed.social · 8 months ago

Don’t respond if it is mostly “Blue MAGA” and “Genocide Joe”

Cadeillac@lemmy.world · edit-2 8 months ago

This Blue MAGA shit is so fucking funny to me. It is the laziest no u. It came out of nowhere, they provide absolutely nothing to back it up. They just show up screaming Blue MAGA. I kind of miss the days when trolls actually tried. It isn’t even fun anymore, and they just run away when you hit them with a factual rebuttal

thisbenzingring@lemmy.sdf.org · edit-2 8 months ago

I got banned from one of the politics communities for calling out someone using the “blue maga” phrase. I called them ambitious and then called called them weirdo and got my comment removed for “attack language”, when I quested the mod they banned me for a few days. I will avoid any communities that mod is a part of.

Cadeillac@lemmy.world · 8 months ago

I’ve gotten a couple warnings on politics. I don’t worry too much about it. Makes me have to be more clever, and not just directly attack people

Scrubbles@poptalk.scrubbles.tech · 8 months ago

I have a manual process for admitting people, do I need to do anything if I know exactly who is on my instance, or do I need to do anything to protect my instance from other bad acting instances (beyond defederating, which I do when I notice a lot of spam). Any queries you recommend?

Admiral Patrick@dubvee.org · edit-2 8 months ago

I have a manual process for admitting people, do I need to do anything if I know exactly who is on my instance,

With that in place, I wouldn’t think so. I’m in the same boat with a small instance that has always used applications. The problematic accounts I’ve noticed are all using these random, 8-character names and seem to be setting up shop across open instances w/o applications. So chances are, if you’re manually admitting people, you’d have noticed these already and likely not approved them.

do I need to do anything to protect my instance from other bad acting instances

Unfortunately, defederating only protects your instance’s users from being impacted by the manipulations. Beyond that, it’s less a bad instance rather than them being taken advantage of (kind of like our persistent troll who instance hops every few days).

For now, I’ve just banned the vote manipulation accounts and moved on (this PSA notwithstanding lol) I wouldn’t consider these a “defederation worthy” offense. When I do defed, it’s for bigger reasons or just temporary due to spam (sometimes admins can’t deal with it right away but it’s causing a huge problem now and I need to do something in the short term).

Queries, I do have some, but they’re ugly AF. lol. I should prob look into starting a Matrix room or admin community where we can share and improve each others’ utility scripts.

rglullis@communick.news · 8 months ago

Another data point in favor of supporters of Dead Internet Theory .

Also, this is one more example of why it would be better if instances charged a little bit from everyone: spammers will rather run things from their own machines (or some illegal botnet) than paying something with a credit card.

Admiral Patrick@dubvee.org · 8 months ago

That may work, or you’d just get a bunch of chargebacks from stolen credit cards lol.

I do like the idea of some kind of verification besides from a questionnaire, but I’m not sure what would ever get traction.

Draconic NEO@lemmy.world · 7 months ago

That’s one thing that nobody really ever talks about when it comes to discussing payment verification. The fact that the people who are willing to commit scams and fraud are also willing to steal credit or debit cards.

rglullis@communick.news · edit-2 8 months ago

you’d just get a bunch of chargebacks from stolen credit cards lol.

Criminals use stolen credit cards for high value items that can be sold quickly. If criminals really wanted to do mass manipulation via AP servers, it will be easier/faster/cheaper for them to spin up their own servers than signing up for paid accounts.

The one counter-argument that I would accept though: what if bad actors running psyops become commercial providers to attract legit customers and mix it with their agents?

Lampshade@lemmy.sdf.org · 8 months ago

What stops the botters from setting up their own instances to create unlimited users for manipulating votes?

I guess admins also have to be on top of detecting and defederating from such instances?

Draconic NEO@lemmy.world · 7 months ago

They usually get found out pretty easily and then defederated by everyone. There’s a service called fediseer which allows instance admins to flag instances as harmful, which other admins can use to determine if they should block an instance.

In order for that to really work they would have to rotate between a lot of domain names either by changing their own instance’s domain or using a proxy. Either way they’d run out of domains rather quickly.

It’s way easier for them to just get accounts on the big servers and hide there as if they were normal lurking users.

Mac@mander.xyz · 8 months ago

this has already happened multiple times. they get found out fairly quickly and defederated by pretty much everyone.

Blaze (he/him)@feddit.org · 8 months ago

Project like https://gui.fediseer.com/

Blaze (he/him)@feddit.org · 8 months ago

I just had a look at https://lemy.lol/, and they have email verification enabled, so it’s not just people finding instances without email check to spam account on there.

@[email protected] and @[email protected] FYI

SorteKanin@feddit.dk · 8 months ago

Email verification is super easy to get around. It’s practically not a barrier at all.

Blaze (he/him)@feddit.org · 8 months ago

It’s small step, but still a step

Admiral Patrick@dubvee.org · 8 months ago

I used to think so, but it’s barely even that.

I’ve had 3 instance admins confirm anonymously that these were using a throwaway email service. sharklasers.com specifically.

Camus (il, lui)@jlai.lu · 8 months ago

Thank you for the list, we’ll remove the Jlai.lu account

Admiral Patrick@dubvee.org · edit-2 8 months ago

I strongly advise verifying first, but yes.

I can only verify them based on the posts/comment votes my instance is aware of. That said, I do have sufficient data and enough overlap to establish a connection/pattern.

Otter@lemmy.ca · 8 months ago

I think what we need is an automated solution which flags groups of accounts for suspect vote manipulation.

We appreciate the work you put into this, and I imagine it took some time to put together. That will only get harder to do if someone / some entity puts money into it.

Admiral Patrick@dubvee.org · 8 months ago

Yeah, this definitely seems more like script kiddie than adversarial nation-state. We’re not big enough here, yet anyway, that I think we’d be attracting that kind of attention and effort. However, it is a good practice run for identifying this kind of thing.

SorteKanin@feddit.dk · 8 months ago

automated solution

On the other hand, any automated solution will be possible to work around. Such a system would be open source like the rest of Lemmy and you’d know exactly the criteria you need to live up to to avoid getting hit by the filter.

Otter@lemmy.ca · edit-2 8 months ago

I guess it could end up being an arms race.

What if the tool was more of a toolbox, where each instance could configure it the way that they want (ex. Thresholds before something is flagged, etc.) Similar to how automod works, where the options are well known but it’s hard to tell what any particular space is running behind the scenes.

At the very least, tools like this can make it harder for silent vote manipulation even if it doesn’t stop it entirely

Instance Admins: Check Your Instance for Vote Manipulation Accounts [PSA]

Instance Admins: Check Your Instance for Vote Manipulation Accounts [PSA]

What are they doing?

What do these have in common?

What can you, as an instance admin, do?

Why are they doing this?

Who are the known culprits?