Code Monkey home page Code Monkey logo

userstalker's Introduction

User Stalker

What exactly is this?

We are scanning every user account upon creation (on a few SE sites) and report those users who fail a few checks in a chat room.

What data are we using?

  1. The Stack Exchange API.
  2. Blacklisted usernames from the SmokeDetector project.
  3. Offensive Regular Expressions from the Heat Detector project.
  4. Blacklisted username regex created using data from Rubio

What all are we checking?

  1. If there is a link in their about me text.
  2. If the website link is similar to their display name.
  3. If the username or profile text fails the Heat Detector Offensive Regex.
  4. If the user profile text contains a phone number.
  5. If the user profile text contains the word "Insurance".
  6. If the username is blacklisted on the SmokeDetector username blacklist.
  7. If the username fails the username blacklist regex.
  8. If the username contains the current year or the next year.
  9. If the user is suspended on the site upon creation.

What do we do when a user is detected as a bad user?

Nothing

We don't do anything when a user is detected. We are just stalkers.

According to the Stack Exchange policy:

We actually don't encourage moderators to seek out these kinds of users and destroy them because it's usually a waste of time. There are lots of spam users that just create profiles and then never do anything, and actively seeking them out to destroy them rarely achieves anything.

and

So if you really feel like spending your time hunting these down, go for it, but by all means you should be absolutely certain it's an attempt at spam

Moderators can destroy these accounts, if and only if they are completely sure that the profile is created only for spam.

When we detect trends like the "The Great Super User Spam Invasion of 2017" on 16th October 2017, we usually try to figure out what is happening and destroy the spam profiles. (But it is left to the site moderators discretion. We just alert the mods, if the need be. In that particular case, Community Managers were informed as the number of accounts involved was very high.)

(Another recent occurrence is the insurance spam, which lead to the two new checks.)

Whom would it help?

Mostly the diamond moderators.

How do I track a site?

Use the add command and pass the site name and the frequency of tracking needed as parameters. Frequency can be fast or slow. fast calls the API every minute, slow calls it every 5 minutes.

For example, if you need to track on site Drupal Answers, every 5 minutes, use:

@Jam add drupal slow

What other commands are there?

  • quota - Returns the API quota available.
  • alive - Returns a test message if the bot is responding to commands.
  • tracked - Returns the list of sites which are being tracked.
  • check - Returns a list of reasons for which a user can be caught.

Join us in MOBotics

userstalker's People

Contributors

adeak avatar arcticecho avatar bhargav-rao avatar codygray avatar dependabot[bot] avatar double-beep avatar jeffschaller avatar lw001 avatar superplane39 avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

userstalker's Issues

Command to blacklist some names.

There should be a new command to blacklist some names.

Just something which stores it in memory which needn't be persisted across restarts.

Update chatexchange library

As you might have heard, Stack Exchange will remove OpenID.

Since tunaki/chatexchange is currently using OpenID to connect to chat, we had to release an update. If you don't update, you won't be able to connect to chat after July 25, 2018.

We've moved the project to org.sobotics.*. This is the updated dependency:

<dependency>
  <groupId>org.sobotics</groupId>
  <artifactId>chatexchange</artifactId>
  <version>2.0.0</version>
</dependency>

You'll have to update your import statements. All APIs are still the same, so you don't have to modify the implementation.

If you have issues updating your project, feel free to ask for help in our chatroom.

Regex for Spam Usernames

One of the Puzzling Stack Exchange moderators, Rubio, has provided us with dataset of usernames which were used to spam Puzzling Stack Exchange.

The list is here https://pastebin.com/CkeV99c5

Can we come up with a new Regex list to catch most of these spammers?

I need some help to create the regex list, not to develop anything. We can use a new file on GH to store the new regexes and just use the same code which we are using to match against the smokey blacklisted username list.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.