splitgraph/stopforumspam
Open repository in Console
 
Readme
Updated 2 years ago
Indexed 2 years ago

Stop Forum Spam IP, email and username data

Retrieved from https://www.stopforumspam.com/downloads on January 31st, 2022

This dataset contains a list of abusive:

  • IPv4 and IPv6 addresses
  • email addresses
  • usernames

as well as the number of times this email/IP/username has been reported and the last time it was reported.

Note that the usernames have been preprocessed to only have values matching the [A-Za-z0-9\-]+ regular expression:

cat listed_username_365_all.txt | grep -E '^"[A-Za-z0-9\-]+",.*$' | sort | uniq > listed_username_365_all_filtered.txt

Usage

You can query this dataset over the Splitgraph REST API:

$ curl https://data.splitgraph.com/splitgraph/stopforumspam/latest/-/rest/listed_email_365_all?email=eq.arasmout650%40gmail.com
[{"email":"arasmout650@gmail.com","times_reported":494,"last_reported":"2022-01-28T17:35:55"}]

$ curl https://data.splitgraph.com/splitgraph/stopforumspam/latest/-/rest/listed_email_365_all?email=eq.support%40splitgraph.com
[]

...or over the Splitgraph DDN (see the guide):

curl -XPOST -H "Content-Type: application/json" https://data.splitgraph.com/sql/query/ddn \
    --data-raw '{"sql":"SELECT times_reported, last_reported FROM \"splitgraph/stopforumspam:latest\".\"listed_email_365_all\" WHERE email = '"'"'arasmout650@gmail.com'"'"'"}'

{
  "success": true,
  "command": "SELECT",
  "rowCount": 1,
  "rows": [
    {
      "times_reported": "494",
      "last_reported": "2022-01-28T17:35:55.000Z"
    }
  ],
  "fields": [
    {
      "name": "times_reported",
      "tableID": 0,
      "columnID": 0,
      "dataTypeID": 20,
      "dataTypeSize": 8,
      "dataTypeModifier": -1,
      "format": "text",
      "formattedType": "INT8"
    },
    {
      "name": "last_reported",
      "tableID": 0,
      "columnID": 0,
      "dataTypeID": 1114,
      "dataTypeSize": 8,
      "dataTypeModifier": -1,
      "format": "text",
      "formattedType": "TIMESTAMP"
    }
  ],
  "executionTime": "145ms",
  "executionTimeHighRes": "0s 144.26411ms"
}
 
Preview
All 3 Tables
  • listed_email_365_all
    25 Objects
     | 59 MB | 
    2.43M Rows