Project killR: Dataset containing information on male and female international serial killers and scripts for scraping, wrangling, and analyzing data on both international and US American serial killers
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
README.md
Serial_killers_US.R
serial_killers_analysis.R
serial_killers_data.rds

README.md

serial-killers

Project killR: Data on both male and female international serial killers and scripts for scraping, wrangling, and analyzing data on serial killers

Description: serial_killers_data

Contains the following scraped data on both male and female international serial killers (N = 576):

  • name (full, given, and surnames)
  • pseudonym (note: pseudonyms are partly in German)
  • country
  • sex
  • victims (proven, suspected)
  • years active (active from-to, active from, active to)
  • geocoded country locations (longitude, latitude)

Data source: https://de.wikipedia.org/wiki/Liste_von_Serienmördern#Serienmörder-Paare/Gruppen

Description: serial_killers_analysis

The script covers

  • scraping Wikipedia tables containing information on international serial killers
  • wrangling scraped data
  • geocoding the killers' locations by country using Google Maps API
  • identifying the most deadly international serial killers
  • visualizing the killers' number of victims and sex with ggplot2
  • mapping the killers' locations as both points and polygons with leaflet and ggplot2

using the abovementioned data set on N = 576 serial killers.

Description: Serial_killers_US

The script covers

  • scraping a Wikipedia table containing information on US serial killers
  • wrangling scraped data
  • scraping a Wikipedia table containing US states and extracting information on the killers' location from strings
  • predicting the killers' sex based on their given names using gender and genderdata
  • scraping a Wikipedia table containing causes of death and extracting information on the killers' causes of death from strings
  • visualizing the killers' causes of death, number of victims, and sex with ggplot2
  • mapping the killers' locations with ggplot2

using the example of serial killers in the US.