SHADOW13325/Mass-Shooting-Analysis

Mass-Shooting-Analysis

This is a step-by-step approach to visualizing a dataset of mass shootings: cleaning the data, wrangling it, and plotting various entities to obtain useful results.

Mass Shootings in the United States of America (1966-2017)

The US has witnessed 398 mass shootings in the last 50 years, resulting in 1,996 deaths and 2,488 injured. The latest and worst mass shooting, on October 1, 2017, killed 58 and injured 515 so far. The number of people injured in this attack is more than the number of people injured in all mass shootings of 2015 and 2016 combined. Over the last 50 years there have been, on average, 7 mass shootings per year, claiming 39 lives and injuring 48 people per year.

1) Data Preparation

First, let's get to know the dataset.

GetImage

There are 323 shootings and 21 features in the dataset.

S# - Shooting number, in descending order
Title - Description about the shooting
Location - location of the mass shooting
Date - date of shooting
Incident Area - Area where shooting occurred
Open/Close Location - Whether the location was open or closed
Target - Targeted shooting or random one? If Targeted, who was targeted
Cause - cause for the shooting
Summary - brief summary of the shooting. Who shot and the background etc.
Fatalities - how many died during/after the shooting
Injured - how many were injured during/after the shooting
Total victims - Fatalities + Injured
Policeman Killed - Number of policemen killed
Age - Age of the gunman
Employeed (Y/N) - Was the gunman employed
Employed at - If employed, employment details of the gunman
Mental Health Issues - Was the gunman mentally ill?
Race - race of the gunman
Gender - gender of the gunman
Latitude - latitude of the location
Longitude - longitude of the location
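As a rough illustration of this first look at the data, here is a minimal sketch that counts rows and features using only the Python standard library. The inline CSV text is a hypothetical three-row excerpt with a handful of the columns listed above (only the Las Vegas figures come from this README); the real file would have all 21 features.

```python
import csv
import io

# Hypothetical excerpt standing in for the real CSV file (assumption).
SAMPLE_CSV = """S#,Title,Location,Fatalities,Injured
3,Las Vegas Strip mass shooting,"Las Vegas, NV",58,515
2,Example shooting B,,3,0
1,Example shooting C,"Austin, Texas",5,2
"""


def load_rows(text):
    """Parse CSV text into a list of dicts, one dict per shooting."""
    return list(csv.DictReader(io.StringIO(text)))


rows = load_rows(SAMPLE_CSV)
n_rows = len(rows)
n_features = len(rows[0])
print(f"{n_rows} shootings, {n_features} features")
```

Reading the real file would only change the source of the text, for example by passing an open file object to `csv.DictReader` instead of `io.StringIO`.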

2) Data Processing

Let's look at the missing values first.

GetImage
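Counting missing values per column can be sketched as follows. This is a plain-Python illustration, not the code behind the screenshot; the sample rows are hypothetical and shaped like `csv.DictReader` output, where a missing value shows up as an empty string or `None`.

```python
def count_missing(rows, columns):
    """Return {column: number of rows whose value is None or blank}."""
    counts = {}
    for column in columns:
        counts[column] = sum(
            1
            for row in rows
            if row.get(column) is None or str(row[column]).strip() == ""
        )
    return counts


# Hypothetical rows (assumption), not taken from the real dataset.
sample_rows = [
    {"Location": "Las Vegas, NV", "Incident Area": "", "Target": "random"},
    {"Location": "", "Incident Area": "", "Target": "coworkers"},
    {"Location": "Austin, Texas", "Incident Area": "school", "Target": None},
]

missing = count_missing(sample_rows, ["Location", "Incident Area", "Target"])
print(missing)
```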

There are 45 missing locations, 62 missing incident areas and 5 missing targets.

a) Let's try to derive the location from summary. My logic is based on the assumption that some titles have the city and state separated by a comma and some titles just have the state.

GetImage (1)
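The assumption above (a title contains either "City, State" or just a state) could be turned into code roughly like this. It is a sketch under stated assumptions: `STATE_NAMES` is a deliberately short, hypothetical list, `guess_location` and `fill_missing_locations` are illustrative names, and the regular expression only recognises simple capitalised city names.

```python
import re

# Hypothetical, partial list; a real run would need every state (assumption).
STATE_NAMES = ["California", "Nevada", "Texas", "Virginia"]


def guess_location(title):
    """Return 'City, State' or 'State' if one is found in the title, else None."""
    for state in STATE_NAMES:
        # One or more capitalised words, a comma, then the state name.
        match = re.search(r"((?:[A-Z][a-z]+ ?)+), " + re.escape(state), title)
        if match:
            return f"{match.group(1).strip()}, {state}"
        if state in title:
            return state
    return None


def fill_missing_locations(rows):
    """Fill blank Location values in place from the Title, where possible."""
    for row in rows:
        if not (row.get("Location") or "").strip():
            guess = guess_location(row.get("Title") or "")
            if guess:
                row["Location"] = guess
    return rows


# Hypothetical titles for illustration only.
rows = fill_missing_locations([
    {"Title": "Shooting in Sutherland Springs, Texas", "Location": ""},
    {"Title": "Texas church mass shooting", "Location": None},
    {"Title": "Office shooting", "Location": ""},
])
print([row["Location"] for row in rows])
```

Rows whose title gives no hint keep their blank location, so the number of missing values can be re-checked afterwards.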

b) Next, we derive state names from the updated locations: we extract the state from each location and expand abbreviated state names to their full form.

GetImage (2)
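One way to sketch this step: take the part after the last comma of the location and expand it if it is a known abbreviation. `STATE_ABBREVIATIONS` is a hypothetical, partial lookup table and `state_from_location` is an illustrative name, not the function shown in the screenshot.

```python
# Hypothetical, partial lookup; the full table has one entry per state (assumption).
STATE_ABBREVIATIONS = {"CA": "California", "NV": "Nevada", "TX": "Texas"}


def state_from_location(location):
    """Return the full state name for 'City, ST', 'City, State' or 'State'."""
    if not location or not location.strip():
        return None
    last_part = location.split(",")[-1].strip()
    # Expand known abbreviations; otherwise keep the text as it is.
    return STATE_ABBREVIATIONS.get(last_part.upper(), last_part)


states = [
    state_from_location(loc)
    for loc in ["Las Vegas, NV", "Sutherland Springs, Texas", "Texas", ""]
]
print(states)
```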

c) Now let's consolidate the redundant values, i.e. data cleaning.

i) Cleaning the Gender column

GetImage (3) ==> GetImage (4)
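A before/after comparison like the one in the screenshots can be sketched with a small mapping from redundant spellings to one canonical label. The raw values below ("M", "M/F" and so on) are assumed for illustration; the actual spellings in the dataset may differ.

```python
from collections import Counter

# Assumed redundant spellings mapped to a canonical label (assumption).
GENDER_MAP = {"M": "Male", "F": "Female", "M/F": "Male/Female"}


def clean_gender(value):
    """Normalise one Gender value; blank or missing becomes 'Unknown'."""
    if value is None or not str(value).strip():
        return "Unknown"
    value = str(value).strip()
    return GENDER_MAP.get(value, value)


raw = ["M", "Male", "M/F", "Male/Female", "F", None, " Male "]
before = Counter(raw)
after = Counter(clean_gender(v) for v in raw)
print(before)
print(after)
```

The same pattern, with a different mapping, would apply to any column whose values differ only in spelling.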

We do the same for the columns Race, Mental Health Issues, Cause and Open/Close Location.

ii) Where the values are varied rather than simply redundant, we group them into categories that suit our analysis. Cleaning the Incident Area column:

GetImage (5) ==> GetImage (6)

We do the same for the Target column.
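Grouping free-text values into a few categories can be sketched with keyword matching. The keywords and group names in `AREA_GROUPS` are hypothetical choices made for this example; the grouping used in the repository may be different.

```python
# Hypothetical keyword groups, checked in order (assumption).
AREA_GROUPS = [
    (("school", "university", "college"), "School"),
    (("church", "temple", "mosque"), "Place of worship"),
    (("office", "workplace", "factory"), "Workplace"),
]


def group_incident_area(area):
    """Map a free-text incident area to a coarse category."""
    if area is None or not str(area).strip():
        return "Unknown"
    text = str(area).lower()
    for keywords, group in AREA_GROUPS:
        if any(keyword in text for keyword in keywords):
            return group
    return "Other"


grouped = [
    group_incident_area(a)
    for a in ["High School cafeteria", "Office building", "Parking lot", ""]
]
print(grouped)
```

Because the groups are checked in order, a value matching several keyword sets lands in the first matching group.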

3) Exploratory Data Analysis

Now it's time to visualize the data we prepared in the steps above and derive results from it. Let's go.

i) Mass Shootings Vs Year

GetImage (8)

We can see a sharp change in the number of shootings from 2015 onwards compared to all previous years.

Let's categorise Mass shooting over the years based on Fatalities, Injured, Total Victims and No of shootings.

GetImage (9)

We can see that there is a massive change in all 4 categories in the later years.
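The per-year numbers behind plots like these can be computed with a simple aggregation. The sketch below assumes the Date column uses a month/day/year format such as `10/01/2017` (an assumption, the README does not state the format) and uses hypothetical rows, apart from the Las Vegas figures quoted earlier.

```python
from collections import defaultdict
from datetime import datetime


def yearly_totals(rows, date_format="%m/%d/%Y"):
    """Aggregate shootings, fatalities, injured and total victims per year."""
    totals = defaultdict(
        lambda: {"Shootings": 0, "Fatalities": 0, "Injured": 0, "Total victims": 0}
    )
    for row in rows:
        year = datetime.strptime(row["Date"], date_format).year
        fatalities = int(row["Fatalities"])
        injured = int(row["Injured"])
        entry = totals[year]
        entry["Shootings"] += 1
        entry["Fatalities"] += fatalities
        entry["Injured"] += injured
        # Total victims is defined above as Fatalities + Injured.
        entry["Total victims"] += fatalities + injured
    return dict(sorted(totals.items()))


# Hypothetical rows, except the first one (figures quoted in this README).
per_year = yearly_totals([
    {"Date": "10/01/2017", "Fatalities": "58", "Injured": "515"},
    {"Date": "06/05/2017", "Fatalities": "5", "Injured": "0"},
    {"Date": "06/12/2016", "Fatalities": "3", "Injured": "2"},
])
print(per_year)
```

Each of the four values per year can then be passed to whatever plotting library is in use.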

Maybe there is a connection between shootings and cause. Let's see it.

ii) Mass Shootings Vs Cause

No of Shootings

GetImage

Total victims in the shootings

GetImage (1)

Mass Shootings Vs cause

GetImage (4)

We can see that the number of shootings is highest for the causes psycho and terrorism, and the total number of victims is highest for terrorism.
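Both views, number of shootings and total victims per cause, come from the same grouping. A minimal sketch with hypothetical rows and made-up victim counts; `summarize_by_cause` is an illustrative name:

```python
from collections import Counter


def summarize_by_cause(rows):
    """Return (shootings per cause, total victims per cause) as Counters."""
    shootings = Counter()
    victims = Counter()
    for row in rows:
        # Blank or missing causes are grouped under "unknown".
        cause = (row.get("Cause") or "").strip().lower() or "unknown"
        shootings[cause] += 1
        victims[cause] += int(row["Total victims"])
    return shootings, victims


# Hypothetical rows (assumption), not real dataset values.
shootings, victims = summarize_by_cause([
    {"Cause": "terrorism", "Total victims": "100"},
    {"Cause": "psycho", "Total victims": "5"},
    {"Cause": "Psycho", "Total victims": "7"},
    {"Cause": "", "Total victims": "2"},
])
print(shootings.most_common())
print(victims.most_common())
```

In this toy data one cause leads on the number of shootings while another leads on victims, which is the kind of difference the two plots are meant to expose.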

iii) Mass Shootings Vs Race

GetImage (5)

We can see that in both cases, i.e. number of shootings and total victims, White shooters account for the most, followed by Black shooters. But White people also form the majority of the US population, followed by Black people and then other minorities. So we cannot derive any inference from this.

Let's take this analysis further and let's compare the cause with the Mental Health Issues.

iv) Mental Health Issues vs Cause

Let's visualize it for No of Shootings

GetImage (6)

Let's visualize it for Total Victims

GetImage (7)

We can see that the number of casualties is higher when the shooter has mental health issues than when the shooter does not. An unknown mental health state is a scary issue. Many people have mental health issues due to an abusive childhood, bullies, a poisonous environment, lack of job opportunities etc.
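Comparing two categorical columns amounts to a cross-tabulation. The sketch below builds one as nested dictionaries, either counting shootings or summing a numeric column; the rows are hypothetical and `cross_tabulate` is an illustrative helper, not the code behind the plots.

```python
from collections import defaultdict


def cross_tabulate(rows, row_key, col_key, value_key=None):
    """Count rows per (row_key, col_key) pair, or sum value_key if given."""
    table = defaultdict(lambda: defaultdict(int))
    for row in rows:
        increment = int(row[value_key]) if value_key else 1
        table[row[row_key]][row[col_key]] += increment
    return {r: dict(cols) for r, cols in table.items()}


# Hypothetical rows (assumption), not real dataset values.
sample = [
    {"Cause": "psycho", "Mental Health Issues": "Yes", "Total victims": "10"},
    {"Cause": "psycho", "Mental Health Issues": "Yes", "Total victims": "4"},
    {"Cause": "terrorism", "Mental Health Issues": "No", "Total victims": "20"},
    {"Cause": "anger", "Mental Health Issues": "Unknown", "Total victims": "3"},
]

counts = cross_tabulate(sample, "Mental Health Issues", "Cause")
victims = cross_tabulate(sample, "Mental Health Issues", "Cause", "Total victims")
print(counts)
print(victims)
```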

v) Mass Shootings Vs State

GetImage (8)

The figure above does not give us the big picture for comparison, so let's move to another, more comparable plot.

GetImage (9)

vi) Mass Shootings in School Area

GetImage (10)

Let's break it down by the shooters' targets in school areas.

GetImage (11)

As expected, the targets are people at the school and random people who happened to be there, so no new inference can be drawn.

Inference

  1. We can see an increase in the number of mass shootings between 1984 and 2000. After that, a sudden increase was seen in 2007, the year of the Virginia Tech shooting. 2012 recorded the highest number of casualties compared to all previous years. 2015 and 2016 had a severely high number of shootings, resulting in many more casualties. Although 2017 had fewer shootings, its number of victims is the highest of any year to date; the main shooting in 2017 was the Las Vegas Strip concert mass shooting. This is only possible when common people have easy access to guns and their purchase is not controlled by the government.
  2. There seems to be a big gap between identifying that a person has a mental health issue and actually helping with that issue. Because of this, it is highly unpredictable whether a person will shoot. An abusive childhood, bullies, a poisonous environment, lack of job opportunities etc. result in an unpredictable mental state.

Solutions

  1. Limit the availability of guns in the market.
  2. For mental health, society must maintain good decorum among its members, which is nearly impossible.
