# Suicide Rate Analysis

This kernel presents a basic analysis of suicide rates in different countries from 1985 to 2016.

### Getting Started

Here we import the dependencies we need, load the data, and do some basic exploration.

In [None]:
# importing libraries
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

In [None]:
# getting the data
df = pd.read_csv('../input/master.csv')

In [None]:
# getting the basic overview of the data
df.head()

In [None]:
# shape of the dataset
df.shape

In [None]:
df_unique_country = df['country'].unique() # to find total countries in the dataset
df_unique_country
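A count is often easier to read than the full array of names. The following is a minimal sketch that uses a small made-up frame (`toy`, hypothetical) in place of `df`, and assumes only that a `country` column exists:

```python
import pandas as pd

# Hypothetical stand-in for df; the values are made up for illustration.
toy = pd.DataFrame({
    "country": ["Albania", "Albania", "Brazil", "Japan", "Japan"],
    "year": [1987, 1988, 1987, 1987, 1988],
})

n_countries = toy["country"].nunique()            # number of distinct countries
rows_per_country = toy["country"].value_counts()  # number of rows per country

print(n_countries)
print(rows_per_country)
```

Applied to the real data, the same two calls on `df['country']` would give the number of countries and how many records each one contributes.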

### Basic Data Visualization

Now we will plot some graphs to explore the relationships between various features of the dataset.

In [None]:
p = sns.countplot(x="sex", data=df)

The plot above shows that the dataset contains an equal number of records for each sex.

In [None]:
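# restrict to numeric columns, since corr() is only defined for numeric data
_ = sns.heatmap(df.select_dtypes(include='number').corr(), annot=True)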

The plot above shows the pairwise correlations between the numeric features in the dataset.

In [None]:
p = sns.barplot(x='sex', y='suicides_no', hue='age', data=df)

The plot above shows the average number of suicides per record for each age group, split by sex. Because `suicides_no` is a raw count rather than a rate, the bars compare counts, not rates. The count is highest in the 35-54 age group for both females and males, and lowest in the 5-14 age group.
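Raw counts depend on how large each age group is, so a per-capita rate can be a fairer comparison. The sketch below assumes the data has a `population` column alongside `suicides_no` (an assumption, since that column is not shown above) and uses a made-up frame `toy` instead of `df`:

```python
import pandas as pd

# Hypothetical stand-in for df; the numbers are made up for illustration.
toy = pd.DataFrame({
    "age": ["5-14 years", "5-14 years", "35-54 years", "35-54 years"],
    "suicides_no": [1, 3, 40, 60],
    "population": [100_000, 300_000, 200_000, 300_000],  # assumed column name
})

# Sum counts and population per age group, then convert to a rate per 100k.
totals = toy.groupby("age")[["suicides_no", "population"]].sum()
totals["rate_per_100k"] = totals["suicides_no"] / totals["population"] * 100_000

print(totals)
```

Summing before dividing weights each record by its population, which is usually preferable to averaging per-record rates.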

In [None]:
p = sns.barplot(x='sex', y='suicides_no', hue='generation', data=df)

The plot above shows the average number of suicides per record for each generation, split by sex. The count is highest for Boomers and lowest for Generation Z.

In [None]:
# pass x and y as keywords; recent seaborn versions reject them as positional arguments
p = sns.catplot(x='sex', y='suicides_no', hue='age', col='year', data=df, kind='bar', col_wrap=5)

The plot above shows the number of suicides for males and females in each age group, with one panel per year from 1985 to 2016.

In [None]:
age_15 = df.loc[df.loc[:, 'age']=='15-24 years',:]
age_35 = df.loc[df.loc[:, 'age']=='35-54 years',:]
age_75 = df.loc[df.loc[:, 'age']=='75+ years',:]
age_25 = df.loc[df.loc[:, 'age']=='25-34 years',:]
age_55 = df.loc[df.loc[:, 'age']=='55-74 years',:]
age_5 = df.loc[df.loc[:, 'age']=='5-14 years',:]

In [None]:
# label each line directly so the legend entries cannot be mismatched
p = sns.lineplot(x='year', y='suicides_no', data=age_5, label='5-14 years')
q = sns.lineplot(x='year', y='suicides_no', data=age_15, label='15-24 years')
r = sns.lineplot(x='year', y='suicides_no', data=age_25, label='25-34 years')
s = sns.lineplot(x='year', y='suicides_no', data=age_35, label='35-54 years')
t = sns.lineplot(x='year', y='suicides_no', data=age_55, label='55-74 years')
u = sns.lineplot(x='year', y='suicides_no', data=age_75, label='75+ years')

_ = plt.legend()

The plot above shows how the average number of suicides per record changes from year to year for each age group.
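If yearly totals are of interest instead of per-record averages, the same comparison can be built with a single `groupby` and no per-group filtering. This is a sketch on a made-up frame `toy` (hypothetical), not a replacement for the plot above:

```python
import pandas as pd

# Hypothetical stand-in for df; the numbers are made up for illustration.
toy = pd.DataFrame({
    "year": [1990, 1990, 1990, 1991, 1991, 1991],
    "age": ["5-14 years", "35-54 years", "35-54 years",
            "5-14 years", "35-54 years", "35-54 years"],
    "suicides_no": [2, 50, 30, 1, 45, 40],
})

# Total suicides per year and age group, reshaped to one column per age group.
yearly = toy.groupby(["year", "age"])["suicides_no"].sum().unstack("age")

print(yearly)
# yearly.plot() would then draw one line per age group (requires matplotlib).
```

The resulting table has years as rows and age groups as columns, so it can also be inspected directly.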

In [None]:
male_population = df.loc[df.loc[:, 'sex']=='male',:]
female_population = df.loc[df.loc[:, 'sex']=='female',:]

In [None]:
# label each line directly so the legend entries cannot be mismatched
p = sns.lineplot(x='year', y='suicides_no', data=male_population, label='males')
q = sns.lineplot(x='year', y='suicides_no', data=female_population, label='females')

_ = plt.legend()

The plot above shows how the average number of suicides per record changes from year to year for the male and female populations.