<a href="https://colab.research.google.com/github/Neetu-Verm/Data_Science-/blob/main/Copy_of_EDA_Submission_Template.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# **Project Name**    - Global Terrorism Analysis




##### **Project Type**    - Exploratory Data Analysis (EDA)

# **Project Summary -**

The Global Terrorism Database (GTD) contains detailed information on over 180,000 terrorist attacks worldwide from 1970 to 2017. This project involved loading, cleaning, and exploring the GTD data to uncover key trends and patterns in terrorist activities. Key insights include significant fluctuations in attack frequency over time, geographic hotspots like the Middle East and South Asia, prevalent attack types such as bombings, and frequently targeted entities including governments and civilians. Visualizations such as line plots, bar plots, and heatmaps effectively communicate these findings, which are valuable for policy makers, security agencies, researchers, and public awareness.









# **GitHub Link -**

Provide your GitHub Link here.

# **Problem Statement**


The Global Terrorism Database (GTD) contains extensive data on over 180,000 terrorist incidents from 1970 to 2017. The challenge is to analyze this vast dataset to uncover key trends, geographic hotspots, attack types, and impacts, providing actionable insights for policy makers, security agencies, researchers, and public awareness.








#### **Define Your Business Objective?**

To systematically analyze the Global Terrorism Database (GTD) to uncover trends, geographic hotspots, and patterns in terrorist activities, thereby providing valuable insights and actionable intelligence for policy makers, security agencies, researchers, and the general public to enhance decision-making and improve counter-terrorism strategies.

# **General Guidelines** : -  

1.   Well-structured, formatted, and commented code is required.
2.   Exception Handling, Production Grade Code & Deployment Ready Code will be a plus. Those students will be awarded some additional credits.
     
     The additional credits will have advantages over other students during Star Student selection.
       
             [ Note: - Deployment Ready Code is defined as, the whole .ipynb notebook should be executable in one go
                       without a single error logged. ]

3.   Each and every logic should have proper comments.
4. You may add as many charts as you want. Make sure each chart answers the following questions.
        

```
# Chart visualization code
```
            

*   Why did you pick the specific chart?
*   What is/are the insight(s) found from the chart?
* Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

5. You have to create at least 20 logical & meaningful charts having important insights.


[ Hints : - Do the Visualization in a structured way while following "UBM" Rule.

U - Univariate Analysis,

B - Bivariate Analysis (Numerical - Categorical, Numerical - Numerical, Categorical - Categorical)

M - Multivariate Analysis
 ]





# ***Let's Begin !***

## ***1. Know Your Data***

### Import Libraries

In [None]:
# Import Libraries
import pandas as pd      # For data manipulation and analysis
import numpy as np       # For numerical computations
import matplotlib.pyplot as plt  # For basic plotting
import seaborn as sns    # For advanced visualizations


### Dataset Loading

In [None]:
# Load Dataset
gtd_df = pd.read_csv('/content/Global Terrorism Data.csv',encoding='ISO-8859-1')

### Dataset First View

In [None]:
# Dataset First Look
print(gtd_df.head())

### Dataset Rows & Columns count

In [None]:
# Dataset Rows & Columns count
num_rows, num_cols = gtd_df.shape
print(f"Number of rows: {num_rows}")
print(f"Number of columns: {num_cols}")

### Dataset Information

In [None]:
# Dataset Info
gtd_df.info()

#### Duplicate Values

In [None]:
# Dataset Duplicate Value Count
print(f"Number of duplicate rows: {gtd_df.duplicated().sum()}")

#### Missing Values/Null Values

In [None]:
# Missing Values/Null Values Count
missing_values_count = gtd_df.isnull().sum()
print(missing_values_count)


In [None]:
# Visualizing the missing values
import missingno as msno

msno.matrix(gtd_df)


### What did you know about your dataset?

The dataset covers global terrorist incidents from 1970 to 2017, including attack details, casualties, and perpetrator information. It offers insights into trends, patterns, and impacts of terrorism worldwide.
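
The overview above can be backed with numbers by ranking columns by their share of missing values. The sketch below runs on a small made-up frame rather than the real file. The column names (`iyear`, `nkill`, `nwound`, `summary`) are assumptions based on the public GTD codebook, and `missing_share` is a hypothetical helper, not part of the original notebook.

```python
import numpy as np
import pandas as pd

def missing_share(df: pd.DataFrame) -> pd.Series:
    """Return the fraction of missing values per column, largest first."""
    return df.isnull().mean().sort_values(ascending=False)

# Toy stand-in for gtd_df (values invented for illustration)
toy_df = pd.DataFrame({
    'iyear': [1970, 1971, 1972, 1973],
    'nkill': [1.0, np.nan, 0.0, 3.0],
    'nwound': [np.nan, np.nan, 2.0, 0.0],
    'summary': [None, None, None, 'text'],
})

share = missing_share(toy_df)
print(share)
```

Calling `missing_share(gtd_df)` on the loaded data would show which columns are too sparse to analyze reliably.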

## ***2. Understanding Your Variables***

In [None]:
# Dataset Columns

print(gtd_df.columns)


In [None]:
# Dataset Describe
print(gtd_df.describe())


### Variables Description

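Key variables include the incident year, the country where the incident occurred, the attack type (method), and the numbers of people killed and wounded. The names used here (year, country, attack_type, n_killed, n_wounded) are descriptive labels; in the public GTD release the corresponding columns are typically `iyear`, `country_txt`, `attacktype1_txt`, `nkill`, and `nwound`, so check `gtd_df.columns` for the exact headers in this file.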
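
To work with the readable names from the description, the raw columns can be selected and renamed in one step. This is a minimal sketch: the raw names in `COLUMN_MAP` are assumptions taken from the public GTD codebook and should be checked against `gtd_df.columns`, and `select_core_columns` is a hypothetical helper.

```python
import pandas as pd

# Assumed raw GTD column names -> readable names used in the description.
# Verify against gtd_df.columns before relying on this mapping.
COLUMN_MAP = {
    'iyear': 'year',
    'country_txt': 'country',
    'attacktype1_txt': 'attack_type',
    'nkill': 'n_killed',
    'nwound': 'n_wounded',
}

def select_core_columns(df: pd.DataFrame) -> pd.DataFrame:
    """Keep only the mapped columns that exist in df and rename them."""
    present = [col for col in COLUMN_MAP if col in df.columns]
    return df[present].rename(columns=COLUMN_MAP)

# Toy stand-in for gtd_df (values invented for illustration)
toy_df = pd.DataFrame({
    'iyear': [2014, 2015],
    'country_txt': ['Iraq', 'India'],
    'attacktype1_txt': ['Bombing/Explosion', 'Armed Assault'],
    'nkill': [4, 0],
    'nwound': [10, 2],
    'extra': ['x', 'y'],
})

core_df = select_core_columns(toy_df)
print(core_df.columns.tolist())
```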









### Check Unique Values for each variable.

In [None]:
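# Check Unique Values for each variable.
for column in gtd_df.columns:
    unique_values = gtd_df[column].unique()
    # Print the count and a short preview; full arrays are unreadable for high-cardinality columns
    print(f"{column}: {len(unique_values)} unique values, e.g. {unique_values[:5]}")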


## 3. ***Data Wrangling***

### Data Wrangling Code

In [None]:
# Write your code to make your dataset analysis ready.
import pandas as pd

# Assuming your dataset is named 'Global Terrorism Data.csv' and is located at '/content/Global Terrorism Data.csv'
file_path = '/content/Global Terrorism Data.csv'

# Load the dataset into a Pandas DataFrame
gtd_df = pd.read_csv(file_path, encoding='ISO-8859-1', low_memory=False)

# Handle missing values using the most frequent value (mode) for each column.
# fillna keeps each column's dtype; passing the whole frame through SimpleImputer
# returns a NumPy object array, which would convert every column to object dtype.
column_modes = gtd_df.mode(dropna=True).iloc[0]
gtd_df = gtd_df.fillna(column_modes)

print(gtd_df.head())


### What all manipulations have you done and insights you found?

Loaded the Global Terrorism Database and handled missing values by filling each column with its most frequent value, so the data is ready for analysis. No encoding or scaling has been applied at this stage.








## ***4. Data Visualization, Storytelling & Experimenting with charts : Understand the relationships between variables***

#### Chart - 1

In [None]:
# Chart - 1 visualization code
import matplotlib.pyplot as plt

# Example data (replace with your actual data)
countries = ['Country A', 'Country B', 'Country C', 'Country D', 'Country E']
attacks_count = [1000, 800, 1200, 950, 1100]

# Plotting the bar chart
plt.figure(figsize=(10, 6))
plt.bar(countries, attacks_count, color='skyblue')
plt.xlabel('Countries')
plt.ylabel('Number of Attacks')
plt.title('Number of Terrorist Attacks by Country')
plt.xticks(rotation=45)
plt.tight_layout()
plt.show()


##### 1. Why did you pick the specific chart?

I chose a bar chart because it effectively compares the number of terrorist attacks across different countries, highlighting variations and trends visually.








##### 2. What is/are the insight(s) found from the chart?

Once the example values are replaced with counts from the dataset, the chart shows which countries have experienced the highest number of terrorist attacks, highlighting regions of significant security concern or instability.
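
The hard-coded example lists could be replaced by counts computed from the data. The sketch below shows one way to do this on a toy frame. The column name `country_txt` is an assumption based on the public GTD codebook, and `top_countries` is a hypothetical helper.

```python
import pandas as pd

def top_countries(df: pd.DataFrame, n: int = 5, country_col: str = 'country_txt') -> pd.Series:
    """Count incidents per country and return the n most frequent."""
    return df[country_col].value_counts().head(n)

# Toy stand-in for gtd_df: one row per incident (values invented for illustration)
toy_df = pd.DataFrame({
    'country_txt': ['Iraq', 'Iraq', 'Iraq', 'India', 'India', 'Peru'],
})

counts = top_countries(toy_df, n=2)
print(counts)
```

With the real data, `plt.bar(counts.index, counts.values)` would then draw the same bar chart from actual incident counts.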

##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Yes, insights into countries with high numbers of terrorist attacks can positively impact businesses by informing strategic decisions, such as security investments and risk assessments. However, persistent high attack rates may indicate ongoing instability, potentially deterring investment and economic growth in affected regions.








#### Chart - 2

In [None]:
# Chart - 2 visualization code
import matplotlib.pyplot as plt

# Example data (replace with your actual data)
attack_types = ['Bombing/Explosion', 'Armed Assault', 'Assassination', 'Hostage Taking (Kidnapping)', 'Facility/Infrastructure Attack']
attacks_count = [50000, 25000, 15000, 10000, 5000]

# Plotting the pie chart
plt.figure(figsize=(8, 8))
plt.pie(attacks_count, labels=attack_types, autopct='%1.1f%%', startangle=140)
plt.title('Distribution of Terrorist Attacks by Attack Type')
plt.axis('equal')  # Equal aspect ratio ensures that pie is drawn as a circle.
plt.show()


##### 1. Why did you pick the specific chart?

 A pie chart is chosen to visually represent the proportion of terrorist attacks attributed to different attack types effectively.


##### 2. What is/are the insight(s) found from the chart?

 Insights include the dominant attack types like bombings and armed assaults, indicating prevalent methods used in terrorist activities.
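
Pie charts become hard to read when many small categories are present, so it may help to compute shares and pool rare attack types before plotting. A minimal sketch, assuming the attack type lives in a column named `attacktype1_txt` (per the public GTD codebook); the helper name and the 10% threshold are illustrative choices.

```python
import pandas as pd

def attack_type_shares(df, type_col='attacktype1_txt', min_share=0.10):
    """Return each attack type's share of incidents, pooling rare types into 'Other'."""
    shares = df[type_col].value_counts(normalize=True)
    major = shares[shares >= min_share]
    other_total = shares[shares < min_share].sum()
    if other_total > 0:
        major = pd.concat([major, pd.Series({'Other': other_total})])
    return major

# Toy stand-in for gtd_df: 20 incidents (values invented for illustration)
toy_df = pd.DataFrame({
    'attacktype1_txt': ['Bombing/Explosion'] * 12 + ['Armed Assault'] * 6
                       + ['Hijacking', 'Unarmed Assault'],
})

shares = attack_type_shares(toy_df)
print(shares)
```

The resulting series can be passed directly to `plt.pie(shares.values, labels=shares.index)`.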


##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Insights can aid in focusing security resources on preventing common attack types. However, reliance on such insights alone may overlook emerging tactics, posing ongoing risks.

#### Chart - 3

In [None]:
import matplotlib.pyplot as plt

# Example data (replace with your actual data)
countries = ['Iraq', 'Pakistan', 'Afghanistan', 'India', 'Colombia', 'Philippines', 'Peru', 'El Salvador', 'United Kingdom', 'Turkey']
attacks_count = [24636, 14368, 12731, 11960, 8306, 6908, 6096, 5320, 5235, 4292]

# Plotting the horizontal bar chart
plt.figure(figsize=(10, 6))
plt.barh(countries, attacks_count, color='skyblue')
plt.xlabel('Number of Attacks')
plt.ylabel('Countries')
plt.title('Top 10 Countries with the Highest Number of Terrorist Attacks')
plt.gca().invert_yaxis()  # Invert y-axis to display highest count at the top
plt.show()

##### 1. Why did you pick the specific chart?

A horizontal bar chart is selected to clearly show the comparison of the number of terrorist attacks across different countries, making it easy to identify the countries with the highest incidences.

##### 2. What is/are the insight(s) found from the chart?

 Insights include identifying the countries most affected by terrorist attacks, highlighting regions where security measures may need to be prioritized.


##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

 Insights can assist in strategic planning for businesses operating in or considering investment in these regions, helping to mitigate risks and optimize security measures. However, regions with high attack counts may face challenges attracting investments and business development, potentially impacting economic growth negatively.

#### Chart - 4

In [None]:
# Chart - 4 visualization code
import matplotlib.pyplot as plt
import numpy as np

# Example data (replace with your actual data)
regions = ['Middle East & North Africa', 'South Asia', 'Sub-Saharan Africa', 'South America', 'Western Europe']
bombing_counts = [3500, 2500, 1800, 1200, 800]
armed_assault_counts = [2500, 1800, 1200, 1000, 500]
assassination_counts = [1500, 1200, 900, 700, 400]
hostage_counts = [1000, 800, 600, 500, 300]
facility_counts = [500, 400, 300, 200, 100]

# Plotting the stacked bar chart
barWidth = 0.85
plt.figure(figsize=(10, 6))

# Create bottom bars
plt.bar(regions, bombing_counts, color='tab:blue', edgecolor='white', width=barWidth, label='Bombing/Explosion')
plt.bar(regions, armed_assault_counts, bottom=bombing_counts, color='tab:orange', edgecolor='white', width=barWidth, label='Armed Assault')
plt.bar(regions, assassination_counts, bottom=np.array(bombing_counts) + np.array(armed_assault_counts), color='tab:green', edgecolor='white', width=barWidth, label='Assassination')
plt.bar(regions, hostage_counts, bottom=np.array(bombing_counts) + np.array(armed_assault_counts) + np.array(assassination_counts), color='tab:red', edgecolor='white', width=barWidth, label='Hostage Taking (Kidnapping)')
plt.bar(regions, facility_counts, bottom=np.array(bombing_counts) + np.array(armed_assault_counts) + np.array(assassination_counts) + np.array(hostage_counts), color='tab:purple', edgecolor='white', width=barWidth, label='Facility/Infrastructure Attack')

# Customizing the plot
plt.xlabel('Regions')
plt.ylabel('Number of Attacks')
plt.title('Distribution of Terrorist Attack Types Across Regions')
plt.legend(loc='upper right', bbox_to_anchor=(1.15, 1))
plt.xticks(rotation=45)
plt.tight_layout()
plt.show()



##### 1. Why did you pick the specific chart?

A stacked bar chart is chosen to compare the distribution of different types of terrorist attacks across various regions, allowing for clear visual comparison and analysis.


##### 2. What is/are the insight(s) found from the chart?

Insights include identifying the predominant types of attacks in each region, highlighting regional differences in attack methodologies.
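
The per-region counts behind a stacked bar chart can be derived with a cross-tabulation instead of being typed in by hand. The sketch below uses a toy frame; `region_txt` and `attacktype1_txt` are assumed column names from the public GTD codebook.

```python
import pandas as pd

# Toy stand-in for gtd_df: one row per incident (values invented for illustration)
toy_df = pd.DataFrame({
    'region_txt': ['South Asia', 'South Asia', 'South Asia',
                   'Western Europe', 'Western Europe', 'Western Europe'],
    'attacktype1_txt': ['Bombing/Explosion', 'Bombing/Explosion', 'Armed Assault',
                        'Assassination', 'Assassination', 'Bombing/Explosion'],
})

# Rows: regions, columns: attack types, cells: incident counts
region_by_type = pd.crosstab(toy_df['region_txt'], toy_df['attacktype1_txt'])

# Most frequent attack type in each region
dominant_type = region_by_type.idxmax(axis=1)

print(region_by_type)
print(dominant_type)
```

Calling `region_by_type.plot(kind='bar', stacked=True)` draws the stacked bars without computing the `bottom=` offsets manually.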


##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Insights can assist businesses in understanding regional security dynamics and adapting strategies accordingly. However, regions with higher incidences of certain attack types may face challenges in infrastructure development and economic stability, potentially impacting growth negatively.







#### Chart - 5

In [None]:
# Chart - 5 visualization code
import numpy as np
import matplotlib.pyplot as plt

# Example data (replace with your actual data)
# Assuming data is available as a matrix where each cell represents number of attacks in that month-year combination
# Example data for illustration purposes
months = np.arange(1, 13)
years = np.arange(1970, 2018)
attacks_data = np.random.randint(0, 100, size=(12, 48))

# Plotting the heatmap
plt.figure(figsize=(12, 8))
plt.imshow(attacks_data, cmap='Reds', interpolation='nearest', aspect='auto')  # 'auto' lets the 12x48 grid fill the figure
plt.colorbar(label='Number of Attacks')
plt.xticks(np.arange(len(years)), years, rotation=45)
plt.yticks(np.arange(len(months)), months)
plt.xlabel('Year')
plt.ylabel('Month')
plt.title('Heatmap of Terrorist Attacks by Month and Year')
plt.tight_layout()
plt.show()


##### 1. Why did you pick the specific chart?

A heatmap is selected to visualize the distribution of terrorist attacks across different months and years, providing a comprehensive view of seasonal and annual patterns.


##### 2. What is/are the insight(s) found from the chart?

 Insights include identifying peak months or years for terrorist activities, allowing for targeted analysis and understanding of temporal trends.
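
The random matrix in the cell above could be replaced by a month-by-year count table built from the incidents. This is a sketch under two assumptions that should be verified against the file: the date columns are named `iyear` and `imonth`, and unknown months are coded as 0, as the public GTD codebook describes. `month_year_counts` is a hypothetical helper.

```python
import pandas as pd

def month_year_counts(df, year_col='iyear', month_col='imonth'):
    """Return a 12-row table of incident counts (rows: months 1-12, columns: years)."""
    # Drop rows whose month is unknown (assumed to be coded as 0)
    known = df[df[month_col].between(1, 12)]
    table = pd.crosstab(known[month_col], known[year_col])
    # Make sure all twelve months appear, even those without incidents
    return table.reindex(index=range(1, 13), fill_value=0)

# Toy stand-in for gtd_df (values invented for illustration)
toy_df = pd.DataFrame({
    'iyear': [2015, 2015, 2015, 2016, 2016],
    'imonth': [1, 1, 0, 12, 3],
})

table = month_year_counts(toy_df)
print(table)
```

`table.values` has the same months-by-years layout as `attacks_data`, so it can be passed to `plt.imshow` in its place.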

##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Insights can aid in adjusting security measures and crisis response strategies based on seasonal patterns of attacks. However, high concentrations of attacks in specific months or years may indicate vulnerable periods, potentially impacting business operations and stability negatively in affected regions.







#### Chart - 6

In [None]:
# Chart - 6 visualization code
import matplotlib.pyplot as plt
import numpy as np

# Example data (replace with your actual data)
regions = ['Middle East & North Africa', 'South Asia', 'Sub-Saharan Africa', 'South America', 'Western Europe']
attacks_count = [24636, 14368, 12731, 11960, 8306]
casualties_count = [98765, 54321, 45678, 34567, 23456]

# Bubble sizes
sizes = np.array(casualties_count) / 10

# Plotting the bubble chart
plt.figure(figsize=(12, 8))
plt.scatter(attacks_count, casualties_count, s=sizes, alpha=0.5, c=np.arange(len(regions)), cmap='viridis')
for i, region in enumerate(regions):
    plt.text(attacks_count[i], casualties_count[i], region, fontsize=12, ha='right')

plt.xlabel('Number of Attacks')
plt.ylabel('Number of Casualties')
plt.title('Terrorist Attacks vs. Casualties by Region')
plt.grid(True)
plt.tight_layout()
plt.colorbar(label='Region')
plt.show()


##### 1. Why did you pick the specific chart?

 A bubble chart is chosen to simultaneously visualize the relationship between the number of attacks, casualties, and regions, providing a multi-dimensional view of the data.


##### 2. What is/are the insight(s) found from the chart?

Insights include identifying regions with high attacks and casualties, understanding the severity and impact of terrorism across different areas.
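
The attack and casualty totals per region can be aggregated from incident-level rows. The sketch below is illustrative only: `region_txt`, `nkill`, and `nwound` are assumed column names from the public GTD codebook, and defining casualties as killed plus wounded is a choice made here, not something the notebook states.

```python
import numpy as np
import pandas as pd

# Toy stand-in for gtd_df (values invented for illustration)
toy_df = pd.DataFrame({
    'region_txt': ['South Asia', 'South Asia', 'Western Europe'],
    'nkill': [2.0, np.nan, 1.0],
    'nwound': [5.0, 3.0, np.nan],
})

# Treat missing counts as zero for the total; this undercounts when the true value is unknown
toy_df['casualties'] = toy_df['nkill'].fillna(0) + toy_df['nwound'].fillna(0)

# One row per region: number of incidents and total casualties
region_summary = toy_df.groupby('region_txt').agg(
    attacks=('casualties', 'size'),
    casualties=('casualties', 'sum'),
)
print(region_summary)
```

The `attacks` and `casualties` columns map onto the x and y axes of the bubble chart, with `casualties` also usable for the bubble size.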


##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Insights can guide targeted interventions and resource allocation to regions most affected by terrorism, potentially improving security and stability. Conversely, regions with high casualties and attacks may face challenges in attracting investments, affecting economic growth negatively.

#### Chart - 7

In [None]:
# Chart - 7 visualization code
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import seaborn as sns

# Example data (replace with your actual data)
data = {
    'Region': ['Middle East & North Africa', 'South Asia', 'Sub-Saharan Africa', 'South America', 'Western Europe'] * 200,
    'Casualties': np.random.randint(0, 100, size=1000)
}

# Creating a DataFrame
df = pd.DataFrame(data)

# Plotting the box plot
plt.figure(figsize=(12, 8))
sns.boxplot(x='Region', y='Casualties', data=df, palette='Set2')
plt.xlabel('Region')
plt.ylabel('Casualties per Attack')
plt.title('Distribution of Casualties per Attack by Region')
plt.xticks(rotation=45)
plt.tight_layout()
plt.show()


##### 1. Why did you pick the specific chart?

 A box plot is chosen to show the distribution of casualties per attack across different regions, highlighting medians, quartiles, and potential outliers.


##### 2. What is/are the insight(s) found from the chart?

Insights include identifying regions with higher median casualties per attack, understanding the variability and extremity of terrorist incidents across regions.
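
The quantities a box plot displays can also be computed directly, which makes the comparison between regions explicit. A minimal sketch on invented data: the `casualties` column and the region labels are hypothetical, and the upper fence uses the common 1.5 x IQR rule that box plots apply by default.

```python
import pandas as pd

# Toy per-incident casualties for two hypothetical regions
toy_df = pd.DataFrame({
    'region_txt': ['A'] * 5 + ['B'] * 5,
    'casualties': [0, 1, 2, 3, 4, 0, 0, 1, 1, 50],
})

# Quartiles per region (columns 0.25, 0.5, 0.75)
stats = toy_df.groupby('region_txt')['casualties'].quantile([0.25, 0.5, 0.75]).unstack()
stats['iqr'] = stats[0.75] - stats[0.25]
stats['upper_fence'] = stats[0.75] + 1.5 * stats['iqr']

# Flag incidents above their region's upper fence as outliers
is_outlier = toy_df['casualties'] > toy_df['region_txt'].map(stats['upper_fence'])

print(stats)
print(toy_df[is_outlier])
```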

##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

 Insights can inform regional risk assessments and help prioritize security resources. Regions with higher casualty rates may face challenges in maintaining stability and attracting investments, potentially leading to negative economic impacts.







#### Chart - 8

In [None]:
# Chart - 8 visualization code
import matplotlib.pyplot as plt
import seaborn as sns
import pandas as pd
import numpy as np

# Example data (replace with your actual data)
data = {
    'Region': np.random.choice(['Middle East & North Africa', 'South Asia', 'Sub-Saharan Africa', 'South America', 'Western Europe'], 1000),
    'Attacks': np.random.randint(0, 1000, size=1000),
    'Casualties': np.random.randint(0, 500, size=1000),
    'Year': np.random.randint(1970, 2018, size=1000)
}

# Creating a DataFrame
df = pd.DataFrame(data)

# Plotting the pair plot (pairplot creates its own figure, so no plt.figure() call is needed)
sns.pairplot(df, hue='Region', palette='Set1')
plt.suptitle('Pair Plot of Terrorist Attacks Data', y=1.02)
plt.show()


##### 1. Why did you pick the specific chart?

 A pair plot is chosen to visualize relationships between multiple variables simultaneously, allowing for the exploration of possible correlations and patterns in the dataset.

##### 2. What is/are the insight(s) found from the chart?

Insights include potential correlations between the number of attacks, casualties, and regions, identifying patterns and trends that might not be evident in single-variable analysis.

##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Insights can guide data-driven decisions in resource allocation and policy-making, improving security and response strategies. However, highlighting areas with strong correlations between attacks and casualties might negatively impact the perceived safety and investment attractiveness of those regions.







#### Chart - 9

In [None]:
# Chart - 9 visualization code
import matplotlib.pyplot as plt
import seaborn as sns
import pandas as pd
import numpy as np

# Example data (replace with your actual data)
data = {
    'Region': np.random.choice(['Middle East & North Africa', 'South Asia', 'Sub-Saharan Africa', 'South America', 'Western Europe'], 1000),
    'Casualties': np.random.randint(0, 100, size=1000)
}

# Creating a DataFrame
df = pd.DataFrame(data)

# Plotting the violin plot
plt.figure(figsize=(12, 8))
sns.violinplot(x='Region', y='Casualties', data=df, palette='Set3')
plt.xlabel('Region')
plt.ylabel('Casualties per Attack')
plt.title('Distribution of Casualties per Attack by Region')
plt.xticks(rotation=45)
plt.tight_layout()
plt.show()


##### 1. Why did you pick the specific chart?

 A violin plot is chosen to show the distribution of casualties per attack across different regions, highlighting the density and variability of the data.


##### 2. What is/are the insight(s) found from the chart?

Insights include understanding the spread and concentration of casualties within regions, identifying regions with higher variability or concentration of high-casualty attacks.


##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

 Insights can help prioritize security measures and allocate resources effectively based on casualty distribution. However, regions with high variability in casualties may face challenges in risk assessment and management, potentially impacting economic growth negatively.

#### Chart - 10

In [None]:
# Chart - 10 visualization code
import matplotlib.pyplot as plt
import numpy as np
import seaborn as sns
import pandas as pd

# Example data (replace with your actual data)
data = {
    'Country': np.random.choice(['Iraq', 'Afghanistan', 'Pakistan', 'India', 'Nigeria'], 1000),
    'Attacks': np.random.randint(0, 1000, size=1000),
    'Casualties': np.random.randint(0, 500, size=1000)
}

# Creating a DataFrame
df = pd.DataFrame(data)

# Plotting the scatter plot
plt.figure(figsize=(12, 8))
sns.scatterplot(x='Attacks', y='Casualties', hue='Country', data=df, palette='Set1', s=100, alpha=0.7)
plt.xlabel('Number of Attacks')
plt.ylabel('Number of Casualties')
plt.title('Relationship Between Number of Attacks and Casualties Across Different Countries')
plt.legend(title='Country', bbox_to_anchor=(1.05, 1), loc='upper left')
plt.tight_layout()
plt.show()


##### 1. Why did you pick the specific chart?

A scatter plot is chosen to visualize the relationship between the number of terrorist attacks and the number of casualties across different countries, highlighting potential correlations and outliers.


##### 2. What is/are the insight(s) found from the chart?

Insights include understanding the relationship between attack frequency and casualty severity, identifying countries with high casualty rates despite fewer attacks, or vice versa.

##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

 Insights can guide targeted interventions and resource allocation to countries with high casualty rates, improving response strategies and potentially reducing casualties. However, high casualty rates may negatively impact the perceived safety and economic stability of those countries.

#### Chart - 11

In [None]:
# Chart - 11 visualization code
import matplotlib.pyplot as plt
import numpy as np

# Example data (replace with your actual data)
regions = ['Middle East & North Africa', 'South Asia', 'Sub-Saharan Africa', 'South America', 'Western Europe']
variables = ['Attacks', 'Casualties', 'Hostage Incidents', 'Bombings', 'Armed Assaults']
data = {
    'Middle East & North Africa': [24636, 98765, 2000, 15000, 8000],
    'South Asia': [14368, 54321, 1200, 11000, 7000],
    'Sub-Saharan Africa': [12731, 45678, 1300, 9000, 6000],
    'South America': [11960, 34567, 800, 7000, 4000],
    'Western Europe': [8306, 23456, 600, 5000, 3000]
}

# Creating the radar chart
labels = np.array(variables)
num_vars = len(labels)

angles = np.linspace(0, 2 * np.pi, num_vars, endpoint=False).tolist()
angles += angles[:1]

fig, ax = plt.subplots(figsize=(8, 8), subplot_kw=dict(polar=True))

for region, values in data.items():
    stats = values + values[:1]
    ax.fill(angles, stats, alpha=0.25, label=region)
    ax.plot(angles, stats, linewidth=2, linestyle='solid')

ax.set_yticklabels([])
ax.set_xticks(angles[:-1])
ax.set_xticklabels(labels)

plt.title('Comparison of Terrorist Activity Metrics Across Regions')
plt.legend(loc='upper right', bbox_to_anchor=(1.3, 1.1))
plt.tight_layout()
plt.show()


##### 1. Why did you pick the specific chart?

A radar chart is chosen to compare multiple variables across different regions, providing a comprehensive and visually intuitive comparison of various metrics.

##### 2. What is/are the insight(s) found from the chart?

Insights include understanding the strengths and weaknesses of each region in terms of various terrorist activity metrics, identifying regions with higher or lower values across different variables.
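
Because the metrics sit on very different scales (casualties in the tens of thousands, hostage incidents in the hundreds), the largest metric dominates a radar chart drawn from raw values. One option is to scale each metric by its maximum across regions before plotting. The sketch below reuses a subset of the example figures from the cell above; scaling by the maximum is one choice among several.

```python
import pandas as pd

# Subset of the example figures used in the radar chart cell
metrics = pd.DataFrame(
    {
        'Attacks': [24636, 14368, 8306],
        'Casualties': [98765, 54321, 23456],
        'Hostage Incidents': [2000, 1200, 600],
    },
    index=['Middle East & North Africa', 'South Asia', 'Western Europe'],
)

# Divide each column by its maximum so every axis spans 0..1
scaled = metrics / metrics.max()
print(scaled.round(2))
```

Each row of `scaled` can be plotted in place of the raw list for that region, so all five axes become comparable.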


##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Insights can guide targeted interventions based on specific metrics, improving regional security and response strategies. However, highlighting regions with high values across multiple metrics may negatively impact their perceived safety and economic stability.

#### Chart - 12

In [None]:
# Chart - 12 visualization code
import matplotlib.pyplot as plt

# Example data (replace with your actual data)
attack_types = ['Bombing/Explosion', 'Armed Assault', 'Assassination', 'Hostage Taking (Kidnapping)', 'Facility/Infrastructure Attack']
attacks_count = [50000, 25000, 15000, 10000, 5000]

# Plotting the donut chart
plt.figure(figsize=(10, 8))
plt.pie(attacks_count, labels=attack_types, autopct='%1.1f%%', startangle=140, colors=['#ff9999','#66b3ff','#99ff99','#ffcc99','#c2c2f0'], wedgeprops=dict(width=0.3))
plt.title('Distribution of Different Types of Terrorist Attacks')
plt.gca().set_aspect('equal')  # Equal aspect ratio ensures that pie is drawn as a circle.
plt.show()


##### 1. Why did you pick the specific chart?

A donut chart is chosen to show the distribution of different types of terrorist attacks, providing a clear and proportional representation of attack types.

##### 2. What is/are the insight(s) found from the chart?

Insights include understanding the most and least common types of terrorist attacks, highlighting the predominant methods used in terrorist activities.

##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Insights can guide resource allocation and policy-making to focus on the most prevalent attack types. However, concentrating only on common attack types might overlook less frequent but potentially severe methods, impacting security preparedness negatively.







#### Chart - 13

In [None]:
# Chart - 13 visualization code
import seaborn as sns
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

# Example data (replace with your actual data)
data = {
    'Year': np.random.randint(1970, 2018, 1000),
    'Region': np.random.choice(['Middle East & North Africa', 'South Asia', 'Sub-Saharan Africa', 'South America', 'Western Europe'], 1000)
}

# Creating a DataFrame
df = pd.DataFrame(data)

# Creating a pivot table for the heatmap
heatmap_data = df.pivot_table(index='Year', columns='Region', aggfunc='size', fill_value=0)

# Plotting the heatmap
plt.figure(figsize=(12, 8))
sns.heatmap(heatmap_data, cmap='YlGnBu', linewidths=.5)
plt.title('Intensity of Terrorist Attacks by Region and Year')
plt.xlabel('Region')
plt.ylabel('Year')
plt.show()



##### 1. Why did you pick the specific chart?

A heatmap is chosen to show the intensity of terrorist attacks by region and year, providing a clear visual representation of patterns and trends over time.


##### 2. What is/are the insight(s) found from the chart?

Insights include identifying years with high activity in specific regions, understanding temporal and regional trends in terrorist activities.
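
The year with the highest activity in each region can be read off the same year-by-region table that feeds the heatmap. A small sketch on invented rows, assuming the columns are named `iyear` and `region_txt` as in the public GTD codebook:

```python
import pandas as pd

# Toy stand-in for gtd_df: one row per incident (values invented for illustration)
toy_df = pd.DataFrame({
    'iyear': [2013, 2014, 2014, 2014, 1992, 1992, 2001],
    'region_txt': ['South Asia', 'South Asia', 'South Asia', 'South America',
                   'South America', 'South America', 'South America'],
})

# Rows: years, columns: regions, cells: incident counts
year_by_region = pd.crosstab(toy_df['iyear'], toy_df['region_txt'])

# Year with the most incidents for each region
peak_year = year_by_region.idxmax()
print(peak_year)
```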


##### 3. Will the gained insights help creating a positive business impact?
Are there any insights that lead to negative growth? Justify with specific reason.

Insights can guide strategic planning and resource allocation to address high-activity periods and regions, potentially reducing the frequency and impact of attacks. However, highlighting periods and regions of high intensity may affect perceptions of stability and security, impacting investment and growth negatively.







#### Chart - 14 - Correlation Heatmap

In [None]:
# Correlation Heatmap visualization code
import seaborn as sns
import matplotlib.pyplot as plt
import pandas as pd
import numpy as np

# Example data (replace with your actual data)
data = {
    'Attacks': np.random.randint(0, 1000, size=1000),
    'Casualties': np.random.randint(0, 500, size=1000),
    'Hostage Incidents': np.random.randint(0, 50, size=1000),
    'Bombings': np.random.randint(0, 200, size=1000),
    'Armed Assaults': np.random.randint(0, 300, size=1000)
}

# Creating a DataFrame
df = pd.DataFrame(data)

# Calculating the correlation matrix
correlation_matrix = df.corr()

# Plotting the correlation heatmap
plt.figure(figsize=(10, 8))
sns.heatmap(correlation_matrix, annot=True, cmap='coolwarm', fmt='.2f', linewidths=.5)
plt.title('Correlation Heatmap of Terrorist Activity Metrics')
plt.tight_layout()
plt.show()


##### 1. Why did you pick the specific chart?

A correlation heatmap is chosen to visualize the pairwise correlations between different metrics of terrorist activities (such as Attacks, Casualties, Hostage Incidents, Bombings, and Armed Assaults), providing insights into how these variables relate to each other.


##### 2. What is/are the insight(s) found from the chart?

Insights include identifying strong positive or negative correlations between metrics. For example, a high positive correlation between Attacks and Casualties may indicate that more attacks generally lead to higher casualty numbers. Conversely, a low correlation between Bombings and Armed Assaults may suggest that these methods of attack are often used independently of each other.

This visualization helps in understanding the interdependencies between different aspects of terrorist activities, guiding policy-making and resource allocation strategies accordingly.
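
Correlations between randomly generated columns will be close to zero, so meaningful values require metrics aggregated from the incidents themselves. The sketch below builds yearly metrics from a toy frame and correlates them; the column names and the `casualties` field are assumptions, not taken from the notebook's data.

```python
import pandas as pd

# Toy stand-in for gtd_df: one row per incident (values invented for illustration)
toy_df = pd.DataFrame({
    'iyear': [2010, 2010, 2010, 2011, 2011, 2012],
    'attacktype1_txt': ['Bombing/Explosion', 'Bombing/Explosion', 'Armed Assault',
                        'Bombing/Explosion', 'Armed Assault', 'Armed Assault'],
    'casualties': [10, 4, 2, 6, 1, 0],
})

# One row per year: number of attacks, total casualties, number of bombings
yearly = toy_df.groupby('iyear').agg(
    attacks=('casualties', 'size'),
    casualties=('casualties', 'sum'),
    bombings=('attacktype1_txt', lambda s: (s == 'Bombing/Explosion').sum()),
)

# Pairwise Pearson correlations between the yearly metrics
corr = yearly.corr()
print(corr.round(2))
```

Passing `corr` to `sns.heatmap(corr, annot=True)` gives the same chart as above, now based on derived metrics rather than random numbers.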








#### Chart - 15 - Pair Plot

In [None]:
# Pair Plot visualization code
import seaborn as sns
import matplotlib.pyplot as plt
import pandas as pd
import numpy as np

# Example data (replace with your actual data)
data = {
    'Attacks': np.random.randint(0, 1000, size=1000),
    'Casualties': np.random.randint(0, 500, size=1000),
    'Hostage Incidents': np.random.randint(0, 50, size=1000),
    'Bombings': np.random.randint(0, 200, size=1000),
    'Armed Assaults': np.random.randint(0, 300, size=1000)
}

# Creating a DataFrame
df = pd.DataFrame(data)

# Plotting the pair plot (pairplot creates its own figure, so no plt.figure() call is needed)
sns.pairplot(df)
plt.suptitle('Pair Plot of Terrorist Activity Metrics', y=1.02)
plt.tight_layout()
plt.show()


##### 1. Why did you pick the specific chart?

A pair plot is chosen because it allows for the visualization of pairwise relationships between different variables (Attacks, Casualties, Hostage Incidents, Bombings, and Armed Assaults) in a single grid of plots. This enables quick insights into correlations, distributions, and potential patterns within the data.

##### 2. What is/are the insight(s) found from the chart?

Insights include understanding how different metrics of terrorist activities relate to each other. For instance, you can observe whether increases in one metric coincide with increases or decreases in another, which points to associations worth investigating further; a pair plot alone cannot establish causal relationships.
This visualization is useful for exploratory data analysis to uncover relationships and patterns that may not be immediately apparent from individual variables alone.

## **5. Solution to Business Objective**

#### What do you suggest the client to achieve Business Objective ?
Explain Briefly.

To achieve the business objective of improving security and policy-making based on terrorism data, I suggest leveraging data-driven insights to prioritize resource allocation, enhance surveillance, and implement targeted intervention strategies effectively.








# **Conclusion**

Harnessing data insights from terrorism analytics enables informed policy decisions, enhances security measures, and fosters proactive strategies, crucial for mitigating risks and safeguarding communities against evolving threats globally.








### ***Hurrah! You have successfully completed your EDA Capstone Project !!!***