Skip to content

Shark attacks in Teenagers - Data cleaning & wrangling project

Notifications You must be signed in to change notification settings

Monica-Duarte11/Shark-attacks-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Shark attacks in Teenagers

cover

Data cleaning & data wrangling Project 🦈

This project focuses on cleaning and analyzing the global shark attack database (from the Global Shark Attack File's).

For this, we have used python as programming language and Power BI as the tool for the final visualizations, applying data cleaning, wrangling and analysis techniques with pandas, matplotlib and regex libraries.

Index

💻 The project 21_05_teenagers.ipynb file contains the code for cleaning and analyzing the database.

📚 In the file src.py can be found the functions used for the cleaning of the data set.

👩‍🏫 In Shark attacks.pdf you can see the presentation made for the pitch of the project.

Hypothesis

For data cleaning and analysis, we looked for a hypothesis that could be demonstrated and validated through the available data.

As the records gather demographic information on each attack, we examined a scenario centered on a social and age group that, because of the characteristics generally associated with their behavior, could make them easy targets for sharks: teens.

Taking into account the above arguments, the final hypothesis was: "Teenagers are the demographic group most likely to be attacked by sharks".

Visualizations

To see the visualizations created to understand the data and validate the hypothesis, you can access the link below:

https://app.powerbi.com/view?r=eyJrIjoiZDY1ZGNmNGEtZjhkNS00NGMxLWFiNGYtZWNhNTU3N2EyODY5IiwidCI6IjM1ODlkOTA0LTdiOTAtNDQyMi1hOWNmLTM5YzZlNGJkMDYyYyIsImMiOjR9

About

Shark attacks in Teenagers - Data cleaning & wrangling project

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published