Example of applying data science using Python & Pandas to the Titanic data set

markwkiehl/py_data_science_titanic

py_data_science_titanic

Overview

Read the Titanic dataset and analyze it with Pandas and scikit-learn.

Get the Titanic data set: https://www.kaggle.com/datasets/yasserh/titanic-dataset
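A minimal sketch of the loading and cleaning step follows. It is not the code from this repository. The file name `Titanic-Dataset.csv`, the column names (`Survived`, `Pclass`, `Sex`, `Age`, `Fare`, `Embarked`), the port codes, and the helper names `load_titanic` and `clean_titanic` are all assumptions based on the usual Kaggle Titanic layout; adjust them to match the file you download. A small in-memory sample stands in for the real file so the sketch runs on its own.

```python
import pandas as pd


def load_titanic(path: str = "Titanic-Dataset.csv") -> pd.DataFrame:
    """Read the downloaded CSV. The default file name is an assumption."""
    return pd.read_csv(path)


def clean_titanic(df: pd.DataFrame) -> pd.DataFrame:
    """Fill missing values and encode text columns as integers.

    Column names and the port codes S/C/Q are assumed, not taken from
    this repository.
    """
    out = df.copy()
    # Replace missing ages with the median age of the known passengers.
    out["Age"] = out["Age"].fillna(out["Age"].median())
    # Replace missing embarkation ports with the most common port.
    out["Embarked"] = out["Embarked"].fillna(out["Embarked"].mode()[0])
    # Encode the two text columns as small integers.
    out["Sex"] = out["Sex"].map({"male": 0, "female": 1})
    out["Embarked"] = out["Embarked"].map({"S": 0, "C": 1, "Q": 2})
    return out


# Hypothetical rows standing in for the real data set.
sample = pd.DataFrame(
    {
        "Survived": [0, 1, 1, 0],
        "Pclass": [3, 1, 3, 2],
        "Sex": ["male", "female", "female", "male"],
        "Age": [22.0, 38.0, None, 35.0],
        "Fare": [7.25, 71.28, 7.92, 13.0],
        "Embarked": ["S", "C", None, "S"],
    }
)
clean = clean_titanic(sample)
print(clean)
```

With the real file in place, `clean_titanic(load_titanic())` would apply the same steps to the full data set, provided the assumed columns are present.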

This data science example was inspired by the following two articles, and many portions of it are derived from them:

"Predicting the Survival of Titanic Passengers" by Niklas Donges
https://towardsdatascience.com/predicting-the-survival-of-titanic-passengers-30870ccc7e8

"Building a Machine Learning Model Step By Step With the Titanic Dataset" by Taha Bilal Uyar
https://medium.com/swlh/building-a-machine-learning-model-step-by-step-with-the-titanic-dataset-e3462d849387

The data pre-processing used here differs from what each author did, so the results do not exactly match those in either article, but they are very close. Following the two articles while reviewing and running this code is a good way to learn data science from these authors.
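To illustrate why a change in pre-processing can shift the results slightly, the sketch below scores the same model twice with different fills for missing ages. It is an illustration only, not the method used in this repository or in either article. The data is synthetic (generated by the hypothetical helper `make_synthetic_passengers`), and the random forest and 5-fold cross-validation are choices made for this sketch, so the printed scores say nothing about the real Titanic results.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score


def make_synthetic_passengers(n: int = 200, seed: int = 0) -> pd.DataFrame:
    """Generate made-up passengers; NOT the real Titanic data."""
    rng = np.random.default_rng(seed)
    sex = rng.integers(0, 2, n)        # 0 = male, 1 = female (assumed coding)
    pclass = rng.integers(1, 4, n)     # ticket class 1, 2 or 3
    age = rng.normal(30, 12, n).clip(1, 80)
    # Survival probability loosely tied to sex and class, for illustration.
    p = 0.2 + 0.5 * sex - 0.08 * (pclass - 1)
    survived = (rng.random(n) < p).astype(int)
    # Blank out roughly 20% of the ages to mimic missing values.
    age[rng.random(n) < 0.2] = np.nan
    return pd.DataFrame(
        {"Survived": survived, "Pclass": pclass, "Sex": sex, "Age": age}
    )


def cv_accuracy(df: pd.DataFrame, age_fill: float) -> float:
    """Mean 5-fold accuracy after filling missing ages with age_fill."""
    X = df[["Pclass", "Sex", "Age"]].copy()
    X["Age"] = X["Age"].fillna(age_fill)
    y = df["Survived"]
    model = RandomForestClassifier(n_estimators=50, random_state=0)
    return cross_val_score(model, X, y, cv=5, scoring="accuracy").mean()


passengers = make_synthetic_passengers()
score_median = cv_accuracy(passengers, passengers["Age"].median())
score_zero = cv_accuracy(passengers, 0.0)
print(f"median-filled Age: {score_median:.3f}")
print(f"zero-filled Age:   {score_zero:.3f}")
```

The model and the folds are identical in both runs, so any gap between the two scores comes from the fill strategy alone. For simplicity the median is computed over all rows before cross-validation; a stricter comparison would compute it inside each training fold.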
