kaggle-titanic

The Challenge

The sinking of the Titanic is one of the most infamous shipwrecks in history. On April 15, 1912, during her maiden voyage, the widely considered “unsinkable” RMS Titanic sank after colliding with an iceberg. Unfortunately, there weren’t enough lifeboats for everyone onboard, resulting in the death of 1502 out of 2224 passengers and crew. While there was some element of luck involved in surviving, it seems some groups of people were more likely to survive than others.

Objective

In this challenge, we ask that I build a predictive model that answers the question: "what kind of person is most likely to survive?" using passenger data (ie name, age, sex, socioeconomic class, etc.).

What Data Will I Use in This Competition?

In this competition, I will use two similar data sets that include passenger information, such as name, age, gender, socioeconomic class, etc. One dataset is titled train.csv and the other is titled test.csv. Train.csv will contain the details of a subset of the passengers on board (891 to be more exact) and, more importantly, will reveal whether they survived or not, also known as the "basic truth". The test.csv dataset contains similar information, but does not reveal a“ basic truth ”for each passenger. Using the patterns you find in the train.csv data, predict whether the other 418 passengers on board (found in test.csv) survived.

Project Motivation

First project developed on the Udacity platform in the Data Scientist Nanodegree program, and also the first challenge proposed on the Kaggle platform.

Libraries required

pandas (data manipulation and analysis), scikit-learn (machine learning library), seaborn and matplotlib (data visualization)

Summary of Results

accuracy of 87%
f1-score of 80% in survivors and 90% in non-survivors

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
README.md		README.md
kaggle_titanic.ipynb		kaggle_titanic.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

kaggle-titanic

The Challenge

Objective

What Data Will I Use in This Competition?

Project Motivation

Libraries required

Summary of Results

About

Releases

Packages

Languages

joonaspp/kaggle-titanic

Folders and files

Latest commit

History

Repository files navigation

kaggle-titanic

The Challenge

Objective

What Data Will I Use in This Competition?

Project Motivation

Libraries required

Summary of Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages