This project aims to predict survival outcomes from the 1912 Titanic disaster with at least 80% accuracy, based on each passenger’s features, such as sex and age.
This project requires Python 2.7 and the following Python libraries installed:
You will also need software installed to run an iPython Notebook.
It is recommended to install Anaconda, a pre-packaged Python distribution that contains all of the necessary libraries and software for this project.
Download the files in this repository and navigate to the directory containing the file 'titanic_survival_exploration.ipynb'. Then open a terminal and run one of the following commands:
jupyter notebook titanic_survival_exploration.ipynb
or
ipython notebook titanic_survival_exploration.ipynb
This will open the iPython Notebook software and project file in your web browser.
The dataset used in this project is included as titanic_data.csv. This dataset is provided by Udacity and contains the following attributes:
survival: Survival (0 = No; 1 = Yes)
pclass: Passenger Class (1 = 1st; 2 = 2nd; 3 = 3rd)
name: Name
sex: Sex
age: Age
sibsp: Number of Siblings/Spouses Aboard
parch: Number of Parents/Children Aboard
ticket: Ticket Number
fare: Passenger Fare
cabin: Cabin
embarked: Port of Embarkation (C = Cherbourg; Q = Queenstown; S = Southampton)
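As a rough illustration of how these attributes can feed a prediction and how accuracy is measured, here is a minimal sketch using only the standard library. It is not the notebook's actual code. The sample rows are made up, the helper names (load_passengers, predict_by_sex, accuracy) are hypothetical, and the column headers are assumed to match the attribute names listed above, which may differ from the headers in the real titanic_data.csv.

```python
from __future__ import print_function  # keeps print() working on Python 2.7

import csv

# Hypothetical sample rows for illustration only. Column names follow the
# attribute list in this README and may differ from the real CSV headers.
SAMPLE_LINES = [
    "survival,pclass,sex,age",
    "0,3,male,22",
    "1,1,female,38",
    "1,3,female,26",
    "0,1,male,54",
    "1,2,male,3",
    "0,2,female,27",
]


def load_passengers(lines):
    """Parse CSV lines (or an open file object) into a list of dicts."""
    return list(csv.DictReader(lines))


def predict_by_sex(passenger):
    """Toy rule: predict survival (1) for women and non-survival (0) for men."""
    return 1 if passenger["sex"] == "female" else 0


def accuracy(passengers, predict):
    """Fraction of passengers whose predicted outcome matches 'survival'."""
    if not passengers:
        raise ValueError("no passengers to score")
    correct = sum(1 for p in passengers if predict(p) == int(p["survival"]))
    return float(correct) / len(passengers)


passengers = load_passengers(SAMPLE_LINES)
score = accuracy(passengers, predict_by_sex)
print("Accuracy on the sample rows: {:.0%}".format(score))
```

To try the same idea on the real data, an open file handle for titanic_data.csv could be passed to load_passengers in place of SAMPLE_LINES, after adjusting the column names to whatever headers the file actually uses.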