Titanic-Survival-Prediction

In this repository, we will go through the whole process of creating a machine learning model on the famous Titanic dataset. We have two .csv files for the training and testing of data so as to make predictions. Initially, we began with analyzing our training data and checking for any missing data and found out which features are the most significant for making better predictions. During this whole process, we used seaborn and matplotlib libraries to perform the visualizations. During the data preprocessing part, we computed missing values, converted features into numeric ones, grouped values into categories and created a few new features. Afterwards we started training 9 different machine learning models, picked one of them (random forest) and applied cross validation on it. Then we discussed how random forest works, took a look at the importance it assigns to the different features and tuned it’s performace through optimizing it’s hyperparameter values. Lastly, we looked at it’s confusion matrix and computed the models precision, recall and f-score.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Predicted_dataset.csv		Predicted_dataset.csv
README.md		README.md
Titanic-Survival-Prediction-Code.py		Titanic-Survival-Prediction-Code.py
Titanic-Survival-Prediction.ipynb		Titanic-Survival-Prediction.ipynb
test.csv		test.csv
train.csv		train.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Titanic-Survival-Prediction

About

Releases

Packages

Languages

RawatMeghna/Titanic-Survival-Prediction

Folders and files

Latest commit

History

Repository files navigation

Titanic-Survival-Prediction

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages