Skip to content

Titanic Survival Prediction Project (93% Accuracy)🛳️ In this notebook, The goal is to correctly predict if someone survived the Titanic shipwreck using different Machine Learning Model & Hyperparameter tunning.

License

Notifications You must be signed in to change notification settings

mrankitgupta/titanic-survival-prediction-93-xgboost

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Titanic - Machine Learning from Disaster | (Accuracy: 93%) XGBoost 🛳️

Titanic Survival Prediction: Machine Learning Model 🛳️

ML Models used: XGBoost, Random Forest, Logistic Regression

In this notebook, The goal is to correctly predict if someone survived the Titanic shipwreck using different Machine Learning Model and Hyperparameter tunning.

Prerequisites:

Data Analyst Roadmap

Python Lessons 📑

Python Libraries for Data Science 🗂️

Overview

  1. Understand the shape of the data (Histograms, box plots, etc.)

  2. Data Cleaning

  3. Data Exploration

  4. Feature Engineering

  5. Data Preprocessing for Model

  6. Basic Model Building

  7. Model Tuning

  8. Ensemble Modle Building

  9. Results

About the Project 🛳️

Competition sites like Kaggle define the problem to solve or questions to ask while providing the datasets for training your data science model and testing the model results against a test dataset. The question or problem definition for Titanic Survival competition is described here at Kaggle.

Titanic ML model- By Ankit Gupta

Knowing from a training set of samples listing passengers who survived or did not survive the Titanic disaster, can our model determine based on a given test dataset not containing the survival information, if these passengers in the test dataset survived or not.

We may also want to develop some early understanding about the domain of our problem. This is described on the Kaggle competition description page here. Here are the highlights to note.

On April 15, 1912, during her maiden voyage, the Titanic sank after colliding with an iceberg, killing 1502 out of 2224 passengers and crew. Translated 32% survival rate. One of the reasons that the shipwreck led to such loss of life was that there were not enough lifeboats for the passengers and crew. Although there was some element of luck involved in surviving the sinking, some groups of people were more likely to survive than others, such as women, children, and the upper-class.

Titanic ML model- By Ankit Gupta

Workflow stages

The competition solution workflow goes through seven stages described in the Data Science Solutions book.

  1. Question or problem definition.
  2. Acquire training and testing data.
  3. Wrangle, prepare, cleanse the data.
  4. Analyze, identify patterns, and explore the data.
  5. Model, predict and solve the problem.
  6. Visualize, report, and present the problem solving steps and final solution.
  7. Supply or submit the results.

Technologies used ⚙️

Python Libraries :

Project - Titanic Survival Prediction: Machine Learning Model 🛳️

Kaggle Project Link: Titanic Survival Prediction 🛳️ 🔗

Datasets

Kaggle Titanic Datasets: Titanic Train & Titanic Test

Titanic ML model- By Ankit Gupta

Related Projects❓ 👨‍💻 🛰️

Spotify Data Analysis using Python 📊

Data Analyst Roadmap

Statistics for Data Science using Python 📊

Sales Insights - Data Analysis using Tableau & SQL 📊

Kaggle - Pandas Solved Exercises 📊

Python Lessons 📑

Python Libraries for Data Science 🗂️

Liked my Contributions❓ Follow Me👉 Kaggle and GitHub

Nominate Me for GitHub Stars ⭐ ✨

For any queries/doubts 🔗 👇

MrAnkitGupta_

MrAnkitGupta MrAnkitGupta_ AnkitGupta MrAnkitGupta mrankitguptaaggle