Predictive-Analytics-using-Supervised-Machine-Learning

Predictive Analytics in Data-Driven Decision Making using Supervised Machine Learning.

Predictive Analysis, Supervised Learning – Titanic

This task is about classifying a large set of data based on a set of pre-classified samples.

The goal is to predict whether a passenger survived the Titanic shipwreck. We will use both a Decision Tree Classifier and a Support Vector Machine and compare their results.

The general steps are data exploration and analysis, data pre-processing and transformation (handling missing values, converting categorical features into numeric, converting discrete features into binary, etc.), and implementing your classifier.
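The pre-processing steps listed above can be sketched with pandas as follows. This is only a minimal illustration, not the notebook's actual code: the toy rows are made-up values in the Titanic column layout, the `preprocess` helper is a hypothetical name, and the imputation choices (median age, most frequent port) are assumptions.

```python
import pandas as pd

# Toy rows in the Titanic column layout (illustrative values, NOT real records).
raw = pd.DataFrame({
    "Pclass": [3, 1, 3, 2],
    "Sex": ["male", "female", "female", "male"],
    "Age": [22.0, 38.0, None, 35.0],
    "Fare": [7.25, 71.28, 7.92, 13.0],
    "Embarked": ["S", "C", None, "S"],
})


def preprocess(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper: impute missing values and encode categoricals."""
    out = df.copy()
    # Handle missing values (assumed strategy: median age, most frequent port).
    out["Age"] = out["Age"].fillna(out["Age"].median())
    out["Embarked"] = out["Embarked"].fillna(out["Embarked"].mode()[0])
    # Convert the categorical Sex feature into a numeric one.
    out["Sex"] = out["Sex"].map({"male": 0, "female": 1})
    # Convert the discrete Embarked feature into binary indicator columns.
    out = pd.get_dummies(out, columns=["Embarked"], dtype=int)
    return out


features = preprocess(raw)
print(features)
```

Other imputation or encoding strategies may suit the real data better; the point is only that every column ends up numeric and free of missing values before it reaches a classifier.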

The classic Titanic dataset provides information on the fate of passengers on the Titanic, summarised according to economic status (class), sex, age, and survival.
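A first exploration pass along these dimensions might look like the sketch below. The rows are invented for illustration and say nothing about the real dataset, and the `Survived` column name is an assumption about how the label is stored.

```python
import pandas as pd

# Toy rows (illustrative values, NOT real records); "Survived" is an assumed label name.
df = pd.DataFrame({
    "Pclass": [1, 1, 2, 3, 3, 3],
    "Sex": ["female", "male", "female", "male", "female", "male"],
    "Age": [29.0, None, 14.0, 40.0, None, 22.0],
    "Survived": [1, 0, 1, 0, 0, 1],
})

# Count missing values per column to see what needs imputing.
missing_per_column = df.isna().sum()

# Share of survivors within each class and each sex.
survival_by_class = df.groupby("Pclass")["Survived"].mean()
survival_by_sex = df.groupby("Sex")["Survived"].mean()

print(missing_per_column)
print(survival_by_class)
print(survival_by_sex)
```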

There are two data files:

• Training set (titanic_train.csv) should be used to build your ML models.
• Test set (titanic_test.csv) should be used to see how well your model performs on unseen data.
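The train/test workflow with both classifiers could be sketched with scikit-learn as below. The `compare_models` helper is a hypothetical name, the hyperparameters are arbitrary, and the data is a synthetic stand-in generated with NumPy rather than the contents of the two CSV files. It also assumes labels are available for the held-out split.

```python
import numpy as np
from sklearn.metrics import accuracy_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier


def compare_models(X_train, y_train, X_test, y_test):
    """Fit both classifiers on the training split and score them on the test split."""
    models = {
        # Hyperparameters here are arbitrary choices for the sketch.
        "decision_tree": DecisionTreeClassifier(max_depth=3, random_state=0),
        # SVMs are sensitive to feature scale, so standardise first.
        "svm": make_pipeline(StandardScaler(), SVC(kernel="rbf")),
    }
    scores = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        scores[name] = accuracy_score(y_test, model.predict(X_test))
    return scores


# Synthetic stand-in for pre-processed features (NOT the real Titanic data).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))
y = (X[:, 0] + X[:, 1] > 0).astype(int)

scores = compare_models(X[:150], y[:150], X[150:], y[150:])
print(scores)
```

Accuracy is used here only because it is the simplest comparison; precision, recall, or a confusion matrix may be more informative if the classes are imbalanced.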

Data Description and Notes:

Pclass: A proxy for Socio-Economic Status (SES).

• 1st = Upper
• 2nd = Middle
• 3rd = Lower

Age: Age in years. It is fractional if less than 1. If the age is estimated, it is in the form of xx.5.

SibSp: The number of siblings/spouses aboard the Titanic. The dataset defines family relations in this way:

• Sibling = brother, sister, stepbrother, stepsister
• Spouse = husband, wife (mistresses and fiancés were ignored)

Parch: The number of parents/children aboard the Titanic. The dataset defines family relations in this way:

• Parent = mother, father
• Child = daughter, son, stepdaughter, stepson
• Some children travelled only with a nanny, therefore Parch = 0 for them.

Embarked: The port of embarkation, C = Cherbourg, Q = Queenstown, S = Southampton.

Ticket: The ticket number.

Fare: The passenger fare.

Cabin: The cabin number.

Main Python libraries to use:

• scikit-learn (a Python library that features various classification, regression, and clustering algorithms) https://scikit-learn.org/stable/

• pandas https://pandas.pydata.org/docs/

• NumPy https://numpy.org/

• Matplotlib https://matplotlib.org/

• seaborn: statistical data visualisation https://seaborn.pydata.org/

Waleed-T/Predictive-Analytics-using-Supervised-Machine-Learning