The Titanic data set is one of the most popular starter data sets in data science. It records various attributes of passengers on the Titanic, including who survived and who didn’t. Here I detected missing values, replaced them, and created new values that were added to the data set. There are two CSV files, fir…

README.md

Cleaning-Titanic-Data

## About the data

The Titanic data set is one of the most popular starter data sets in data science. It records various attributes of passengers on the Titanic, including who survived and who didn’t.

## What did I do

I detected missing values, replaced them, and created new values that were added to the data set.

Figure (rplot): overview of the missing values in the original data set.
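As a rough illustration of the three steps (detect, replace, create), here is a minimal base R sketch on a toy data frame. It is not the code from 'titanic.r': the column names (`age`, `embarked`, `cabin`), the fill rules (mean age, port "S"), and the new `has_cabin_number` column are all assumptions chosen for the example.

```r
# Toy data frame standing in for the original data; the column names
# are assumptions, not taken from the repository.
passengers <- data.frame(
  age      = c(22, NA, 38, NA, 35),
  embarked = c("S", "", "C", "S", NA),
  cabin    = c("C85", "", "", "E46", NA),
  stringsAsFactors = FALSE
)

# Detect: treat empty strings in the text columns as missing,
# then count missing values per column.
for (col in c("embarked", "cabin")) {
  blank <- passengers[[col]] %in% ""
  passengers[[col]][blank] <- NA
}
missing_counts <- colSums(is.na(passengers))
print(missing_counts)

# Replace: fill missing ages with the mean of the known ages
# (an assumed rule; a median or a model would work as well).
passengers$age[is.na(passengers$age)] <- mean(passengers$age, na.rm = TRUE)

# Replace: fill missing ports with an assumed default value.
passengers$embarked[is.na(passengers$embarked)] <- "S"

# Create: add a 0/1 flag recording whether a cabin number is known.
passengers$has_cabin_number <- as.integer(!is.na(passengers$cabin))
print(passengers)
```

Counting the missing values before filling them keeps a record of how much of each column was imputed, which is useful when judging how far to trust the cleaned values.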

## The output

There are two CSV files: the first is titanic_original.csv and the second is tatanic_clean.csv. The second file is generated by the R code in 'titanic.r'. Have fun.

Figure (titanic data): comparison between titanic_original.csv and tatanic_clean.csv.
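One way to make such a comparison in R is to read both files and tabulate the missing values per column side by side. The sketch below is only an assumption about how this could be done: it writes two tiny stand-in files to a temporary directory (reusing the repository's file names) instead of reading the real data, and its columns `age` and `embarked` are hypothetical.

```r
# Tiny stand-ins for the two files; contents and column names are made up.
original <- data.frame(age = c(22, NA, 38), embarked = c("S", NA, "C"),
                       stringsAsFactors = FALSE)
clean <- data.frame(age = c(22, 30, 38), embarked = c("S", "S", "C"),
                    stringsAsFactors = FALSE)

# Write them to a temporary directory so nothing in the project is touched.
original_path <- file.path(tempdir(), "titanic_original.csv")
clean_path    <- file.path(tempdir(), "tatanic_clean.csv")
write.csv(original, original_path, row.names = FALSE)
write.csv(clean, clean_path, row.names = FALSE)

# Read both files back; read.csv treats the string NA as missing by default.
original_in <- read.csv(original_path, stringsAsFactors = FALSE)
clean_in    <- read.csv(clean_path, stringsAsFactors = FALSE)

# Tabulate the number of missing values per column in each file.
comparison <- data.frame(
  column           = names(original_in),
  missing_original = colSums(is.na(original_in)),
  missing_clean    = colSums(is.na(clean_in)),
  row.names        = NULL,
  stringsAsFactors = FALSE
)
print(comparison)
```

With the real files, the two paths would point at the CSVs in the repository, and the table would show which columns the cleaning step changed.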
