Skip to content
This is the repo for the Towards Data Science article, "The complete beginner's guide to data cleaning and preprocessing."
Jupyter Notebook
Branch: master
Clone or download
Latest commit b1d4e0f Feb 18, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.ipynb_checkpoints update readme Feb 17, 2019
README.md update readme Feb 17, 2019
data_cleaning_and_preprocessing.ipynb update readme Feb 17, 2019
my_data.csv initial commit Feb 17, 2019
pixabay_DigiPD_2.jpg update readme Feb 17, 2019

README.md

basic_data_preprocessing

How to use Python, Pandas, Numpy, and Scikit-Learn to do some basic data cleaning and preprocessing

This is the repo for the Towards Data Science article, The complete beginner's guide to data cleaning and preprocessing

You can find the article here

basic data cleaning and preprocessing

From the article

Data preprocessing is the first (and arguably most important) step toward building a working machine learning model. It’s critical!

If your data hasn’t been cleaned and preprocessed, your model does not work.

It’s that simple.

Data preprocessing is generally thought of as the boring part. But it’s the difference between being prepared and being completely unprepared. It’s the difference between looking like a pro and looking pretty foolish.

It’s kind of like getting ready for a vacation. You might not like the preparation part, but tightening down the details in advance can save you from one nightmare of a trip.

You just have to do it or you can’t start having fun.

Read the rest of the article here

You can’t perform that action at this time.