Skip to content

anjanibhattar/Data-Cleaning-using-Pandas

Repository files navigation

Data-Cleaning-using-Pandas

In my previous project, I obtained data by Web Scraping a Wikipedia page using the Beautiful Soup library in Python. The code for this is available in my repository https://github.com/anjanibhattar/Web-Scraping-Wiki-using-BS

Now, I have tried my hand at Data Cleaning/Wrangling using the Pandas library in Python. The data contains 549 rows and 13 columns. Since the data is obtained by scraping a HTML page, there were quite a few data cells that included HTML tags and data-format issues.

I started off by examining the data type of each column. Using Excel, I analysed the various data anomalies that were present in each column and included code to exclude (or) drop (or) transform data as required.

The final cleaned data is exported as an Excel(.xlsx) file for analysis and visualization.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published