Data Wrangling Project

This data analytics project uses Python 3 with NumPy, pandas, Matplotlib, os, Beautiful Soup, Tweepy, and Requests in a Jupyter Notebook to analyze data from the well-known Twitter account We Rate Dogs. The focus is data wrangling: the project works systematically through its three steps, gathering, assessing, and cleaning the data.

Required Software:

Jupyter Notebook

NumPy

pandas

Matplotlib

os (Python standard library)

Beautiful Soup

Tweepy

Requests

Data Analysis Outline:

Data wrangling is the first stage of the analysis, and its first step is gathering data. In this step, pandas, Requests, NumPy, Beautiful Soup, Tweepy, and os were used to gather and read data from three different sources.
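As a rough illustration of reading from three differently formatted sources, the sketch below loads a comma-separated table, a tab-separated table, and a file of line-delimited JSON into pandas. The in-memory sample data and all column names (`tweet_id`, `text`, `prediction`, `retweet_count`, `favorite_count`) are assumptions made for this example, and the network calls through Requests and Tweepy are left out so that it runs offline.

```python
import io
import json

import pandas as pd

# Hypothetical stand-ins for the three sources; the real files and their
# columns are not described in this README.
archive_csv = io.StringIO(
    "tweet_id,text\n"
    "1,This is Bella. 12/10\n"
    "2,Meet Max. 11/10\n"
)
predictions_tsv = io.StringIO(
    "tweet_id\tprediction\n"
    "1\tgolden_retriever\n"
    "2\tpug\n"
)
tweet_json_lines = io.StringIO(
    '{"id": 1, "retweet_count": 5, "favorite_count": 20}\n'
    '{"id": 2, "retweet_count": 3, "favorite_count": 9}\n'
)

# Source 1: a comma-separated file read directly with pandas.
archive = pd.read_csv(archive_csv)

# Source 2: a tab-separated file; only the separator differs.
predictions = pd.read_csv(predictions_tsv, sep="\t")

# Source 3: one JSON object per line, parsed into a list of records.
records = [json.loads(line) for line in tweet_json_lines if line.strip()]
tweet_counts = pd.DataFrame(records).rename(columns={"id": "tweet_id"})

print(archive.shape, predictions.shape, tweet_counts.shape)
```

With real files, the `io.StringIO` objects would be replaced by file paths passed to `pd.read_csv` and `open`.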

The second step is assessment, in which both visual and programmatic methods are used to identify the data quality and tidiness issues that must be addressed before analysis.
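A minimal sketch of what visual and programmatic assessment can look like in pandas follows. The sample table, its column names, and the rating threshold of 20 are hypothetical and chosen only to make each kind of issue show up; they are not taken from the project's notebook.

```python
import pandas as pd

# Hypothetical sample with deliberate problems to detect.
df = pd.DataFrame(
    {
        "tweet_id": [1, 2, 2, 4],
        "name": ["Bella", "a", "a", None],
        "rating_numerator": [12, 11, 11, 1776],
    }
)

# Visual assessment: look at the rows directly.
print(df.head())

# Programmatic assessment: summarize each kind of issue with a number.
missing_per_column = df.isna().sum()  # missing values per column
duplicate_ids = int(df["tweet_id"].duplicated().sum())  # repeated tweet ids
suspicious_ratings = df[df["rating_numerator"] > 20]  # threshold is an assumption

print(missing_per_column)
print(duplicate_ids)
print(suspicious_ratings)
```

Recording each finding as either a quality issue (wrong or missing content) or a tidiness issue (structural problems) makes the cleaning step easier to plan.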

The third and last step is cleaning, in which several pandas methods are used to resolve the quality and tidiness issues detected during assessment.
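The cleaning step might be sketched as below, using a handful of common pandas methods. The tables, column names, and the choice to treat the name "a" as a placeholder are assumptions for illustration, not a record of the fixes actually applied in the notebook.

```python
import pandas as pd

# Hypothetical raw tables containing the issues to be fixed.
archive = pd.DataFrame(
    {
        "tweet_id": [1, 2, 2, 3],
        "timestamp": [
            "2017-01-01 10:00:00",
            "2017-01-02 11:30:00",
            "2017-01-02 11:30:00",
            "2017-01-03 09:15:00",
        ],
        "name": ["Bella", "a", "a", "Max"],
    }
)
counts = pd.DataFrame({"tweet_id": [1, 2, 3], "favorite_count": [20, 9, 14]})

# Work on a copy so the raw data stays available for comparison.
clean = archive.copy()

# Quality: remove repeated tweets.
clean = clean.drop_duplicates(subset="tweet_id").reset_index(drop=True)

# Quality: store timestamps as datetimes rather than strings.
clean["timestamp"] = pd.to_datetime(clean["timestamp"])

# Quality: treat the placeholder name "a" as missing.
clean["name"] = clean["name"].mask(clean["name"] == "a")

# Tidiness: combine the per-tweet tables into a single table.
master = clean.merge(counts, on="tweet_id", how="inner")

print(master)
```

Re-running the assessment checks on the cleaned table is a simple way to confirm that each fix took effect.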

The last part of the data analytics process is data visualization and analysis. In this section, I used Matplotlib to create visualizations and show the results of my analysis.
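One kind of plot that fits this stage is a scatter plot comparing two numeric columns. The sketch below is only an example: the data, the column names `retweet_count` and `favorite_count`, and the choice of chart are assumptions, not the visualizations from the notebook.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display

import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical cleaned data.
master = pd.DataFrame(
    {
        "retweet_count": [5, 3, 8, 1],
        "favorite_count": [20, 9, 30, 4],
    }
)

# Scatter plot of the two engagement measures.
fig, ax = plt.subplots()
ax.scatter(master["retweet_count"], master["favorite_count"])
ax.set_xlabel("Retweets")
ax.set_ylabel("Favorites")
ax.set_title("Retweets vs. favorites")

# A single number to accompany the plot.
correlation = master["retweet_count"].corr(master["favorite_count"])
print(correlation)
```

Inside a Jupyter Notebook the `matplotlib.use("Agg")` line can be dropped, since the notebook displays the figure inline.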
