Wrangle-Project---WeRate-Dogs

Project Overview

Introduction

The objective of this data wrangling project is to gather data from a variety of sources and in a variety of formats, assess its quality and tidiness, and then clean it. I used the Python programming language and several Python libraries, which are listed under Tasks below.

Dataset Overview

The datasets used for this project come from the WeRateDogs Twitter account. I also performed some data analysis and visualization on the data. WeRateDogs is a Twitter account that rates people's dogs with a humorous comment about the dog. These ratings almost always have a denominator of 10. The numerators, though? Almost always greater than 10: 11/10, 12/10, 13/10, etc. Why? Because "they're good dogs Brent." WeRateDogs has over 4 million followers and has received international media coverage.

Tasks

In the course of this project, I gathered, assessed, and cleaned the datasets. This section is divided into:

Gathering of data
Assess data
Cleaning of data

Gathering of data - In this step, I gathered three datasets from different sources: the WeRateDogs Twitter archive (a CSV file), the tweet image predictions (a TSV file), and additional tweet data from the Twitter API.
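The gathering step can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the function name load_sources is hypothetical, and it assumes that the API data is stored as one JSON object per line with the fields id, retweet_count, and favorite_count.

```python
import json

import pandas as pd


def load_sources(archive_csv, predictions_tsv, tweet_json_txt):
    """Load the three raw sources into separate DataFrames.

    Hypothetical helper: the file layout and JSON field names are assumptions.
    """
    # The Twitter archive is a comma-separated file.
    archive = pd.read_csv(archive_csv)

    # The image predictions are tab-separated, so the separator is set explicitly.
    predictions = pd.read_csv(predictions_tsv, sep="\t")

    # Assumption: the API dump holds one JSON object per line.
    records = []
    with open(tweet_json_txt, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            tweet = json.loads(line)
            records.append(
                {
                    "tweet_id": tweet["id"],
                    "retweet_count": tweet.get("retweet_count"),
                    "favorite_count": tweet.get("favorite_count"),
                }
            )
    api_data = pd.DataFrame(
        records, columns=["tweet_id", "retweet_count", "favorite_count"]
    )
    return archive, predictions, api_data
```

With the files in this repository, a call might look like load_sources("twitter-archive-enhanced.csv", "image-predictions.tsv", "tweet_json.txt"). Keeping the three sources in separate DataFrames at this stage makes it easier to assess each one on its own before merging.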

Assess data - After gathering the data from their sources, I assessed it both visually and programmatically to identify quality and tidiness issues.
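A programmatic assessment could look something like the sketch below. The helper name assess, the sample rows, and the column names (tweet_id, rating_numerator, rating_denominator, name) are assumptions made for illustration; the real checks live in the notebook.

```python
import pandas as pd


def assess(df, id_column="tweet_id"):
    """Summarize common quality issues in a DataFrame (hypothetical helper)."""
    return {
        "rows": len(df),
        # Missing values per column point at completeness problems.
        "missing_per_column": df.isna().sum().to_dict(),
        # Repeated identifiers point at duplicated observations.
        "duplicate_ids": int(df[id_column].duplicated().sum()),
        # Data types reveal columns stored in an unsuitable type.
        "dtypes": df.dtypes.astype(str).to_dict(),
    }


# Made-up sample rows, used only to demonstrate the checks.
sample = pd.DataFrame(
    {
        "tweet_id": [1, 2, 2, 3],
        "rating_numerator": [12, 13, 13, 11],
        "rating_denominator": [10, 10, 10, 7],
        "name": ["Rex", None, None, "a"],
    }
)

report = assess(sample)

# Ratings whose denominator is not 10 are worth inspecting by hand.
unusual = sample[sample["rating_denominator"] != 10]
```

Visual assessment (scrolling through the data in a spreadsheet or notebook) complements these checks, since issues such as implausible dog names are easier to spot by eye.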

Cleaning of data - Having assessed the data visually and programmatically, I then cleaned it using the Python programming language and some libraries. The libraries I used are:

pandas
NumPy
Matplotlib (pyplot)
Seaborn
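As a rough sketch of what a cleaning step with pandas might involve, the example below works on a copy, removes duplicate tweets, fixes a data type, replaces a placeholder value, and merges two tables. The function name clean_and_merge, the sample rows, and the column names (tweet_id, timestamp, name, favorite_count) are assumptions for illustration, not the notebook's actual code.

```python
import pandas as pd


def clean_and_merge(archive, api_data):
    """Clean a copy of the archive and join it with the API data (hypothetical)."""
    # Work on a copy so the raw data stays available for comparison.
    clean = archive.copy()

    # Quality: keep one row per tweet.
    clean = clean.drop_duplicates(subset="tweet_id")

    # Quality: store timestamps as datetimes rather than strings.
    clean["timestamp"] = pd.to_datetime(clean["timestamp"], utc=True)

    # Quality: treat the placeholder string "None" as a missing name.
    clean["name"] = clean["name"].mask(clean["name"] == "None")

    # Tidiness: one table per observational unit, joined on the tweet id.
    return clean.merge(api_data, on="tweet_id", how="inner")


# Made-up sample rows, used only to demonstrate the function.
archive = pd.DataFrame(
    {
        "tweet_id": [1, 2, 2, 3],
        "timestamp": ["2017-08-01 16:23:56 +0000"] * 4,
        "name": ["Rex", "None", "None", "Bella"],
    }
)
api_data = pd.DataFrame({"tweet_id": [1, 2], "favorite_count": [100, 200]})

master = clean_and_merge(archive, api_data)
```

A merged table like this could then be saved with master.to_csv(..., index=False); the twitter_archive_master.csv file in this repository is presumably the result of a similar step.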

Folders and files

Act_report.html
Act_report.pdf
README.md
Wrangle_act.ipynb
Wrangle_report.ipynb
image-predictions.tsv
tweet_json.txt
twitter-archive-enhanced.csv
twitter_archive_master.csv
