Data-wrangling-project

We Rate Dogs Data Wrangling

This is a data wrangling project.

In this project, I followed the three data wrangling steps:

Gather
Assess
Clean

Gather.

For gathering, I used the WeRateDogs Twitter account, which posts dogs and rates them with funny comments. This required the Twitter API, accessed through the tweepy library. All of this data was extracted (gathered) into a dataframe.
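As a rough illustration of the last step, the sketch below turns tweets saved as one JSON object per line (for example in a file such as tweet_json.txt) into flat records. It is not the notebook's actual code: the function name `parse_tweet_lines` is hypothetical, and the choice of the fields `id`, `retweet_count` and `favorite_count` is an assumption about what was kept.

```python
import json


def parse_tweet_lines(lines):
    """Turn line-delimited tweet JSON into a list of flat records.

    Hypothetical helper: the selected fields are an assumption,
    not necessarily the ones used in the project notebook.
    """
    records = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines between tweets
        tweet = json.loads(line)
        records.append({
            "tweet_id": tweet["id"],
            "retweet_count": tweet.get("retweet_count"),
            "favorite_count": tweet.get("favorite_count"),
        })
    return records


# Made-up sample lines standing in for the saved API responses.
sample = [
    '{"id": 1, "retweet_count": 10, "favorite_count": 25}',
    '',
    '{"id": 2, "retweet_count": 3, "favorite_count": 7}',
]
records = parse_tweet_lines(sample)
```

A list of records like this can then be passed to `pandas.DataFrame(records)` to obtain the dataframe.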

I also used the image predictions TSV file, which holds machine learning predictions for the dogs posted. It has three predictions per image, one of them with higher certainty than the others. This file was downloaded programmatically.
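To make the idea of "three predictions, one of higher certainty" concrete, here is a minimal sketch that picks the most confident prediction from each row of a tab-separated file. The column names (`p1`, `p1_conf`, and so on) and the sample row are assumptions made for illustration, and `best_prediction` is a hypothetical helper.

```python
import csv
import io


def best_prediction(row):
    """Return (label, confidence) for the most confident of three predictions.

    Assumes columns named p1/p1_conf, p2/p2_conf, p3/p3_conf.
    """
    candidates = [(float(row[f"p{i}_conf"]), row[f"p{i}"]) for i in (1, 2, 3)]
    conf, label = max(candidates)  # highest confidence wins
    return label, conf


# Made-up sample standing in for the downloaded TSV file.
sample_tsv = (
    "tweet_id\tp1\tp1_conf\tp2\tp2_conf\tp3\tp3_conf\n"
    "1\tpug\t0.80\tbull_mastiff\t0.15\tboxer\t0.05\n"
)
reader = csv.DictReader(io.StringIO(sample_tsv), delimiter="\t")
results = {row["tweet_id"]: best_prediction(row) for row in reader}
```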

The third source was the twitter-archive-enhanced data frame, which has more details of the WeRateDogs account, including the stages of the dogs and the text of the tweets.

The Assess and Clean.

This covers the whole assessment and cleaning process for the three datasets, including checks for data quality and for tidiness. More details are summed up in the wrangling_report.pdf file. Two methods were used: the visual method, with a spreadsheet, and the programmatic method, with Python. Each issue was handled with the Question, Code and Observation method.
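A programmatic quality check in the Question, Code and Observation style might look like the sketch below. It counts missing values per column and finds repeated ids. The helper `assess`, the `name` column and the sample records are all hypothetical and only illustrate the approach.

```python
from collections import Counter


def assess(records, key):
    """Return missing-value counts per column and the duplicated key values."""
    missing = Counter()
    for rec in records:
        for col, val in rec.items():
            if val is None or val == "":
                missing[col] += 1
    key_counts = Counter(rec[key] for rec in records)
    duplicates = sorted(k for k, n in key_counts.items() if n > 1)
    return dict(missing), duplicates


# Question: are there missing names or repeated tweet ids?
sample = [
    {"tweet_id": 1, "name": "Rex"},
    {"tweet_id": 2, "name": None},
    {"tweet_id": 2, "name": "Bo"},
]

# Code: run the check on the (made-up) sample records.
missing, duplicates = assess(sample, "tweet_id")

# Observation: `missing` and `duplicates` list what the Clean step must fix.
```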

A peep:

This continued until I had clean data ready for exploration and visualization.

The visualization report is in the act_report.pdf file, and a peep of it is shown below:


Thanks for your time
