A data wrangling and cleaning project that gathers data from multiple sources, including querying Twitter's API with tweepy
Libraries used:
- NumPy
- pandas
- Matplotlib
- Seaborn
- tweepy
- json
- timeit
For this project, three datasets were used. Two of them were provided directly, while the third required querying Twitter's API and writing the returned data to a .txt file. The datasets are as follows:
- WeRateDogs Twitter Archive Data: contains tweet-level information such as tweet ID, timestamp, rating numerator, rating denominator, dog name, etc.
- Tab Separated Values (TSV) file: contains image data that is filtered to identify which pictures show dogs.
- Twitter API data: additional tweet data obtained by querying Twitter's API and writing each JSON response to a .txt file.
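The API gathering step above produces a line-delimited .txt file with one tweet's JSON per line, which is then read back into a DataFrame. Below is a minimal sketch of that read-back step; the sample lines and the field names kept (`id`, `retweet_count`, `favorite_count`) are assumptions for illustration, not the project's exact file contents.

```python
import io
import json

import pandas as pd

# Hypothetical sample of the line-delimited JSON file written while
# querying the API; the real file holds one full tweet object per line.
sample_file = io.StringIO(
    '{"id": 101, "retweet_count": 8853, "favorite_count": 39467}\n'
    '{"id": 102, "retweet_count": 6514, "favorite_count": 33819}\n'
)

# Parse each line into a dict, keeping only the fields the analysis needs.
rows = []
for line in sample_file:
    tweet = json.loads(line)
    rows.append({
        "tweet_id": tweet["id"],
        "retweet_count": tweet["retweet_count"],
        "favorite_count": tweet["favorite_count"],
    })

api_df = pd.DataFrame(rows)
print(api_df.shape)  # (2, 3)
```

Reading line by line with `json.loads` keeps memory use low and tolerates a partially written file better than loading the whole file as a single JSON array.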
- The top dog breeds posted on WeRateDogs are:
- Labrador Retriever
- French Bulldog
- Chihuahua
- Pembroke
- Eskimo Dog
- The most commonly used device for tweeting is an iPhone.
- December has the highest tweet rate, followed by November.
- The day of the week has no significant effect on the tweet rate.
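The device insight above can be derived from the archive's `source` column, which stores the tweeting client as an HTML anchor tag. A minimal sketch, using hypothetical rows (the real archive has thousands):

```python
import pandas as pd

# Hypothetical rows mirroring the archive's `source` column.
archive = pd.DataFrame({
    "source": [
        '<a href="http://twitter.com/download/iphone" rel="nofollow">Twitter for iPhone</a>',
        '<a href="http://twitter.com/download/iphone" rel="nofollow">Twitter for iPhone</a>',
        '<a href="http://twitter.com" rel="nofollow">Twitter Web Client</a>',
    ],
})

# Strip the anchor tag, keeping only the visible client name, then count.
archive["device"] = archive["source"].str.extract(r">([^<]+)<")
counts = archive["device"].value_counts()
print(counts.idxmax())  # Twitter for iPhone
```

`str.extract` with a single capture group returns the text between the tag's `>` and `</a>`, which is enough here without pulling in a full HTML parser.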
This project consists of two reports:
- Wrangle Report: This report provides detailed information about the data wrangling efforts undertaken during the project. It is framed as an internal document.
- Act Report: This report communicates the insights and visualizations derived from the wrangled data. It is framed as an external document, similar to a blog post or a magazine article.
By presenting the findings in these two reports, the audience will gain a comprehensive understanding of the data wrangling process and the insights obtained from the analysis.