Skip to content
/ data_analysis Public template

Collection of data analysis notebook projects completed using python and other tools

Notifications You must be signed in to change notification settings

drusho/data_analysis

Repository files navigation

Twitter profile link buttonLinkedin profile link buttonTableau profile link buttonGithub profile link button

Data Analysis Notebook Projects



1. Google Trends for Mothers Day

Goal of this noteboook is to explore Google Trends for topics and keywords related to Mother's Day. Other holidays such as Father's Day and Valentines Day were also compared in the analysis.

Interesting Findings

Valentine's Day does not show the same popularity for Google Trends search as Mother's Day. There has been a downward trend in popularity for the words Valentines Day.


Top result for Mothers Day search was Mother's Daughter a Song by Miley Cyrus. This probably occured because the Miley Cyrus was a feature singer on Saturday Night Live the day before Mother's Day.




2. HackerNews Top Post Trends

Data Analysis and Visualization of a dataset taken from Kaggle. The dataset contains information of all posts from Sep. 26, 2016. The goals of the project was to practice cleaning and organizing data using pandas. The data was analyzed for correlations and as well as find out what were average days/times for popular posts. Seaborn was used to visualize all data.



3. Exploring Ebay Car Sales

Explores csv data from used cars on eBay Kleinanzeigen, a classifieds section of the German eBay website.

Project goals were to explore how to clean and analyze data using pandas, seaboard, and numpy.



4. Employee Exit Surveys

Practicing cleaning data taken from the Queensland Government. Employments status of new and temporary was analyzed along with most popular job titles.



5. Practicing SQL using CIA Factbook Data



6. Visualizing The Gender Gap In College Degrees

The data set comes from Randal Olson that cleaned data which was obtained from the Department of Education Statistics. This data contains the percentage of bachelor's degrees that were granted to women from 1972 to 2012.

The focus of the data set is to grain a clearer understanding of the gender gap in STEM fields.



7. Recent Grads

This dataset focuses on the job outcomes of students who graduated from college between 2010 and 2012. The raw csv data comes from FiveThirtyEight's github. They used this raw data to help write the article: The Economic Guide to Picking a College Major



8. Fandango Movie Reviews

The purpose of the notebook to review research orginally performed in 2015 by fivethirtyeight over potiental bias in movie reviews by Fandango.

The data used in the notebook was taken from Fivethirtyeight's Github.



9.Exploring Hacker News Posts

This notebook compared two types of posts on Hackers Newsto determine the following:

  • Do Ask HN or Show HN receive more comments on average?

  • Do posts created at a certain time receive more comments on average?

About

Collection of data analysis notebook projects completed using python and other tools

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published