Skip to content

latyanvikas/data_science_blogpost

Repository files navigation

Table of Contents

  1. Installation
  2. Project Motivation
  3. File Descriptions
  4. Results
  5. Licensing, Authors, and Acknowledgements

Installation

I was unbale to install anconda3, I installed anaconda2 and created an environment using python=3.5. All the packages used are part of standard installation of anaconda3.

Project Motivation

For this project, I was interestested in using Stack Overflow data from 2018-2020 to answer the following questions

  1. Most popular languages in india across years?
  2. Which is the most lucrative programming languages people want to puruse in India?
  3. How do men and women differ in terms of choice of popular languages being used in India as per 2020 respondents?
  4. Finally, the salary comparison between Men and Women in India?

The full set of files related to this course are owned by Udacity, so they are not publicly available here. However, you can see pieces of the analysis here. This README also serves as a template for students to follow in creating their own project README files.

File Descriptions

Data for the year 2018, 2019 & 2020 can be downloaded form here --> https://insights.stackoverflow.com/survey

You will need to create a folder with name data and rename the survey files for each year as survey_result_public_{year}.csv. for example : 2018 file will be renamed as survey_result_public_2018.csv and so on.

Notebooks available here showcase the analysis of the above questions.

Results

The main findings of the code can be found at the post available here.

Licensing, Authors, Acknowledgements

Must give credit to Stack Overflow for the data. You can find the Licensing for the data and other descriptive information at the Kaggle link available here. Otherwise, feel free to use the code here as you would like!

About

My Data Science Project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published