Analyses of online developer community surveys for the UNU-CS Gender Tech Lab
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
developer-surveys
preprocessing
LICENSE
README.md

README.md

Gender Gap in Online Developer Communities

Overview

Online developer communities boast millions of users - over 29 million on GitHub and over 8 million on Stack Overflow, in 2018. Participation in these communities is becoming one of the primary ways software developers learn new programming languages, improve their skills, develop collaborative projects, and find new job opportunities. (David and Shapiro, 2008; Ford et al., 2016; Vasilescu et al., 2015)

Developers on these sites may ask and answer coding questions to improve their skills (e.g. Stack Overflow), use those skills to contribute to open-source code (e.g. GitHub) and participate in coding challenges (e.g. HackerRank). These platforms are becoming increasingly important to hiring decisions, as recruiters look at GitHub contributions or reputation on Stack Overflow as indicators of developers' skill.

However, despite the promise for online software developer communities to support software developers in their professional development, there are indicators that there may be serious difference in women and men's* participation in these communities - differences which may further exacerbate existing gender gaps in the global ICT workforce.

* See the UNU-CS EQUALS project page for more detail on the nature of the gender analyses used by the UN and the EQUALS project for the purposes of this research.

Research questions

To understand the extent and nature of the gender gap in online software developer communities, we ask the following research questions:

  1. How do male and female developers differ in their participation in online software developer communities?
  2. How do male and female developers differ in their perceptions of belonging and kinship in online software developer communities?
  3. How do male and female developers in online software developer communities differ in their employment and prior experience with coding?

Data

We use publicly available survey data from 3 major online developer communities:

  • Stack Overflow survey (download latest survey results here)
  • GitHub survey (download latest survey results here)
  • HackerRank survey (download latest survey results here)

Files

  • Statistical analyses of survey data (Chi-squares, log-linear models, etc)
  • Visualizations of descriptive statistics of survey data (bar charts, cross-tabulated heatmaps, etc)
  • Re-usable data cleaning scripts (cleaning country names, etc)

Usage

  • To view the results of the analyses, clone or download this repository using the green button, then open the .html files in your browser.
  • To modify or re-run the code, some basic familiarity with Python and the Jupyter development environment may be necessary. See a tutorial here for assistance beginning to work with Jupyter.
  • Re-usable data cleaning scripts are located in the preprocessing folder
  • The primary analysis results are in the developer-surveys folder. The developer_survey_analyses script is the main analysis file, with additional visualizations in the developer_survey_visualizations file.

Authors