A supervised learning project using eXtreme Gradient Boosting Trees. This project creates a predictive model for understanding the dynamics of gender in intro CS at Berkeley for years 2014 through 2015.
Jupyter Notebook TeX
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.
report docs: update Nov 18, 2016


Cultural Analytics

Identify Factors that Predict Intro CS Experience Based on Gender.

A Supervised Learning Project.

This project adds a predictive model for understanding the dynamics of gender in intro CS at Berkeley for years 2014 through 2015. This work builds on previous research done in fulfillment of a Computer Science Education Ph.D., HipHopathy, A Socio-Curricular Study of Introductory Computer Science.

I did a mixed-methods formative research that answers the question, “What are the socio-curricular factors that lead historically underrepresented students to choose CS?”

I broke down the problem into multiple well-defined components, each one solving a specific aspect of the problem. The projects generated can be found in the following repositories:


An analysis of this project can be found here report/CSExperience.pdf
The technical implementation can be found in:


The dataset used in this project not available for mass consumption, as it contains sensitive, personally identifiable student data. To generate this dataset, I created a survey that includes the following attributes for each data point based on a Likert scale of 1 to 5, 1 for strongly disagree and 5 corresponding to strongly agree. Some items have yes and no answers.

Self-reported attitudes about CS

  • atcs_1 I like to use computer science to solve problems.
  • atcs_2 I can learn to understand computing concepts.
  • atcs_3 I can achieve good grades (C or better) in computing courses.
  • atcs_4 I do not like using computer science to solve problems.
  • atcs_5 I am confident that I can solve problems by using computation
  • atcs_6 The challenge of solving problems using computer science appeals to me.
  • atcs_7 I am comfortable with learning computing concepts.
  • atcs_8 I am confident about my abilities with regards to computer science.
  • atcs_9 I do think I can learn to understand computing concepts.

Gendered belief about CS ability

  • atcsgender_1 Women are less capable of success in CS than men.
  • atcsgender_2 Women are smarter than men.
  • atcsgender_3 Men have better math and science abilities than women.

Career driven beliefs about CS

  • atcsjob_1 Knowledge of computing will allow me to secure a good job.
  • atcsjob_2 My career goals do not require that I learn computing skills.

Self-reported attitudes about computational thinking

  • atct_1 I am good at solving a problem by thinking about similar problems I’ve solved before.
  • atct_2 I have good research skills.
  • atct_3 I am good at using online search tools.
  • atct_4 I am persistent at solving puzzles or logic problems.
  • atct_5 I know how to write computer programs
  • atct_6 I am good at building things.
  • atct_7 I’m good at ignoring irrelevant details to solve a problem.
  • atct_8 I know how to write a computer program to solve a problem.

Self-reported attitudes about CS class belonging

  • blg_1 In this class, I feel I belong.
  • blg_2 In this class, I feel awkward and out of place.
  • blg_3 In this class, I feel like my ideas count.
  • blg_4 In this class, I feel like I matter.

Self-reported beliefs about collegiality

  • clet_1 I work well in teams.
  • clet_2 I think about the ethical, legal, and social implications of computing.
  • cltrcmp_1 I am comfortable interacting with peers from different backgrounds than my own (based on race, sexuality, income, and so on.)
  • cltrcmp_2 I have good cultural competence, or the ability to interact effectively with people from diverse backgrounds.


  • gender Could I please know your gender

CS mentors and role models

  • mtr_1 Before I came to UC Berkeley, I knew people who have careers in Computer Science.
  • mtr_2 There are people with careers in Computer Science who look like me.
  • mtr_3 I have role models within the Computer Science field that look like me.

Prior collegiate CS exposure

  • prcs_1 Did you take a CS course in High School?
  • prcs_2 Did you have any exposure to Computer Science before UC Berkeley?
  • prcs_3 Did a family member introduce you to Computer Science?
  • prcs_4 Did you have a close family member who is a Computer Scientist or is affiliated with computing industry?
  • prcs_5 Did your high school offer AP CS?


The contents of this repository are subjected to the GNU GENERAL PUBLIC LICENSE.