Exploring enrollment data from the Chicago Public Schools
Branch: master
Clone or download
cymack Some basic facts comparing first year (2006) and current year (2018) …
…enrollments. no graphs or finished tables yet, just output.
Latest commit 75f701c Feb 22, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
CPSenrollment_files/figure-markdown_github added data_2018.R protocol for first time; updated CPSenrollment and … Feb 8, 2019
school_maps_files/figure-markdown_github removed command echos; added note about plotly labels Jan 14, 2019
CPSenrollment.Rmd added data_2018.R protocol for first time; updated CPSenrollment and … Feb 8, 2019
CPSenrollment.md added data_2018.R protocol for first time; updated CPSenrollment and … Feb 8, 2019
README.Rmd added data_2018.R protocol for first time; updated CPSenrollment and … Feb 8, 2019
README.md added data_2018.R protocol for first time; updated CPSenrollment and … Feb 8, 2019
clean.R uploaded clean.R for first time Jan 14, 2019
data.R uploading the protocol used to form the main enrollment data set Jan 14, 2019
data_2018.R fixed strange git push stuff Feb 13, 2019
enrollment_all_hs.Rds Added data for 2018-2019 school year to enrollment_all sets; noted ne… Feb 7, 2019
enrollment_all_hs.csv Added data for 2018-2019 school year to enrollment_all sets; noted ne… Feb 7, 2019
explore.R uploaded explore.R for the first time Jan 14, 2019
first+this_raw.Rmd Some basic facts comparing first year (2006) and current year (2018) … Feb 22, 2019
first+this_raw.md Some basic facts comparing first year (2006) and current year (2018) … Feb 22, 2019
school_loc_merged.2006.Rds
school_loc_merged.2006.csv uploading csv versions of the school location 2006 and 2016 files. Jan 24, 2019
school_loc_merged.2016.Rds init location data files Dec 20, 2018
school_loc_merged.2016.csv uploading csv versions of the school location 2006 and 2016 files. Jan 24, 2019
school_maps.Rmd removed command echos; added note about plotly labels Jan 14, 2019
school_maps.md removed command echos; added note about plotly labels Jan 14, 2019

README.md

Introducing CPS Enrollment Datasets

Charlotte Mack February 08, 2019

CPSenrollment

Exploring enrollment data from the Chicago Public Schools

This repository contains exploratory analysis and visualizations of enrollment data that were extracted from Chicago Public Schools (CPS) public records. The prepared data are available at this repository in Rds and csv formats; at present there are only high school data, with elementary school data forthcoming. The sets span the school years from 2006--2007 through 2018--2019, and will be periodically updated.

A glimpse of the high schools data:

## Observations: 2,097
## Variables: 10
## $ govern      <chr> "regular", "regular", "regular", "regular", "regular…
## $ school_id   <chr> "610245", "609695", "609696", "610402", "609708", "6…
## $ common_name <chr> "Douglass HS", "Amundsen HS", "Austin HS", "DeVry Ad…
## $ year        <dbl> 2006, 2006, 2006, 2006, 2006, 2006, 2006, 2006, 2006…
## $ total       <dbl> 737, 1500, 580, 223, 1738, 1694, 1439, 4278, 1936, 5…
## $ total_hs    <dbl> 384, 1500, 580, 223, 1738, 1694, 1439, 4278, 1936, 5…
## $ g09         <dbl> 215, 435, NA, NA, 606, 510, 513, 1103, 578, 254, 262…
## $ g10         <dbl> 119, 438, NA, NA, 504, 419, 331, 945, 546, 254, 305,…
## $ g11         <dbl> 50, 361, 367, 107, 380, 371, 291, 1230, 403, NA, 267…
## $ g12         <dbl> NA, 266, 213, 116, 248, 394, 304, 1000, 409, NA, 208…

The original CPS data are available in a series of spreadsheets at CPS website[1] Downloadable data files that I have prepared are in this repository with Rds and csv extensions, downloadable through the links below. The Rds files, which are used in R language programming, may have some type designations that are not in the csv files, but there should be no other difference.

Rds file for all CPS high schools, September 2006 to September 2018

csv file for all CPS high schools, September 2006 to September 2018

Rds files of high school locations as of 2014--2015 school year, with enrollments from 2016--2017 school year, and from 2006--2007 school year These data were used to construct the school maps linked below.

A more detailed overview of the all-years enrollment data is in the file CPSenrollment.md.

A brief article with maps of District high schools under regular and other governance for the 2006--2007 and 2016--2017 school years is in the file school_maps.md.

The preparation of the data 2006-2017 is described in data.R and clean.R. data.R is a protocol in the form of a script of the first interactive steps that I took in preparing the unified data set. The protocol here takes the data from the original set of annual membership rosters, published in spreadsheets and having various sets and arrangements of variables, to R data.frames of uniform layout. I undertook the project when I was fairly new to R, so it is not as automated as it might be; on the other hand, each year's roster from the Chicago Public Schools has idiosyncrasies that can only be found and worked out manually. clean.R examines missing data more closely and creates a uniform set of names for each school to be used throughout the data.frame.

The protocol for the 2018-2019 update is in data_2018.R, for those who want a more compact overview.

Miscellaneous exploratory analysis, mostly in graphs is in explorechange a subset with dplyr transmute.R.

[1] Membership data are under the heading "Demographics."