
Data on the School-To-Prison Pipeline


Combined public datasets relating to various facets of the school-to-prison pipeline.

The Datasets folder includes data from the following organizations:

  • Office of Juvenile Justice and Delinquency Prevention
  • Iowa Department of Human Services
  • Iowa Department of Public Health
  • U.S. Census Bureau
  • U.S. Dept. of Education Civil Rights Data Collection
  • Iowa Department of Education
  • Iowa Workforce Development

The original datasets are in the Current Data sub-folder. The Datasets folder also includes a list of datasets and a codebook. The codebook documents each variable's code name, description, observational unit, type, and unit of measurement.

An interactive version of the dataset inventory and codebook is available on AirTable.

The combined dataset is included in the S2PP_Master CSV file located in the Datasets folder.
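
As a starting point for working with the combined file, here is a minimal Python sketch that reads the CSV and reports its size and column names. The path `Datasets/S2PP_Master.csv` is an assumption based on the description above, and the helper names `load_master` and `summarize` are hypothetical; they are not part of the repository's cleaning scripts. Column names are not hard-coded because they are defined in the codebook.

```python
import csv
from pathlib import Path

# Assumed location: the README names the S2PP_Master CSV file in the
# Datasets folder, but the exact file name and extension are a guess.
MASTER_PATH = Path("Datasets") / "S2PP_Master.csv"


def load_master(path):
    """Read the combined CSV into a list of dicts, one dict per row."""
    # utf-8-sig reads plain UTF-8 and also drops a byte-order mark if present.
    with open(path, newline="", encoding="utf-8-sig") as handle:
        return list(csv.DictReader(handle))


def summarize(rows):
    """Return (row_count, column_names) as a quick sanity check."""
    columns = list(rows[0].keys()) if rows else []
    return len(rows), columns


# Only runs when the assumed file is actually present.
if __name__ == "__main__" and MASTER_PATH.exists():
    rows = load_master(MASTER_PATH)
    count, columns = summarize(rows)
    print(f"{count} rows, {len(columns)} columns")
    print(columns[:10])
```

Every value is read as a string, so numeric variables need converting according to the types listed in the codebook before analysis.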

The dataset covers the following facets of the school-to-prison pipeline:

  • Juvenile arrests
  • Juvenile public health
  • Juvenile residential placement
  • Student academic proficiency
  • Student discipline, expulsion, and retention
  • Special education
  • Teacher credentialing and ongoing professional development

The dataset covers the following geographic areas:

  • U.S. states (all)
  • Counties (Iowa)

The Python scripts and R Markdown files used to clean and merge datasets are included in the Cleaning Scripts folder.

Qualitative Data

The Qualitative Data folder includes sub-folders for each county in Iowa.

Qualitative information collected includes the following data types:

  • Handbooks
  • Policy manuals
  • News stories
  • District websites

An interactive version of the qualitative information is available on AirTable.

Project History

This data collection project began in summer 2018 as a collaboration between Tammy Nyden and Katherine Walden, as part of "The School to Prison Pipeline Project," an initiative supported by Mothers on the Front Lines and the Grinnell College Innovation Fund.

The first stage of data collection took place in the 2018-2019 academic year, led by Daisy Morales '20 through her work in the Vivero Digital Scholarship Fellowship Program. Morales was responsible for identifying major data sources, downloading original datasets, documenting data gathered, and generating relevant metadata and contextual information about the datasets. Morales also gathered, organized, and inventoried the qualitative data included in this repository.

The initial round of data processing took place in Spring 2019 and was made possible through the support of the Data Analysis and Social Inquiry Lab (DASIL), supported by the Roy J. Carver Charitable Trust's data science grant to Grinnell College. DASIL student workers Cora Touchstone '19 (Economics and Mathematics) and Hollis Akins '22 (Physics) led that work, supervised and mentored by Jarren Santos '17 (DASIL Data Scientist) and Xavier Escandell (DASIL Director). Additional data processing took place in Fall 2019 and was undertaken by DASIL student worker Hollis Akins, supervised and mentored by Jarren Santos.

The collected data was made available for use by students enrolled in the Fall 2019 course "The School-to-Prison Pipeline" and co-requisite digital lab "Digital Stories for Social Justice."

Project Team

Project Support
