A data science tool for the Three Strikes Project at the Stanford University School of Law that proactively and automatically identifies nonviolent offenders in California's prison system who are eligible for Prosecutor Initiated Resentencing (PIR).
If you make use of our dataset(s) or code, please cite our work as follows:
"Redo.io. resentencing-data-initiative (Version 1.0.0) [Computer software]. https://github.com/redoio/resentencing_data_initiative/"
@software{Redo_io_resentencing-data-initiative, author = {{Redo.io}}, title = {{resentencing-data-initiative}}, url = {https://github.com/redoio/resentencing_data_initiative/}, version = {1.0.0}}
All content in this repository is licensed under GNU AGPLv3
You must give appropriate credit to our work. You may not use our work for commercial purposes, which means anything primarily intended for or directed toward commercial advantage or monetary compensation.
The Three Strikes Project is legal clinical seminar where law students represent individuals serving life sentences for non-violent crimes under California’s Three Strikes Law. Lawyers assist district attorney’s offices throughout the state who are seeking opportunities to safely and effectively reduce California’s prison population through Prosecutor Initiated Resentencing (PIR).
But, given that there are over 100,000 people in California's prisons, lawyers need efficient and accurate ways to identify those who are eligible for PIR instead of manually reviewing piles of paperwork. Redo.io's open-source eligibility tool helps lawyers filter their caseload and apply their expertise efficiently. We are currently under a contract with the Stanford School of Law to develop this application.
A rules-based and deterministic model that generates cohorts of individuals eligible for PIR based on criteria established by district attorney's offices.
What it is:
- A deterministic model with a rules-based framework designed by Redo.io in conjunction with legal experts
- Easily interpretable and explainable for non-technical users and audience
- Responsive to user requests, i.e. an attorney can request changes to the eligibility criteria and see updated results
What it is not:
- A blackbox predictive model
- A decision-maker that determines the cases that are ultimately resentenced by the courts
- Ingest the demographics, current offenses and prior offenses of individuals in California Department of Corrections and Rehabilitation (CDCR) custody
- Run the eligibility model (with selection criteria and scenarios designed by legal experts)
- Return a cohort of individuals eligible for PIR
- Provide attorneys easy-to-understand personal profiles of each candidate
- Re-run the eligibility model when new data on the prison population is available
An example of the solution pipeline implemented for the Los Angeles County District Attorney's Office:
Raw data is sourced from the Three Strikes Project and the participating district attorney's offices. We primarily utilize the following information on the prison population for eligibility determination.
Information on the offenses an individual is currently serving time for.
CDCR #
: Universal identification across datasets
Sentencing County
: District Attorney Office or location
Case Number
: Case identification
Offense
: Penal code number
Offense Category
: Nature of offense
Offense Begin Date
: Date when offense was committed
Enhancements
: Additional charges added to an individual's record
Each row will include the details on ONE offense committed by the individual, i.e. if an individual has committed 6 offenses under 2 separate case numbers, there will be 6 rows in total associated with their CDCR identification number.
Relationship
variable specifies the nature of a sentence and takes the following values: 'Initial', 'Concurrent', 'Consecutive' or 'Stayed'
In-prison
variable specifies if an offense was committed in prison or not and takes the following values: 'Yes' or 'No'
Information on the offenses an individual is no longer serving time for.
CDCR #
: Universal identification across datasets
Sentencing County
: District Attorney Office/Location
Case Number
: Case identification
Offense
: Penal code number
Offense Category
: Nature of offense
Offense Begin Date
: Date when offense was committed
Each row will include the details on ONE offense committed by the individual, i.e. if an individual has committed 6 offenses under 2 separate case numbers, there will be 6 rows in total associated with their CDCR identification number.
Relationship
variable specifies the nature of a sentence and takes the following values: 'Initial', 'Concurrent', 'Consecutive' or 'Stayed'
In-prison
variable specifies if an offense was committed in prison or not and takes the following values: 'Yes' or 'No'
Personal information on the individual. Note that we do not utilize any demographic information such as gender or race.
CDCR #
: Universal identification across datasets
Birthday
: Individual's date of birth
Offense Begin Date
: Date when offense was committed
Aggregate Sentence in Months
: Total sentence length in months
Three Strikes Law and Prosecutor Initiated Resentencing (PIR):
- https://law.stanford.edu/three-strikes-project/three-strikes-basics/
- https://capitalbnews.org/prosecutor-resentencing-law/
Contact aparna.komarla@gmail.com for any questions
Currently looking for interns and developers to join our team. Here is a short summary of available projects: https://github.com/redoio/resentencing_data_initiative/blob/main/opportunities.md. Reach out to Aparna Komarla (aparna.komarla@gmail.com) if you are interested!