# Identifying Need for Change: North Carolina School Performance

Of the 2,617 public schools (including charter schools) operating in North Carolina during the 2016-2017 school year, 902 (34.5%) have been classified as low performing for at least one year since 2013-2014.

NCDPI defines low-performing schools and low-performing districts as follows:

“Low-performing schools are those that receive a **school performance grade** of **D** or **F** and a **school growth score** of **"met expected growth"** or **"not met expected growth"** as defined by G.S. 115C-83.15.” (G.S. 115C-105.37(a)), and

“A Low-performing local school administrative unit is a unit in which the majority of the schools in that unit that received a school performance grade and school growth score as provided in G.S. 115C-83.15 have been identified as low-performing schools, as provided in G.S. 115C-105.37.” (G.S. 115C-105.39A(a)).

Source: http://www.ncpublicschools.org/schooltransformation/low-performing/

**Thus, we treat low performing schools as a proxy for aggregate student educational achievement.**


## Problem Statement: 
In recent years, 30% of public schools in North Carolina have been low performing. Students in low performing schools are not meeting the educational achievement standards set by the state. Factors outside administrators' control, namely economically disadvantaged and majority-minority student populations, are the most influential indicators of low performance.


## Motivation: 
Of the 902 schools that were low performing at some point between 2013/14 and 2016/17, 209 were low performing in all four school years (8% of all 2,617 schools), 203 in three of the four school years (7.8%), 227 in two school years (8.7%), and 263 in one school year (10%). Put another way, 24.4% of schools were low performing in more than one of the four school years between 2013/14 and 2016/17.
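
The shares quoted above can be reproduced with a few lines of arithmetic. This is a small check using only the counts stated in this section; the variable names are illustrative and do not come from the project code.

```python
# Counts taken from the paragraph above:
# number of school years (out of four) low performing -> number of schools.
TOTAL_SCHOOLS = 2617
years_low_performing = {4: 209, 3: 203, 2: 227, 1: 263}

# Schools that were low performing at least once.
ever_low = sum(years_low_performing.values())

# Schools that were low performing in two or more of the four years.
recurring = sum(n for years, n in years_low_performing.items() if years >= 2)

for years, n in years_low_performing.items():
    print(f"{years} year(s): {n} schools ({n / TOTAL_SCHOOLS:.1%})")
print(f"at least once: {ever_low} schools ({ever_low / TOTAL_SCHOOLS:.1%})")
print(f"recurring:     {recurring} schools ({recurring / TOTAL_SCHOOLS:.1%})")
```

All percentages use the full set of 2,617 schools as the denominator, which is why the four groups sum to the 34.5% reported at the top of this notebook.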

What does this mean in terms of students? 
Of the roughly 1.5 million students enrolled in a North Carolina public school in the 2016/17 school year, around 460,000 (30%) have spent at least one year studying at a low performing school. Around 100,000 students (6.8%) study at a school that has been low performing for all 4 years.


## Solution:
We identify the factors within school administrators' control that can positively impact a school's EVAAS growth score. Growth scores measure how much a school's performance improves over a year. We focus on growth instead of raw performance scores because raw scores change much more slowly over time.

This notebook reviews NCDPI North Carolina School Report Card and Statistical Profile data to identify the factors that contribute to low school performance and EVAAS growth scores. We then create a model to predict EVAAS growth scores using only school-level factors, irrespective of student demographics. After determining the most predictive factors for EVAAS growth scores, we simulate changes in these factors to demonstrate a theoretical improvement in student achievement growth.

We: 
1. Examine the most commonly cited reasons for low school performance (percentage of economically disadvantaged students, student demographics, and school funding) to see whether these indicators are statistically different in low performing schools.

2. Remove the factors outside of the school administration's control from the dataset to determine which school-level factors are most important in determining low performance by:
    1. Performing feature importance analysis using XGBoost. XGBoost is a tree-based gradient-boosting method that minimizes a cost function for predicting a target variable. Each time a node in a decision tree is split, we can calculate the resulting reduction in impurity and attribute that reduction to the feature used for the split. Once the tree has finished splitting nodes, the features with the largest proportional contribution toward decreasing impurity can be said to be the most “important.”
    
3. Test the predictive power of our selected features by building a model to predict the EVAAS growth score for each of our 4 years. Whether these features are important will be reflected in the accuracy and precision of that model.

4. Use a new methodology for decision-making. We create a function that synthesizes data based on percent changes in a chosen input feature; the synthesized data is then fed to our model to generate new predictions. This allows us to review, all things being equal, how a change in one or more features may correlate with a change in a school's EVAAS growth score.
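
The simulation idea in the last step above could look roughly like the following. This is a minimal sketch and not the project's actual function: `simulate_pct_change`, `mean_prediction_shift`, and `toy_predict` are hypothetical names, the coefficients in `toy_predict` are invented purely for illustration, and schools are represented as plain dictionaries so that the example runs without any third-party library.

```python
from typing import Callable, Dict, List

School = Dict[str, float]


def simulate_pct_change(schools: List[School], feature: str, pct: float) -> List[School]:
    """Return copies of `schools` with `feature` scaled by (1 + pct / 100).

    Every other feature is left untouched ("all things being equal"),
    and the input records are not modified.
    """
    factor = 1.0 + pct / 100.0
    return [{**s, feature: s[feature] * factor} for s in schools]


def mean_prediction_shift(
    predict: Callable[[School], float],
    schools: List[School],
    feature: str,
    pct: float,
) -> float:
    """Average change in the model's prediction after the simulated change."""
    simulated = simulate_pct_change(schools, feature, pct)
    diffs = [predict(after) - predict(before) for before, after in zip(schools, simulated)]
    return sum(diffs) / len(diffs)


def toy_predict(s: School) -> float:
    # Stand-in for a fitted model. These coefficients are made up;
    # a real analysis would call the trained model's predict method instead.
    return 80.0 - 0.5 * s["short_susp_per_c_num"] + 0.1 * s["avg_daily_attend_pct"]


# Hypothetical school records, not real data.
schools = [
    {"short_susp_per_c_num": 10.0, "avg_daily_attend_pct": 95.0},
    {"short_susp_per_c_num": 20.0, "avg_daily_attend_pct": 92.0},
]

# Simulate a 10% reduction in short-term suspensions.
shift = mean_prediction_shift(toy_predict, schools, "short_susp_per_c_num", -10.0)
print(f"mean change in predicted growth score: {shift:+.2f}")
```

Because only one feature is altered per call, the resulting shift describes how the model's prediction responds to that feature, which is a correlation captured by the model rather than evidence of a causal effect.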


*Please note: Dataset creation and processing can be found at https://github.com/oleeson/NCPDI-Capstone*
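
The impurity-based notion of feature importance described in the XGBoost step above can be illustrated with a toy calculation. This sketch uses Gini impurity on binary labels (1 = low performing, 0 = not) and a hand-written list of hypothetical splits. It is not how XGBoost computes importance internally (its "gain" is derived from gradient statistics of the loss function), and the split data below is invented for illustration.

```python
from collections import Counter, defaultdict


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def impurity_reduction(parent, left, right):
    """Decrease in Gini impurity when `parent` is split into `left` and `right`."""
    n = len(parent)
    children = len(left) / n * gini(left) + len(right) / n * gini(right)
    return gini(parent) - children


# Hypothetical splits recorded while growing one tree:
# (feature used, labels in the node, labels sent left, labels sent right)
splits = [
    ("avg_daily_attend_pct", [1, 1, 1, 1, 0, 0, 0, 0], [1, 1, 1, 0], [1, 0, 0, 0]),
    ("short_susp_per_c_num", [1, 1, 1, 0], [1, 1, 1], [0]),
    ("avg_daily_attend_pct", [1, 0, 0, 0], [1], [0, 0, 0]),
]
n_total = len(splits[0][1])  # number of samples at the root node

# Credit each split's reduction to its feature, weighted by the share of
# samples that reached the node being split.
importance = defaultdict(float)
for feature, parent, left, right in splits:
    importance[feature] += len(parent) / n_total * impurity_reduction(parent, left, right)

# Normalize so the importances sum to 1.
total = sum(importance.values())
shares = {feature: value / total for feature, value in importance.items()}

for feature, share in sorted(shares.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{feature}: {share:.3f}")
```

Features that are used in more splits, or in splits that separate the classes more cleanly, accumulate a larger share of the total reduction and therefore rank higher.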

# XGBoost FINDINGS

# Top Features of Importance for EVAAS GROWTH SCORE Prediction

### School and Environment Level 
#### Resources
- student_num: Number of students at school level (school size) 

- class_teach_num: Average number of classroom teachers at school level
- lea_class_teach_num: Average number of classroom teachers at LEA level

- wap_num: Wireless Access Points at the school level
- wap_per_classroom: Wireless Access Points per Classroom at the school level

- stud_internet_comp_num: Ratio of students to internet-connected computers at school level
- lea_stud_internet_comp_num: Ratio of students to internet-connected computers at LEA level

- summer_program_ind: Summer school program was included in Read to Achieve results


#### Discipline
- avg_daily_attend_pct: Average daily attendance percentage at school level
- lea_avg_daily_attend_pct: Average daily attendance percentage at LEA level

- short_susp_per_c_num: Short term suspensions per 100 students at school level 
- lea_short_susp_per_c_num: Short term suspensions per 100 students at LEA level 

- lea_crime_per_c_num: Number of crimes or acts of violence per 100 students at LEA level 
- crime_per_c_num: Number of crimes or acts of violence per 100 students at school level 

### Educator Level
#### Teachers
- flicensed_teach_pct: Percent of teachers that meet NC fully licensed definition at school level
- nbpts_num: Number of National Board Certified Staff at school level 

- tchyrs_0thru3_pct: Percent of teachers with 0 - 3 years of experience at the school level
- tchyrs_4thru10_pct: Percent of teachers with 4 - 10 years of experience at the school level
- lea_tchyrs_4thru10_pct: Percent of teachers with 4 - 10 years of experience at the LEA level

- advance_dgr_pct: Percent of teachers with masters or higher degree at school level 
- lea_advance_dgr_pct: Percent of teachers with masters or higher degree at LEA level  

- Accomplished_TCHR_Standard 1_Pct: Percent of teachers reaching Standard 1 (Teachers Demonstrate Leadership)

#### Principals
- 10+ Years_LEA_Exp_Pct_Prin : Percent of principals with 10+ years of experience at the LEA level