--- 
Project for the course in Microeconometrics | Summer 2021, M.Sc. Economics, Bonn University | [Mengxi Wang](https://github.com/Mengxi-20)

# Replication of Chen, T., Kung, J. K. S., & Ma, C. (2020) <a class="tocSkip"></a>
---

This notebook contains my replication of the results from the following paper:

> Chen, T., Kung, J. K. S., & Ma, C. (2020). Long Live Keju! The Persistent Effects of China’s Civil Examination System. The Economic Journal, 130(631), 2030–2064. 

##### Downloading and viewing this notebook:


* The best way to view this notebook is to download it, together with the repository it is located in, from [GitHub](https://github.com/OpenSourceEconomics/ose-data-science-course-project-Mengxi-20). Other viewing options such as _MyBinder_ or _NBViewer_ may not display images or colored elements correctly (missing images can be viewed in the folder [files](https://github.com/OpenSourceEconomics/ose-data-science-course-project-Mengxi-20) on GitHub).

* The original paper, as well as the data and code provided by the authors, can be accessed [here](https://academic.oup.com/ej/article/130/631/2030/5819954).


##### Information about replication and individual contributions:

* 

<h1>Table of Contents<span class="tocSkip"></span></h1>

<div class="toc"><ul class="toc-item"><li><span><a href="#1.-Introduction" data-toc-modified-id="1.-Introduction-1">1. Introduction</a></span></li><li><span><a href="#2.-Identification" data-toc-modified-id="2.-Identification-2">2. Identification</a></span></li><li><span><a href="#3.-Empirical-Setup" data-toc-modified-id="3.-Empirical-Setup-3">3. Empirical Setup</a></span></li><li><span><a href="#4.-Replication-of-Chen-et-al.-(2020)" data-toc-modified-id="4.-Replication-of-Chen-et-al.-(2020)-4">4. Replication of Chen et al. (2020)</a></span><ul class="toc-item"><li><span><a href="#4.1.-Data-&amp;-Descriptive-Statistics" data-toc-modified-id="4.1.-Data-&amp;-Descriptive-Statistics-4.1">4.1. Data &amp; Descriptive Statistics</a></span></li><li><span><a href="#4.2.-Results" data-toc-modified-id="4.2.-Results-4.2">4.2. Results</a></span><ul class="toc-item"><li><span><a href="#4.2.1.-Tests-of-the-Validity-of-the-RD-Approach" data-toc-modified-id="4.2.1.-Tests-of-the-Validity-of-the-RD-Approach-4.2.1">4.2.1. Tests of the Validity of the RD Approach</a></span></li><li><span><a href="#i.--Extension:-Visual-Validity-Check" data-toc-modified-id="i.--Extension:-Visual-Validity-Check-4.2.2">i.  Extension: Visual Validity Check</a></span></li><li><span><a href="#ii.-Advanced-Validity-Check" data-toc-modified-id="ii.-Advanced-Validity-Check-4.2.3">ii. Advanced Validity Check</a></span></li><li><span><a href="#4.2.2.-First-Year-GPAs-and-Academic-Probation" data-toc-modified-id="4.2.2.-First-Year-GPAs-and-Academic-Probation-4.2.4">4.2.2. First Year GPAs and Academic Probation</a></span></li><li><span><a href="#4.2.3.-The-Immediate-Response-to-Academic-Probation" data-toc-modified-id="4.2.3.-The-Immediate-Response-to-Academic-Probation-5.2.5">4.2.3. The Immediate Response to Academic Probation</a></span></li><li><span><a href="#4.2.4.-The-Impact-onSubsequent-Performance" data-toc-modified-id="4.2.4.-The-Impact-onSubsequent-Performance-4.2.6">4.2.4. 
The Impact on Subsequent Performance</a></span></li><li><span><a href="#i.-Main-Results-for-Impact-on-GPA-&amp;-Probability-of-Placing-Above-Cutoff-in-the-Next-Term" data-toc-modified-id="i.-Main-Results-for-Impact-on-GPA-&amp;-Probability-of-Placing-Above-Cutoff-in-the-Next-Term-4.2.7">i. Main Results for Impact on GPA &amp; Probability of Placing Above Cutoff in the Next Term</a></span></li><li><span><a href="#ii.-Formal-Bound-Analysis-on-Subsequent-GPA-(partial-extension)" data-toc-modified-id="ii.-Formal-Bound-Analysis-on-Subsequent-GPA-(partial-extension)-4.2.8">ii. Formal Bound Analysis on Subsequent GPA (partial extension)</a></span></li><li><span><a href="#4.2.5.-The-Impacts-on-Graduation" data-toc-modified-id="4.2.5.-The-Impacts-on-Graduation-4.2.9">4.2.5. The Impacts on Graduation</a></span></li></ul></li></ul></li><li><span><a href="#6.-Extension:-Robustness-Checks" data-toc-modified-id="5.-Extension:-Robustness-Checks-5">5. Extension: Robustness Checks</a></span><ul class="toc-item"><li><span><a href="#5.1.--A-Closer-Look-at-Students'-Subsequent-Performance." data-toc-modified-id="5.1.--A-Closer-Look-at-Students'-Subsequent-Performance.-5.1">5.1.  A Closer Look at Students' Subsequent Performance.</a></span><ul class="toc-item"><li><span><a href="#5.1.1.-Subsequent-Performance-and-Total-Credits-in-Year-2" data-toc-modified-id="5.1.1.-Subsequent-Performance-and-Total-Credits-in-Year-2-5.1.1">5.1.1. Subsequent Performance and Total Credits in Year 2</a></span></li><li><span><a href="#5.1.2.-Subsequent-Cumulative-Grade-Point-Average-(CGPA)" data-toc-modified-id="5.1.2.-Subsequent-Cumulative-Grade-Point-Average-(CGPA)-5.1.2">5.1.2. Subsequent Cumulative Grade Point Average (CGPA)</a></span></li></ul></li><li><span><a href="#5.2.-Bandwidth-Sensitivity" data-toc-modified-id="5.2.-Bandwidth-Sensitivity-5.2">5.2. Bandwidth Sensitivity</a></span></li></ul></li><li><span><a href="#6.-Conclusion" data-toc-modified-id="6.-Conclusion-6">6. 
Conclusion</a></span></li><li><span><a href="#7.-References" data-toc-modified-id="7.-References-7">7. References</a></span></li></ul></div>

In [1]:
%matplotlib inline
# pip install linearmodels==4.24
# pip install graphviz

In [5]:
import numpy as np
import pandas as pd
import pandas.io.formats.style
import seaborn as sns
import statsmodels.api as sm
import matplotlib.pyplot as plt
import statsmodels.formula.api as smf
import statsmodels.api as sm_api

from linearmodels import IV2SLS, IVLIML, IVGMM, IVGMMCUE
from IPython.display import HTML, Image