Update README.md

OpportunityInsights · Sep 11, 2019 · b32f1fc · b32f1fc
1 parent efb9d27
commit b32f1fc
Showing 1 changed file with 29 additions and 0 deletions.
diff --git a/README.md b/README.md
@@ -1,2 +1,31 @@
 # Differential-Privacy
 Replication Package for "A Practical Method to Reduce Privacy Loss when Disclosing Statistics Based on Small Samples"
+
+Figures Replication
+
+This folder contains the Stata code to reproduce figures 1 to 3 in “A Practical Method to Reduce Privacy Loss when Disclosing Statistics Based on Small Samples” by Chetty and Friedman (2019).
+
+To run the code, please set the file paths in in the do-file all_figures_paper.do to the files contained in this replication package (tract_covariates.dta and tract_outcomes_early_dta_dp.dta). 
+
+
+Implementation of Noise-Infusion Algorithm: Example Code
+
+The sub-folder example_code_implementation_guide contains two examples of how to implement the noise-infusion algorithm outlined in the Appendix A of “A Practical Method to Reduce Privacy Loss when Disclosing Statistics Based on Small Samples” by Chetty and Friedman (2019). The three files in the sub-folder contain the following:
+
+Example 1 – Simple regression coefficient as statistic of interest
+
+Implementation_guide_simple_reg.do
+
+Stata do-file showing a step-by-step example of how to apply the noise-infusion algorithm to publicly release the estimated coefficients of a simple regression estimate of child income rank on parent income rank in each of the cells of a simulated dataset.
+
+Example 2 – Predicted value of Y at a certain value of X as statistic of interest
+
+Implementation_guide_p_25_prediction.do
+
+Stata do-file showing a step-by-step example of how to apply the noise-infusion algorithm to publicly release the predicted value of child income rank at the 25th percentile of the parental income distribution in each of the cells of a simulated dataset.
+
+Dataset
+
+private_data_by_cells.dta
+
+Simulated dataset in Stata format containing information on child income rank and parent income rank of 10,000 fictitious individuals grouped in 111 cells.Variable names: parent_rank, kid_rank, cell.