This repository contains the code and data for the paper "Dead or Alive, we can deny it”. A Differentially Private Approach to Survival Analysis" accepted at the SEBD 2024: 32nd Symposium on Advanced Database Systems.
- Francesco Luigi De Faveri, Guglielmo Faggioli, Nicola Ferro: Department of Information Engineering, University of Padua, Padua, Italy.
- Riccardo Spizzo, National Cancer Center CRO Aviano, Aviano, Italy.
- francescoluigi.defaveri@phd.unipd.it (F. L. De Faveri)
- faggioli@dei.unipd.it (G. Faggioli)
© 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
Survival Analyses (SAs), a key statistical tool used to predict event occurrence over time, often involve sensitive information, necessitating robust privacy safeguards. This work demonstrates how the Revised Randomized Response (RRR) can be adapted to ensure Differential Privacy (DP) while performing SAs. This methodology seeks to safeguard the privacy of individuals’ data without significantly changing the utility, represented by the statistical properties of the survival rates computed. Our findings show that integrating DP through RRR into SAs is both practical and effective, providing a significant step forward in the privacy-preserving analysis of sensitive time-to-event data. This study contributes to the field by offering a new comparison method to the current state-of-the-art used for SAs in medical research.
Differential Privacy, Privacy-Preserving Mechanisms, Survival Analysis, Information Security
The code is written in Python and implemented using two Jupyter Notebooks. The first notebook relates to the original scenario, while the second notebook relates to the DP scenario. The code is available in the src
folder. All the notebooks are commented to explain the code and the results.
The results are available in the results
folder. The results are presented in the form of tables and plots.