LipDP is a Python toolkit dedicated to robust and certifiable learning under privacy guarantees.
This package contains the code for the paper "DP-SGD Without Clipping: The Lipschitz Neural Network Way" by Louis Béthune, Thomas Massena, Thibaut Boissin, Aurélien Bellet, Franck Mamalet, Yannick Prudent, Corentin Friedrich, Mathieu Serrurier, and David Vigouroux, published at the International Conference on Learning Representations (ICLR 2024). The paper is available on arXiv.
State-of-the-art approaches for training Differentially Private (DP) Deep Neural Networks (DNNs) struggle to estimate tight bounds on the sensitivity of the network's layers, and instead rely on per-sample gradient clipping. This clipping process not only biases the direction of gradients but is also costly in both memory and computation. To provide sensitivity bounds and bypass the drawbacks of clipping, we propose to rely on Lipschitz-constrained networks. Our theoretical analysis reveals an unexplored link between the Lipschitz constant of a network with respect to its inputs and the one with respect to its parameters. By bounding the Lipschitz constant of each layer with respect to its parameters, we prove that such networks can be trained with privacy guarantees. Our analysis not only allows the computation of the aforementioned sensitivities at scale, but also provides guidance on how to maximize the gradient-to-noise ratio for fixed privacy guarantees. To facilitate the application of Lipschitz networks and foster robust and certifiable learning under privacy guarantees, this Python package implements the building blocks needed to construct and privately train such networks.
The package computes the sensitivity automatically, so no element-wise clipping is required. This yields a new DP-SGD algorithm, called Clipless DP-SGD, that is faster and more memory efficient than DP-SGD with clipping.
- 📚 Table of contents
- 🔥 Tutorials
- 🚀 Quick Start
- 📦 What's Included
- 👀 See Also
- 🙏 Acknowledgments
- 👨‍🎓 Creators
- 🗞️ Citation
- 📝 License
We provide tutorials to help you get familiar with the library and its API.
LipDP requires several dependencies, including NumPy and TensorFlow. It can be installed locally by cloning the repository and running:
```bash
pip install -e .[dev]
```
The privacy parameters are stored in a dataclass:
```python
from deel.lipdp.model import DPParameters

dp_parameters = DPParameters(
    noisify_strategy="local",
    noise_multiplier=4.0,
    delta=1e-5,
)

epsilon_max = 10.0
```
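Here, `noise_multiplier` scales the Gaussian noise relative to the automatically computed sensitivity, `delta` is the δ of the (ε, δ)-DP guarantee, and `epsilon_max` is the privacy budget that training should not exceed. The paper also describes a global noisification strategy, where noise is added to the aggregated gradient rather than layer per layer; as a hedged sketch (assuming `"global"` is the corresponding accepted value of `noisify_strategy`):

```python
# Sketch under the assumption that noisify_strategy also accepts "global":
# "global" noisifies the aggregated gradient, "local" noisifies layer per layer.
dp_parameters_global = DPParameters(
    noisify_strategy="global",
    noise_multiplier=4.0,
    delta=1e-5,
)
```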
```python
from deel.lipdp import layers
from deel.lipdp.model import DP_Sequential

# dataset_metadata and input_upper_bound come from the data preparation step (not shown).
# construct DP_Sequential
model = DP_Sequential(
    # works like the usual Sequential model, but requires DP layers
    layers=[
        # DP_BoundedInput works like Input, but clips inputs to guarantee the input bound.
        layers.DP_BoundedInput(
            input_shape=dataset_metadata.input_shape, upper_bound=input_upper_bound
        ),
        layers.DP_QuickSpectralConv2D(  # Reshaped Kernel Orthogonalization (RKO) convolution.
            filters=32,
            kernel_size=3,
            kernel_initializer="orthogonal",
            strides=1,
            use_bias=False,  # No biases since the framework handles a single tf.Variable per layer.
        ),
        layers.DP_GroupSort(2),  # GNP activation function.
        layers.DP_ScaledL2NormPooling2D(pool_size=2, strides=2),  # GNP pooling.
        layers.DP_QuickSpectralConv2D(  # RKO convolution.
            filters=64,
            kernel_size=3,
            kernel_initializer="orthogonal",
            strides=1,
            use_bias=False,
        ),
        layers.DP_GroupSort(2),  # GNP activation function.
        layers.DP_ScaledL2NormPooling2D(pool_size=2, strides=2),  # GNP pooling.
        layers.DP_Flatten(),  # Convert feature maps to a flat vector.
        layers.DP_QuickSpectralDense(512),  # GNP layer with orthogonal weight matrix.
        layers.DP_GroupSort(2),
        layers.DP_QuickSpectralDense(dataset_metadata.nb_classes),
    ],
    dp_parameters=dp_parameters,
    dataset_metadata=dataset_metadata,
)
```
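Before fitting, the model is compiled like any Keras model. A minimal sketch, assuming the DP-aware loss `DP_TauCategoricalCrossentropy` from `deel.lipdp.losses` (the class name and signature may differ in your version of the package):

```python
import tensorflow as tf

from deel.lipdp.losses import DP_TauCategoricalCrossentropy

# The loss must have a known Lipschitz constant so that the gradient
# sensitivity can be bounded; the first argument is the temperature tau.
model.compile(
    loss=DP_TauCategoricalCrossentropy(8.0),
    optimizer=tf.keras.optimizers.SGD(learning_rate=1e-2),
    metrics=["accuracy"],
)
```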
The privacy accountant combines mechanisms from the autodp package to track the privacy loss of the Clipless DP-SGD algorithm.
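For intuition, here is a hedged sketch of the kind of composition autodp performs (it uses autodp's public `mechanism_zoo` and `transformer_zoo`; the exact mechanisms that LipDP combines may differ):

```python
from autodp.mechanism_zoo import GaussianMechanism
from autodp.transformer_zoo import AmplificationBySampling, Composition

# One noisy training step behaves like a Gaussian mechanism whose sigma is
# the noise multiplier (the noise is calibrated to the computed sensitivity).
step = GaussianMechanism(sigma=4.0, name="noisy_step")

# Poisson subsampling of the dataset amplifies privacy.
subsample = AmplificationBySampling(PoissonSampling=True)
subsampled_step = subsample(step, 0.01, improved_bound_flag=True)  # q = batch_size / dataset_size

# The subsampled steps compose over the course of training.
compose = Composition()
training = compose([subsampled_step], [10_000])  # e.g. 10,000 optimizer steps

epsilon = training.get_approxDP(delta=1e-5)
print(f"Training is ({epsilon:.2f}, 1e-5)-DP")
```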
Adding a privacy accountant to your model is straightforward:
```python
from deel.lipdp.model import DP_Accountant

model.fit(
    ds_train,
    epochs=num_epochs,
    validation_data=ds_test,
    callbacks=[
        # accounting is done thanks to a callback
        DP_Accountant(log_fn="logging"),  # wandb.log also available.
    ],
)
```
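The callback reports the (ε, δ) privacy budget spent so far as training progresses, with `log_fn` selecting the logging backend; you can then stop training once ε reaches the `epsilon_max` defined earlier.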
Code can be found in the `deel/lipdp` folder. The documentation can be built with `mkdocs build` and served with `mkdocs serve` (or by opening `site/index.html`). Experiments were done using the code in the `experiments` folder.
Other tools to perform DP-training include:
- tensorflow-privacy in TensorFlow
- Opacus in PyTorch
- jax-privacy in JAX
The creators thank the whole DEEL team for its support, and Aurélien Bellet for his guidance.
The library was created by Louis Béthune, Thomas Masséna (during an internship at DEEL), and Thibaut Boissin.
If you find this work useful for your research, please consider citing it:
```bibtex
@inproceedings{
  bethune2024dpsgd,
  title={{DP}-{SGD} Without Clipping: The Lipschitz Neural Network Way},
  author={Louis B{\'e}thune and Thomas Massena and Thibaut Boissin and Aur{\'e}lien Bellet and Franck Mamalet and Yannick Prudent and Corentin Friedrich and Mathieu Serrurier and David Vigouroux},
  booktitle={The Twelfth International Conference on Learning Representations},
  year={2024},
  url={https://openreview.net/forum?id=BEyEziZ4R6}
}
```
The package is released under the MIT license.