Restructuring the tool to privacy_meter #66

Merged: 80 commits into privacytrustlab:master on May 13, 2022

Conversation

amad-person (Contributor)

Overview

This PR contains changes for the revamp of the tool 🎉.

Users will now follow this workflow to use Privacy Meter (a code sketch of these steps follows the list):

  1. Create the required target and reference datasets and wrap them in Dataset objects so Privacy Meter can use them.
  2. Create the target and reference models and wrap them in Model objects to make them compatible with Privacy Meter.
  3. Construct InformationSource objects that determine which models are used for querying which splits of the datasets. These objects are used to compute the signals required by the metric.
  4. Construct a Metric object that takes in the target and reference information sources and signals, e.g. ModelLoss. A hypothesis test function can also be provided if the metric uses one. Users who don't want to construct their own metric can use a default version instead.
  5. Run the audit by wrapping everything in an Audit object and calling its .run() method.
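
A minimal end-to-end sketch of these five steps is below. The class names (Dataset, Model, InformationSource, Metric, ModelLoss, Audit) come from this PR, but the import paths, constructor arguments, and the placeholder variables (train_data, test_data, reference_data, trained_target, trained_reference, loss_fn) are assumptions for illustration and may not match the actual API; see the hosted docs linked below for the real interface.

```python
# Hedged sketch of the workflow described above; argument names are assumed.
from privacy_meter.audit import Audit
from privacy_meter.dataset import Dataset
from privacy_meter.information_source import InformationSource
from privacy_meter.information_source_signal import ModelLoss
from privacy_meter.metric import Metric
from privacy_meter.model import Model

# 1. Wrap the raw data splits so Privacy Meter can query them.
target_dataset = Dataset(data_dict={"train": train_data, "test": test_data},
                         default_input="x", default_output="y")
reference_dataset = Dataset(data_dict={"train": reference_data},
                            default_input="x", default_output="y")

# 2. Wrap the trained models (trained_target / trained_reference stand in
#    for whatever framework objects the user already has).
target_model = Model(model_obj=trained_target, loss_fn=loss_fn)
reference_model = Model(model_obj=trained_reference, loss_fn=loss_fn)

# 3. Tie models to dataset splits; signals are computed from these.
target_info_source = InformationSource(models=[target_model],
                                       datasets=[target_dataset])
reference_info_source = InformationSource(models=[reference_model],
                                          datasets=[reference_dataset])

# 4. Choose a metric over a signal such as ModelLoss. A custom hypothesis
#    test function could also be passed here.
metric = Metric(target_info_source=target_info_source,
                reference_info_source=reference_info_source,
                signals=[ModelLoss()])

# 5. Wrap everything in an Audit and run it.
audit = Audit(metrics=[metric],
              target_info_sources=[target_info_source],
              reference_info_sources=[reference_info_source])
results = audit.run()
```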

Tasks for the reviewers

Ordering the tasks in terms of how deep you have to dive into the code:

  1. Running the tutorial notebooks in the docs/ folder and commenting on whether the new API was easy to understand and use.
  2. Going through the new code to understand the components of the tool i.e. Audit, Metric, InformationSource, Signal, Model, Dataset and leaving comments/suggestions w.r.t. the architecture design.
  3. Adding a new metric, e.g. ReferenceMetric from the Enhanced MIA paper. This will help us see how easy it is for users to add their own attacks to the tool (a hypothetical skeleton follows this list).
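
For reviewers attempting task 3, a hypothetical skeleton of what such a metric might look like; the Metric import path and the run_metric entry point are assumptions about the interface, not taken from the actual code:

```python
# Hypothetical skeleton for task 3; base-class interface is assumed.
from privacy_meter.metric import Metric

class ReferenceMetric(Metric):
    """Reference-model membership inference in the spirit of the Enhanced
    MIA paper: score each datapoint by comparing the target model's loss on
    it against the losses of reference models that did not train on it."""

    def run_metric(self):
        # 1. Query the target and reference information sources for
        #    ModelLoss signals on the audited datapoints.
        # 2. Compare each target loss to the reference loss distribution.
        # 3. Return per-point membership scores/decisions.
        raise NotImplementedError("sketch only")
```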

The temporary API documentation website is hosted here: https://privacy-meter-doc-test-2.web.app/privacy_meter.html

mireshghallah (Contributor) commented on Apr 26, 2022

Rest of the Review for Task 1:

For the developer guide, maybe let's create a table of contents and section numbering so that it's easier to navigate. Also, I'm not 100% sure about this, but I feel like it might be better to have the building and publishing section first, then the documentation?

Maybe it would be a good idea to add some explanation of what OpenVINO is to the openvino_models.ipynb notebook.

Minor: in the shadow_metric.ipynb notebook, let's limit the number of prints in the 13th cell? Right now people really have to scroll far.

One overall suggestion I have is that maybe we should have scripts (bash/Python) that people can run, like

attack_causal_lm.py --target_model_checkpoint finetuned_gpt2 --attack_type ref_based 

I see that the notebooks kind of do this, but sometimes having scripts makes it easier for people to run and adjust things.
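
A hypothetical sketch of such a script (the file name and flags mirror the example command above; the attack body is a stub, since it would just build the Dataset/Model/InformationSource/Metric/Audit objects described in the PR and call audit.run()):

```python
#!/usr/bin/env python
# attack_causal_lm.py -- hypothetical sketch of the suggested script.
import argparse

def main():
    parser = argparse.ArgumentParser(
        description="Run a membership inference attack against a causal LM.")
    parser.add_argument("--target_model_checkpoint", required=True,
                        help="Path/name of the fine-tuned target model, "
                             "e.g. finetuned_gpt2")
    parser.add_argument("--attack_type", default="ref_based",
                        choices=["ref_based", "shadow"],  # illustrative choices
                        help="Which attack pipeline to run")
    args = parser.parse_args()

    # Placeholder: construct datasets, models, information sources, the
    # metric selected by --attack_type, and the Audit, then run it.
    print(f"Attacking {args.target_model_checkpoint} "
          f"with a {args.attack_type} attack")

if __name__ == "__main__":
    main()
```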

Task 2:

  1. In information_source_signal.py, I think ModelOutput(Signal) might be a bit ambiguous; something like ModelLogits might be better? (Just a suggestion. The thing is, the output could be anything really, so it's a bit unclear.) A rough sketch of the rename follows this list.
  2. For dataset.py, I feel like we need separate documentation or more comments where we actually explain how people can use it for different data modalities, such as tabular, images, and text. I think it is hard to figure out right now.
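
To make point 1 concrete, the rename could look roughly like this. Signal is the base class in information_source_signal.py, but its call signature and the model.get_logits() helper used here are assumptions, not the real API:

```python
# Rough sketch of the rename suggested in point 1; interface is assumed.
from privacy_meter.information_source_signal import Signal

class ModelLogits(Signal):
    """Raw (pre-softmax) outputs of a model on given datapoints: a more
    specific name than ModelOutput, which could mean any kind of output."""

    def __call__(self, models, datasets):
        # One array of logits per (model, dataset) pair.
        return [model.get_logits(dataset)
                for model, dataset in zip(models, datasets)]
```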

rzshokri (Member)

Privacy Meter 1.0

rzshokri merged commit f61d734 into privacytrustlab:master on May 13, 2022