
Trying to better understand the framework behavior #881

Closed
fbarusso opened this issue Apr 10, 2023 · 2 comments
@fbarusso

I'm sorry if this is not the appropriate place to talk about this, I'm happy to take this discussion to somewhere else if needed.

As the title says, I'm trying to better understand TF-Encrypted on a technical level. I found your framework to be an elegant solution and chose to talk about it in a paper I'm currently working on.

I'm not an expert in either machine learning or cryptography, but I would like to present a precise and clear explanation of TF-Encrypted anyway.

My main difficulty is understanding each party and its role.

This is my understanding of the benchmark/training/private_training example:

  1. Parties: we have 2 data owners (training and prediction) and the 3 servers necessary for the ABY3 protocol.
  2. The Train Data Owner encrypts the training dataset with additive secret sharing, splitting it into three shares (one for each ABY3 server). Each server receives two of those shares (a replication step), enabling it to perform the necessary operations while limiting the system to tolerating at most 1 malicious party. The owner does the same for the labels and shares all of this data with the three servers. So it is possible to say that the Train Data Owner encrypts the input and sends the whole of it to the 3 servers.
  3. On the 3 ABY3 servers, it is possible to privately train the model using the encrypted data and the defined ABY3 operations (addition, multiplication, and a piecewise polynomial that replaces the sigmoid), together with a randomly initialized, secret-shared weight vector. In the end, we have an encrypted model trained on the encrypted data (parameters like the number of epochs and the batch size can be set by the user).
  4. For the evaluation step, the Prediction Data Owner encrypts the prediction data the same way the training data was encrypted and sends it to the 3 ABY3 servers. The servers then run private predictions on the encrypted data with the previously trained model, after which the servers have access to the (plaintext) testing results. It would also be possible (although it is not present in the example) to send an encrypted result to the testing party, who could decrypt it.
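To make step 2 concrete, here is a minimal plain-Python sketch of the replicated 2-out-of-3 additive sharing described above, plus the communication-free addition mentioned in step 3. The function names (`share`, `distribute`, `add_local`, `reconstruct`) are my own, not TF-Encrypted's API, and real ABY3 values are fixed-point encodings of reals; integers are used here for simplicity.

```python
import secrets

MOD = 2 ** 64  # share arithmetic over the ring Z_{2^64}, as in ABY3

def share(x):
    """Split x into three additive shares with x0 + x1 + x2 = x (mod 2^64)."""
    x0 = secrets.randbelow(MOD)
    x1 = secrets.randbelow(MOD)
    x2 = (x - x0 - x1) % MOD
    return [x0, x1, x2]

def distribute(shares):
    """Replicated sharing: server i holds (x_i, x_{i+1 mod 3}), so any two
    servers jointly hold all three shares, but one server alone sees only
    two uniformly random values and learns nothing about x."""
    x0, x1, x2 = shares
    return [(x0, x1), (x1, x2), (x2, x0)]

def add_local(servers_x, servers_y):
    """Secure addition needs no communication: each server adds its two
    share components independently."""
    return [((a0 + b0) % MOD, (a1 + b1) % MOD)
            for (a0, a1), (b0, b1) in zip(servers_x, servers_y)]

def reconstruct(servers):
    """Summing each server's first component recovers x0 + x1 + x2 = x."""
    return sum(pair[0] for pair in servers) % MOD

servers_x = distribute(share(42))
servers_y = distribute(share(100))
servers_z = add_local(servers_x, servers_y)
print(reconstruct(servers_z))  # 142
```

Multiplication, unlike addition, requires a round of communication between the servers, which is where the interactive part of the ABY3 protocol comes in.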

Is this correct? Are there any important steps that are worth mentioning? Is this process better detailed in any work?

Thank you in advance.

@zjn-code
Contributor

The ABY3 protocol in TFE is an implementation of this paper; you could check that for more details.
As for your understanding of the example, most of it is right; the points you may have gotten wrong are listed below:
2. "limiting the system to handle at most 1 malicious party": The ABY3 protocol does provide security against 1 malicious party, but TFE doesn't implement that; TFE only supports semi-honest security for now.
4. "The servers then have access to (plaintext) testing results": This example is used to benchmark training time, so we reveal the testing results to the servers for ease of coding. In practice, only the Prediction Data Owner can get the (plaintext) testing results.
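The reveal-to-owner step described in point 4 can be sketched as follows: each server sends one share component of the result to the Prediction Data Owner, who sums them locally. This is a hand-written illustration, not TF-Encrypted code, and the variable names are my own; a fixed-point decoding step, omitted here, would follow in a real system.

```python
import secrets

MOD = 2 ** 64  # shares live in the ring Z_{2^64}

# Shares of an inference result, as the three ABY3 servers would hold
# them after private prediction: server i holds (r_i, r_{i+1 mod 3}).
result = 142
r0 = secrets.randbelow(MOD)
r1 = secrets.randbelow(MOD)
r2 = (result - r0 - r1) % MOD
servers = [(r0, r1), (r1, r2), (r2, r0)]

# Reveal-to-owner: each server sends only its first component to the
# Prediction Data Owner. Together these are exactly r0, r1, r2, so the
# owner can reconstruct, while no individual server sees the plaintext.
received = [pair[0] for pair in servers]
plaintext = sum(received) % MOD
print(plaintext)  # 142
```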

@fbarusso
Author

Thank you for clarifying.
