
The impacts of BatchNorm instead of LayerNorm #1

Closed
Ye-D opened this issue May 31, 2023 · 2 comments
Ye-D commented May 31, 2023

In the CrypTen-based MPCFormer models, such as MPC-Bert, I noticed that LayerNorm is replaced by BatchNorm (since CrypTen does not currently support LayerNorm). Will this modification affect Bert's performance (e.g., accuracy)? Since the source code does not include the model-loading scripts, it would be difficult for me to check this experimentally. Thanks for your help.
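(Not from the MPCFormer codebase; a minimal NumPy sketch of why the swap could matter: LayerNorm normalizes each sample over its feature axis, while BatchNorm normalizes each feature over the batch axis, so the two produce different outputs in general.)

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))  # (batch, features)

# LayerNorm: per-sample mean/variance over the feature axis.
ln = (x - x.mean(axis=1, keepdims=True)) / np.sqrt(x.var(axis=1, keepdims=True) + 1e-5)

# BatchNorm (using batch statistics here for simplicity):
# per-feature mean/variance over the batch axis.
bn = (x - x.mean(axis=0, keepdims=True)) / np.sqrt(x.var(axis=0, keepdims=True) + 1e-5)

# Each LayerNorm row has mean ~0; each BatchNorm column has mean ~0.
print(np.allclose(ln.mean(axis=1), 0, atol=1e-6))  # True
print(np.allclose(bn.mean(axis=0), 0, atol=1e-6))  # True
# The two normalizations generally disagree elementwise.
print(np.allclose(ln, bn))  # False
```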

DachengLi1 (Owner) commented

@Ye-D thanks for the great question! We measure accuracy in plain PyTorch and only system throughput with CrypTen, as reported in the paper. We are implementing modules such as LayerNorm in CrypTen so that accuracy can also be measured there.

By model loading, do you mean something like this?


Ye-D commented Jun 1, 2023

@DachengLi1 Thanks for the reply!

Ye-D closed this as completed Jun 1, 2023