How to draw flatness curve in Figure 3? #11

FrankZhangRp · 2022-04-30T10:28:57Z

Hi,
Thank you so much for providing this repo, the work is awesome!
How can we reproduce the loss gap curve in Figure 3 of the paper? How is gamma added to the model parameters, and what distance metric is used on the X-axis? I flattened the model parameter dict into one vector and added a noise vector with norm 1.0, and got a loss gap of about 0.2 on the p domain test. I must have made a mistake in the Monte Carlo approximation sampling.
Thanks a lot!

khanrc · 2022-05-02T09:52:43Z

Hi, thanks for your interest in our study.

We first sample a unit direction vector and compute the loss gap by shifting the model parameters along it by the radius gamma. The parameter difference is gamma * unit_direction_vector. The reported value is averaged over 100 sampled direction vectors. The X-axis is gamma.

Simple PyTorch-style pseudocode:

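import torch

# num_parameters, get_noised_model, evaluate, model and gamma_list are placeholders, not defined here
n_params = num_parameters(model)  # total number of scalar parameters
direction_vector = torch.randn(n_params)  # random Gaussian direction
unit_direction_vector = direction_vector / torch.norm(direction_vector)  # rescale to norm 1
base_loss = evaluate(model)  # loss of the unperturbed model
for gamma in gamma_list:
  noised_model = get_noised_model(model, unit_direction_vector * gamma)  # shift parameters by gamma along the direction
  loss_gap = evaluate(noised_model) - base_loss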
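
For readers who want something runnable, the pseudocode above could be fleshed out roughly as follows. This is only a sketch under assumptions, not the authors' code: `get_noised_model` is filled in here with PyTorch's `parameters_to_vector` / `vector_to_parameters`, and `flatness_curve`, `toy_model`, and `toy_evaluate` are hypothetical names made up for illustration. It averages the loss gap over several sampled directions, as described above (100 in the paper's setting, 10 in the toy call).

```python
import copy

import torch
from torch import nn
from torch.nn.utils import parameters_to_vector, vector_to_parameters


def get_noised_model(model, offset):
    """Return a copy of `model` whose flattened parameters are shifted by `offset`.

    Assumption: only nn.Parameters are perturbed; buffers such as
    BatchNorm running statistics are copied unchanged.
    """
    noised = copy.deepcopy(model)
    with torch.no_grad():
        flat = parameters_to_vector(noised.parameters())
        vector_to_parameters(flat + offset, noised.parameters())
    return noised


@torch.no_grad()
def flatness_curve(model, evaluate, gamma_list, n_directions=100):
    """Average loss gap for each gamma over `n_directions` random unit directions."""
    flat = parameters_to_vector(model.parameters())
    base_loss = evaluate(model)
    gaps = torch.zeros(len(gamma_list))
    for _ in range(n_directions):
        direction = torch.randn_like(flat)
        unit_direction = direction / direction.norm()
        for i, gamma in enumerate(gamma_list):
            noised = get_noised_model(model, unit_direction * gamma)
            gaps[i] += evaluate(noised) - base_loss
    return (gaps / n_directions).tolist()


# Toy usage: a small linear model and a fixed random batch stand in
# for the real network and the real evaluation set.
torch.manual_seed(0)
toy_model = nn.Linear(4, 1)
x, y = torch.randn(64, 4), torch.randn(64, 1)


def toy_evaluate(m):
    # Hypothetical evaluation function: mean squared error on the fixed batch.
    with torch.no_grad():
        return nn.functional.mse_loss(m(x), y).item()


curve = flatness_curve(toy_model, toy_evaluate, gamma_list=[0.0, 0.5, 1.0], n_directions=10)
```

Plotting `curve` against `gamma_list` gives one flatness curve. Note that a noise vector with norm 1.0 corresponds to the single point gamma = 1.0 on the X-axis, and a single sampled direction is a noisy estimate compared with the average over many directions.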

FrankZhangRp · 2022-05-02T09:55:17Z

got it! Very clear! Thanks a lot!

Wang-pengfei · 2022-10-26T02:36:19Z

got it! Very clear! Thanks a lot!

The loss gap I get seems to be wrong. Did you solve this problem?

brisker · 2022-11-18T06:10:19Z

About the Figure 3 plotting mentioned here:

Is the model used for plotting Figure 3 the final converged model, or a model taken during training?
Is weight noise added to all the parameters of every layer?
@khanrc

khanrc · 2022-11-18T12:49:45Z

@brisker

Three converged models are used. Specifically, the models converge before 1000 steps (see Fig. 5), and the models from steps 2500, 3500, and 4500 are used.
Yes, the noise is added to all the parameters of every layer.

khanrc closed this as completed May 2, 2022
