
Possible bugs #61

Closed
hailuu684 opened this issue May 19, 2023 · 8 comments

Comments

@hailuu684

Hello,
I am testing your repo and I got this error. I am not sure if this is an issue in different torch versions or incompatibilities.

I got an error in d.grad.add_(g). It says 'NoneType' object has no attribute 'add_'. I printed the type of 'd', it is obviously a Tensor type. I do not understand why can cause this problem.
image

@hailuu684
Author

May I ask a question? Can I use the distilled dataset you have already published to train with the normal procedure, i.e., the standard training loop shown in the image below? Or do I have to write a customized network as you did?

[screenshot of a standard training loop]

@ssnl
Owner

ssnl commented May 19, 2023

You may use the distilled dataset however you want. Whether to reparametrize the network shouldn't really affect results. However, other changes in training (e.g., network architectures, learning rates, epochs, etc.) can affect the results non-trivially.
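The "normal procedure" the question refers to can be sketched as a plain PyTorch training loop over the distilled tensors. This is only an illustration: the tensor names, shapes, model, and hyperparameters below are placeholders, not the repo's published data or settings.

```python
import torch
import torch.nn as nn

# Placeholder stand-ins for a published distilled dataset (illustrative shapes).
distilled_data = torch.randn(10, 1, 28, 28)
distilled_labels = torch.randint(0, 10, (10,))

# Any standard network works; no reparametrization is required.
model = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 10))
opt = torch.optim.SGD(model.parameters(), lr=0.01)
loss_fn = nn.CrossEntropyLoss()

w0 = model[1].weight.detach().clone()  # snapshot to show training happens

for epoch in range(3):  # a few epochs, just to illustrate the loop
    opt.zero_grad()
    loss = loss_fn(model(distilled_data), distilled_labels)
    loss.backward()
    opt.step()
```

As noted above, the architecture, learning rate, and epoch count chosen here would affect results non-trivially in practice.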

@ssnl
Owner

ssnl commented May 19, 2023

Re your original question: that shouldn't happen if you are running the code as-is, since there is this line:

p.grad = torch.zeros_like(p)
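For context, pre-filling `.grad` with zeros is exactly what makes a later in-place `.grad.add_(g)` safe, because a fresh leaf tensor's `.grad` is `None`. A minimal standalone illustration (not the repo's code):

```python
import torch

p = torch.randn(3, requires_grad=True)
assert p.grad is None          # default: no gradient buffer allocated yet

p.grad = torch.zeros_like(p)   # the line the repo relies on
g = torch.ones_like(p)
p.grad.add_(g)                 # safe now: .grad is a real tensor, not None
```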

@hailuu684
Author

Thank you for the quick reply. I haven't modified the code but still got this error. Can you suggest where I should take a look? Honestly, your code is a bit advanced for me.

[screenshot of the unmodified code and the resulting error]

@ssnl
Owner

ssnl commented May 19, 2023

It might be a torch version issue then, because I haven't seen this before. You can try changing the code so that:

  • if d.grad is None, g is assigned to d.grad;
  • otherwise, g is added to it in place.

I haven't experienced the error you are seeing, though, so I can't guarantee correctness.
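The two bullets above can be sketched as a small guard. The names `d` and `g` follow the thread; this is a hedged suggestion for working around the `NoneType` error, not the repo's exact code.

```python
import torch

def accumulate_grad(d, g):
    """Accumulate g into d.grad, tolerating a None gradient buffer."""
    if d.grad is None:
        d.grad = g.detach().clone()   # first accumulation: just assign a copy
    else:
        d.grad.add_(g)                # otherwise add in place as before

# Usage: two accumulations leave d.grad == 2 * g.
d = torch.randn(2, requires_grad=True)
g = torch.ones(2)
accumulate_grad(d, g)
accumulate_grad(d, g)
```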

@hailuu684
Author

Thank you for the instructions; here is the updated code for anyone hitting the same error:

[screenshot of the updated code]

@hailuu684
Author

May I ask another question? In your code, are you using real data to generate the distilled dataset, and then training on the distilled dataset? In the function `init_data_optim`, I see that `distill_label` and `distill_data` are generated randomly. I understand this is just the initialization, but which part of the code updates the distilled data? Would you mind pointing it out for me? Thank you very much.

[screenshot of init_data_optim]

@ssnl
Owner

ssnl commented May 20, 2023

@hailuu684 In this code, distilled data are initialized as random noise. But yes, many works show that initializing from real images can work better in certain cases. It is not implemented here.
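The idea of learnable distilled data initialized from noise can be sketched as follows. This is only a hedged illustration of the concept, not the repo's `init_data_optim` implementation: the distilled tensor is itself a leaf with `requires_grad=True`, so an optimizer over it updates the pixels directly.

```python
import torch

# Distilled data starts as random noise, but is a trainable leaf tensor.
distill_data = torch.randn(10, 1, 28, 28, requires_grad=True)
before = distill_data.detach().clone()

# An optimizer over the data tensor itself (not over network weights).
data_opt = torch.optim.SGD([distill_data], lr=0.1)

# One illustrative outer step; any scalar loss of the data would do here.
loss = distill_data.pow(2).mean()
loss.backward()
data_opt.step()                 # the noise tensor itself is updated here
```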

@ssnl ssnl closed this as completed May 20, 2023