Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

图像生成任务的默认设置问题 #20

Closed
Johnson-yue opened this issue Jun 21, 2024 · 2 comments
Closed

图像生成任务的默认设置问题 #20

Johnson-yue opened this issue Jun 21, 2024 · 2 comments

Comments

@Johnson-yue
Copy link

作者你好,非常感谢你放出来CVPR2024的工作源码,同时也提供了中文的论文方便我们理解。我在使用celebA 数据集 64x64的图像进行图像生成任务的时候, 发现默认的设置有些疑问导致训练不成功。

在train.py 的文件中默认设置的object 是 pred_res_noise 然后用两个Unet来预测。 但是 test_res_or_noise 设置的为 ”noise“, 在训练的时候只使用了 一个Unet的预测结果,这个是为什么?? 而且因为设置的的2个Unet,所以model_output = (0, xx),所以训练的时候,会报错 int 没有 size这个属性。

@nachifur
Copy link
Owner

nachifur commented Jun 22, 2024

可以尝试修改:

  1. test_res_or_noise = "res_noise" (测试阶段可以修改test_res_or_noise = "noise" or “res” )

  2. [self.alphas_cumsum[t]*self.num_timesteps, self.betas_cumsum[t]*self.num_timesteps]] -> [t,t] (in L852 and L1292)

@Johnson-yue
Copy link
Author

可以尝试修改:

  1. test_res_or_noise = "res_noise" (测试阶段可以修改test_res_or_noise = "noise" or “res” )
  2. [self.alphas_cumsum[t]*self.num_timesteps, self.betas_cumsum[t]*self.num_timesteps]] -> [t,t] (in L852 and L1292)

非常感谢,解决了我的疑问。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants