How to train it? #3

Open
zhangbaijin opened this issue Oct 31, 2022 · 6 comments

Comments

@zhangbaijin
No description provided.

@ProCommiter
Hi, I was wondering if the authors will be releasing on how to train the models for our specific datasets or is it not necessary at all? Thanks.

@TingdiRen
I'm not the official author :)
What do you mean by a specific dataset? A dataset for super-resolution, deblurring, or something else?
In fact, you only need to train a U-Net as an unconditional diffusion model on a dataset of target images (e.g. sharp, deblurred images). Besides, if you have any differentiable network that acts as the forward operator, you can use the author's method to solve the inverse problem. That said, the actual performance did not meet my expectations, but I still appreciate the author's contribution.
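
To make the "differentiable forward operator" idea concrete for the super-resolution case, here is a minimal sketch in plain Python. It assumes the degradation is simple average-pool downsampling, which is an assumption made for illustration and not necessarily the operator used in this repository. The names `downsample`, `residual`, and `residual_grad` are hypothetical. In practice the operator would be written in an autodiff framework such as PyTorch so that the gradient is computed automatically; the hand-written gradient below only shows what differentiability provides.

```python
def downsample(x, f):
    """Forward operator A(x): average-pool a 2D image (list of rows) by factor f."""
    h, w = len(x), len(x[0])
    if h % f or w % f:
        raise ValueError("image height and width must be divisible by f")
    return [
        [sum(x[i + a][j + b] for a in range(f) for b in range(f)) / (f * f)
         for j in range(0, w, f)]
        for i in range(0, h, f)
    ]


def residual(x, y, f):
    """Measurement error ||y - A(x)||^2 for a low-resolution observation y."""
    ax = downsample(x, f)
    return sum((y[i][j] - ax[i][j]) ** 2
               for i in range(len(y)) for j in range(len(y[0])))


def residual_grad(x, y, f):
    """Gradient of the residual w.r.t. x: each pixel receives
    -2 * (y - A(x)) / f^2 from the block it belongs to."""
    ax = downsample(x, f)
    return [
        [-2.0 * (y[i // f][j // f] - ax[i // f][j // f]) / (f * f)
         for j in range(len(x[0]))]
        for i in range(len(x))
    ]
```

The gradient of the residual with respect to the image is the quantity a guidance-style solver needs from the forward operator, which is why the operator has to be differentiable.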

@ProCommiter
For super-resolution (SR)!

@DPS2022
Owner

DPS2022 commented Dec 13, 2022

Hello @ProCommiter @zhangbaijin,

As @TingdiRen mentioned, you can use any diffusion model that fits the VP-SDE (variance-preserving SDE) formulation, which is the assumption made in the paper.
Our code is based on OpenAI's guided-diffusion.
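
As a rough illustration of the variance-preserving assumption mentioned above, the sketch below builds a discrete cosine noise schedule and applies the forward noising step x_t = sqrt(ᾱ_t)·x_0 + sqrt(1 − ᾱ_t)·ε, where the squared coefficients sum to one. The function names `cosine_alpha_bar`, `alphas_cumprod`, and `q_sample` are hypothetical, and the offset `s=0.008` and clipping value `0.999` are the commonly used defaults for this schedule rather than values confirmed in this thread.

```python
import math
import random


def cosine_alpha_bar(t, s=0.008):
    """Cumulative signal level at continuous time t in [0, 1] (cosine schedule)."""
    return math.cos((t + s) / (1 + s) * math.pi / 2) ** 2


def alphas_cumprod(num_steps, max_beta=0.999):
    """Discretize the schedule: derive per-step betas, then accumulate
    the running product of (1 - beta) to get alpha_bar for each step."""
    out, prod = [], 1.0
    for i in range(num_steps):
        t1, t2 = i / num_steps, (i + 1) / num_steps
        beta = min(1 - cosine_alpha_bar(t2) / cosine_alpha_bar(t1), max_beta)
        prod *= 1 - beta
        out.append(prod)
    return out


def q_sample(x0, alpha_bar_t, rng):
    """Variance-preserving forward noising of a flat list of pixel values."""
    signal = math.sqrt(alpha_bar_t)
    noise = math.sqrt(1 - alpha_bar_t)
    return [signal * v + noise * rng.gauss(0, 1) for v in x0]


if __name__ == "__main__":
    ab = alphas_cumprod(4000)
    rng = random.Random(0)
    print(q_sample([0.5, -0.5, 0.0], ab[len(ab) // 2], rng))
```

A model trained with a schedule of this form is the kind of unconditional prior the comment above refers to; other variance-preserving schedules (for example a linear beta schedule) fit the same assumption.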

Thus, to use this repository directly for other tasks, we recommend training the diffusion model on your own dataset with https://github.com/openai/guided-diffusion.

@xiximelon
xiximelon commented Oct 12, 2023

@DPS2022 Hello! Could you please share the parameter settings you used to train the diffusion model? The settings vary a lot across the examples in https://github.com/openai/guided-diffusion.
Here is an example for unconditional ImageNet-64. Could you please provide the parameters used to train the FFHQ model?
MODEL_FLAGS="--image_size 64 --num_channels 128 --num_res_blocks 3 --learn_sigma True"
DIFFUSION_FLAGS="--diffusion_steps 4000 --noise_schedule cosine"
TRAIN_FLAGS="--lr 1e-4 --batch_size 128"
Thanks!
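
For readers wondering how flag groups like these are consumed, the following sketch assembles the training command that the guided-diffusion README documents (`python scripts/image_train.py --data_dir ... $MODEL_FLAGS $DIFFUSION_FLAGS $TRAIN_FLAGS`). The flag values are the ImageNet-64 example quoted above, not the FFHQ settings, which are not given anywhere in this thread. The helper name `build_train_command` and the `path/to/images` placeholder are hypothetical.

```python
import shlex

# Example values quoted in this thread (unconditional ImageNet-64).
# These are NOT the FFHQ settings, which the thread does not provide.
MODEL_FLAGS = "--image_size 64 --num_channels 128 --num_res_blocks 3 --learn_sigma True"
DIFFUSION_FLAGS = "--diffusion_steps 4000 --noise_schedule cosine"
TRAIN_FLAGS = "--lr 1e-4 --batch_size 128"


def build_train_command(data_dir):
    """Return the argv list for guided-diffusion's image training script."""
    argv = ["python", "scripts/image_train.py", "--data_dir", data_dir]
    for group in (MODEL_FLAGS, DIFFUSION_FLAGS, TRAIN_FLAGS):
        argv += shlex.split(group)
    return argv


if __name__ == "__main__":
    # Print the command instead of running it, so it can be reviewed first.
    print(shlex.join(build_train_command("path/to/images")))
```

The printed command is meant to be run from the root of a guided-diffusion checkout; image size, channel count, and batch size would need to be adapted to the target dataset and hardware.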

@wst2333
wst2333 commented Oct 17, 2023

Bro, have you figured it out? If you have, could you let me know? /(ㄒoㄒ)/~~
