Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why the FCN needs a pre-trained weight? #5

Open
machanic opened this issue Jun 21, 2020 · 7 comments
Open

Why the FCN needs a pre-trained weight? #5

machanic opened this issue Jun 21, 2020 · 7 comments

Comments

@machanic
Copy link

I notice that you use chainer.serializers.load_npz('../denoise_with_convGRU/model/pretrained_15.npz', net) to load a pre-trained FCN weight? Why needs the pre-trained FCN weight?
I currently translate the code into PyTorch version, Can I train the FCN directly without loading pretrained weights?
Thank you!

@rfuruta
Copy link
Owner

rfuruta commented Jun 21, 2020

Thank you for your question.

Why needs the pre-trained FCN weight?

The pre-trained weights are used in order to make the training of the network stable.

Can I train the FCN directly without loading?

Maybe yes, but the training will be unstable especially at the beginning.

@machanic
Copy link
Author

machanic commented Jun 24, 2020

@rfuruta I have an idea of my paper to use your code, but the reward is only a scalar value for the whole image, rather than the pixel-wise rewards. Can I use PixelRL? Maybe the difficult is that many agent & action has to determine one reward.
Thank you.

@rfuruta
Copy link
Owner

rfuruta commented Jun 26, 2020

I do not think PixelRL will work in such a situation. Training PixelRL with one reward is a remaining problem.

@machanic
Copy link
Author

@rfuruta Why doesn't work? Does it because that so many actions have to be made to obtain one reward?
I have implemented that idea, but it works poorly, maybe my implementation is incorrect?

@rfuruta
Copy link
Owner

rfuruta commented Jun 30, 2020

I guessed it would not work because the reward value is determined by many actions.
But, my guess may be wrong if your idea is working.

@dingdingbin
Copy link

@machanic Hi, can you share us your code of PyTorch version of PixelRL? Thank you very much.

@Big-OrangeCat
Copy link

我注意到您使用chainer.serializers.load_npz('../denoise_with_convGRU/model/pretrained_15.npz', net)加载预训练的 FCN 重吗?为什么需要预训练的 FCN 重? 我现在将代码翻译成 PyTorch 版本,我可以直接训练 FCN 而不是加载预训练的重吗? 谢谢!
can you share us your code of PyTorch version of PixelRL? Thank you very much.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants