Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about learning GFN on the checkerboard dataset #1

Closed
yangysc opened this issue Jun 29, 2022 · 0 comments
Closed

Question about learning GFN on the checkerboard dataset #1

yangysc opened this issue Jun 29, 2022 · 0 comments

Comments

@yangysc
Copy link

yangysc commented Jun 29, 2022

Hi, Dinghuai

Thanks for your great work.

I have a question about reproducing the result on the checkerboard dataset using the GFlowNet_Randf_TB. The result I obtained was not meaningful. However, the successful result (as in the paper) can be obtained by using the learned backward policy PB GFlowNet_LearnedPb_TB.

image

I am a little confused why a random backward policy cannot work on the checkerboard dataset. Because I think we can always find a right forward policy corresponding to a certain backward policy. Maybe I missed something to tune.

Any comment would be quite helpful. Thanks in advance!

Best,
Shanchao

@yangysc yangysc closed this as completed Sep 4, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant