You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is N Time Monte Carlo sampling with n = 16 in the code. But how are the different samples generated? given_num represents how many tokens to use from the input, and irepresents the i'th sample. Why are the samples different for different values of i? Is the rollout network being updated somewhere within call to get_reward and I'm missing it? I also don't see where the randomness is coming in for the Monte Carlo estimation of the partial sequence reward.
From my examination of the code, the network doesn't get updated and the session parameters are the same so I'm not sure how different samples are being generated.
Can someone help me understand how a) different samples are being generated, b) where is the randomness coming from, c) if the rollout network has the same parameters as the Generator network, how is it generating different samples than the generator?
Any help is greatly appreciated! Thank you for providing this code it has been very helpful to me.
The text was updated successfully, but these errors were encountered:
In this loop:
SeqGAN/rollout.py
Line 79 in 5f2c0a5
This is N Time Monte Carlo sampling with n = 16 in the code. But how are the different samples generated?
given_num
represents how many tokens to use from the input, andi
represents the i'th sample. Why are the samples different for different values of i? Is the rollout network being updated somewhere within call toget_reward
and I'm missing it? I also don't see where the randomness is coming in for the Monte Carlo estimation of the partial sequence reward.From my examination of the code, the network doesn't get updated and the session parameters are the same so I'm not sure how different samples are being generated.
Can someone help me understand how a) different samples are being generated, b) where is the randomness coming from, c) if the rollout network has the same parameters as the Generator network, how is it generating different samples than the generator?
Any help is greatly appreciated! Thank you for providing this code it has been very helpful to me.
The text was updated successfully, but these errors were encountered: