Question about Rollout #19

nathan-whitaker · 2017-06-13T18:16:45Z

In this loop:

Line 79 in 5f2c0a5

for i in range(rollout_num):

This is N Time Monte Carlo sampling with n = 16 in the code. But how are the different samples generated? given_num represents how many tokens to use from the input, and irepresents the i'th sample. Why are the samples different for different values of i? Is the rollout network being updated somewhere within call to get_reward and I'm missing it? I also don't see where the randomness is coming in for the Monte Carlo estimation of the partial sequence reward.

From my examination of the code, the network doesn't get updated and the session parameters are the same so I'm not sure how different samples are being generated.

Can someone help me understand how a) different samples are being generated, b) where is the randomness coming from, c) if the rollout network has the same parameters as the Generator network, how is it generating different samples than the generator?

Any help is greatly appreciated! Thank you for providing this code it has been very helpful to me.

The text was updated successfully, but these errors were encountered:

nathan-whitaker · 2017-06-13T18:30:41Z

I think I found what I was looking for.

SeqGAN/rollout.py

Line 58 in 5f2c0a5

    
           next_token = tf.cast(tf.reshape(tf.multinomial(log_prob, 1), [self.batch_size]), tf.int32)

This line contains a call to https://www.tensorflow.org/api_docs/python/tf/multinomial

Which performs a sample over the logits generated from the network instead of taking the max like the generator network does.

guotong1988 · 2017-07-29T13:16:54Z

Great investigation！

guotong1988 · 2017-08-02T09:21:55Z

I still don't know exactly what N Time Monte Carlo sampling is.. Could you please explain? Thank you @LantaoYu @nathan-whitaker

LantaoYu closed this as completed Jun 23, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about Rollout #19

Question about Rollout #19

nathan-whitaker commented Jun 13, 2017

nathan-whitaker commented Jun 13, 2017

guotong1988 commented Jul 29, 2017

guotong1988 commented Aug 2, 2017 •

edited

Question about Rollout #19

Question about Rollout #19

Comments

nathan-whitaker commented Jun 13, 2017

nathan-whitaker commented Jun 13, 2017

guotong1988 commented Jul 29, 2017

guotong1988 commented Aug 2, 2017 • edited

guotong1988 commented Aug 2, 2017 •

edited