Shuffle samples to generate each mini-batch at Replay.get_batch. #59

nagachika · 2017-07-26T06:10:43Z

Previous implementation randomize only the offset in the memory and the mini-batch contains a series of samples in original experienced order.
I believe the experiences should be sampled at random to get rid of bias of experiences.

I've realized np.random.permutation with whole memory capacity is not memory efficient. Because I'm newbee on Python and numpy, I cannot find the equivalent way to do this more efficiently. Any hint or suggestions are welcome.

Previous implementation randomize only the offset in the memory and the mini-batch contains a series of samples in original experienced order. I believe the experiences should be sampled at random to get rid of bias of experiences.

michaelschaarschmidt · 2017-07-26T07:29:45Z

Hi,

thanks for helping! These are 2 separate strategies and both can be desired. One is to sample each entry separately (not with np.permutaton but with np.random.shuffle and re-assignment of keys or with just picking many indices), one is to sample continuous ranges. Hence, I wont merge this but rather suggest you create an issue and then we will see how we can make this optional.

Strategy 1: Batch size n, select n random indices
Strategy 2: Batch size n, select one index, continuous range, but reshuffle the data

1 sounds intuitively more performant to me but we will have a discussion

nagachika · 2017-07-26T12:26:28Z

Thank you for your comment!
I didn't know that there's the case that the continuous experiences are desirable. I will file new issue to discuss optional strategies.

michaelschaarschmidt closed this Jul 26, 2017

nagachika mentioned this pull request Jul 26, 2017

Options of strategy about experience sampling at Replay.get_batch. #60

Closed

nagachika deleted the shuffle_replay_memory_for_each_batch branch July 27, 2017 01:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shuffle samples to generate each mini-batch at Replay.get_batch. #59

Shuffle samples to generate each mini-batch at Replay.get_batch. #59

nagachika commented Jul 26, 2017

michaelschaarschmidt commented Jul 26, 2017 •

edited

nagachika commented Jul 26, 2017

Shuffle samples to generate each mini-batch at Replay.get_batch. #59

Shuffle samples to generate each mini-batch at Replay.get_batch. #59

Conversation

nagachika commented Jul 26, 2017

michaelschaarschmidt commented Jul 26, 2017 • edited

nagachika commented Jul 26, 2017

michaelschaarschmidt commented Jul 26, 2017 •

edited