Use faster queue for replay buffers #131

toslunar · 2017-08-09T09:49:21Z

TODO:

Add tests
Measure performance

This will happen to resolve #36 (see also #128)

coveralls · 2017-08-09T10:31:30Z

Coverage decreased (-0.3%) to 71.167% when pulling 34c80ab on toslunar:replace-deque into e095992 on chainer:master.

coveralls · 2017-08-10T09:53:44Z

Coverage increased (+0.005%) to 71.518% when pulling d7299fc on toslunar:replace-deque into e095992 on chainer:master.

toslunar · 2017-08-10T10:52:48Z

😰
before: 13.168848936009454
after: 16.709212244983064

import numpy as np
import timeit

from chainerrl import replay_buffer


def rand_state():
    return np.random.rand(1, 3, 50, 20).astype(np.float32)


def rand_action():
    return np.random.rand(1, 40).astype(np.float32)


def rand_reward():
    return np.random.rand()


def f(capacity, batch_size, steps, replay_start_size):
    rbuf = replay_buffer.ReplayBuffer(capacity=capacity)
    s = rand_state()
    a = rand_action()
    for i in range(steps):
        next_s = rand_state()
        next_a = rand_action()
        rbuf.append(s, a, rand_reward(), next_s, next_a)
        s = next_s
        a = next_a
        if i >= replay_start_size:
            rbuf.sample(batch_size)


print(min(timeit.Timer(
    'f(10000, 64, 100000, 1000)',
    setup="from __main__ import f; gc.enable()").
    repeat(repeat=3, number=1)))

toslunar · 2017-08-10T11:58:46Z

before: 337.00381314000697
after: 171.62339709801017
for f(100000, 64, 1000000, 10000).

muupan · 2017-08-11T22:00:00Z

Good job! But this solution seems unecessarily complex to me. How about using a ring buffer like this? https://github.com/matthiasplappert/keras-rl/blob/master/rl/memory.py#L35

I think we can assume maxlen is specified for replay buffers, otherwise training would face out-of-memory eventually.

toslunar · 2017-08-14T08:18:48Z

EpisodicReplayBuffer limits the number of the transitions in the buffer. It's not easy to determine the best maxlen of self.episodic_memory = RingBuffer(maxlen). Of course, there is sufficiently large one (and this doesn't cost memory much, relatively to other parts): self.episodic_memory = RingBuffer(capacity).

toslunar · 2017-08-17T08:06:16Z

collections.deque + random.sample: 8.064196354011074

q = deque(maxlen=10000)
for _ in range(100000):
    q.append(1)
    if len(q) > 1000:
        random.sample(q, 64)

RandomAccessQueue
-- at d7299fc: 13.529830269049853
-- at fc39b20: 10.133344302070327
-- at b8bec91: 5.198489462025464

q = RandomAccessQueue(maxlen=10000)
for _ in range(100000):
    q.append(1)
    if len(q) > 1000:
        q._sample(64)

to sample distinct elements

coveralls · 2017-08-17T10:29:19Z

Coverage increased (+0.02%) to 71.536% when pulling 88c9fac on toslunar:replace-deque into e095992 on chainer:master.

coveralls · 2017-08-17T11:12:28Z

Coverage increased (+0.02%) to 71.536% when pulling 88c9fac on toslunar:replace-deque into e095992 on chainer:master.

coveralls · 2017-08-17T12:51:48Z

Coverage increased (+0.01%) to 71.526% when pulling 874b4ad on toslunar:replace-deque into e095992 on chainer:master.

coveralls · 2017-08-18T07:52:21Z

Coverage increased (+0.1%) to 71.624% when pulling 312c025 on toslunar:replace-deque into e095992 on chainer:master.

coveralls · 2017-08-18T11:57:00Z

Coverage increased (+0.1%) to 71.624% when pulling fd14a7b on toslunar:replace-deque into e095992 on chainer:master.

H306

coveralls · 2017-08-22T04:40:35Z

Coverage increased (+0.1%) to 71.624% when pulling 1d9b658 on toslunar:replace-deque into e095992 on chainer:master.

toslunar · 2017-08-24T08:44:56Z

time (secs) after the debug:

(steps, maxlen)	baseline: collections.deque + random.sample	collections.deque + _sample_n_k	RandomAccessQueue + _sample_n_k
(100000, 10000)	7.3263045670464635	5.774126871023327	7.128595568938181
(300000, 30000)	26.707710441900417	21.629545310977846	20.986778931925073
(1000000, 100000)		197.8464105380699	74.75397148309276

Non-repetitive sampling by [q[i] for i in _sample_n_k(len(q),64)] is faster than by random.sample(q, 64). With this faster sampling, RandomAccessQueue is faster than collections.deque when maxlen is larger than around 30000.

muupan

LGTM except two comments. Great work!

muupan · 2017-09-01T23:46:09Z

chainerrl/misc/collections.py

+
+        return self._queue_front.pop()
+
+    def _sample(self, k):


This method is called from ReplayBuffer, thus should be public.

Use one leading underscore only for non-public methods and instance variables.

https://www.python.org/dev/peps/pep-0008/#method-names-and-instance-variables

muupan · 2017-09-02T03:12:52Z

tests/misc_tests/test_collections.py

+        cdfs_r = (np.arange(n) + 1) / n
+
+        # Kolmogorov-Smirnov statistic
+        d = max(np.amax(np.abs(cdfs - cdfs_x)) for cdfs_x in [cdfs_l, cdfs_r])


Although your implementation looks correct, scipy has scipy.stats.kstest and I think it's better to use the existing well-tested implementation.

You can get p-value by scipy.stats.kstest(xs, 'norm', args=(mean, std))[1].

muupan · 2017-09-02T03:42:30Z

chainerrl/misc/collections.py

@@ -0,0 +1,129 @@
+import itertools


Add future imports (at least range is affected)

toslunar · 2017-09-04T03:19:46Z

Fixed.

coveralls · 2017-09-04T04:16:24Z

Coverage increased (+0.2%) to 71.665% when pulling eba0baa on toslunar:replace-deque into e095992 on chainer:master.

coveralls · 2017-09-04T04:33:07Z

Coverage increased (+0.2%) to 71.665% when pulling eba0baa on toslunar:replace-deque into e095992 on chainer:master.

toslunar added 3 commits August 9, 2017 18:20

Add RandomAccessQueue

d41eb0f

Add maxlen option and extend method

e700e82

Replace deque by RandomAccessQueue

34c80ab

toslunar added 2 commits August 10, 2017 18:33

debug

13bc85d

Add tests

d7299fc

toslunar changed the title ~~[WIP] Use faster queue for replay buffers~~ Use faster queue for replay buffers Aug 10, 2017

toslunar added 2 commits August 17, 2017 16:21

speedup

fc39b20

speedup further

b8bec91

Revert the first 'speedup'

88c9fac

to sample distinct elements

toslunar added 3 commits August 17, 2017 20:30

Make non-repetitive sampling faster

47316d0

.

8cd347e

Add tests

874b4ad

toslunar added 4 commits August 18, 2017 12:25

Remove unused codes

5cd1b5b

Add docstring

b14217c

Improve tests

312c025

Remove unused code

fd14a7b

Fix import order

1d9b658

H306

muupan requested changes Sep 2, 2017

View reviewed changes

muupan reviewed Sep 2, 2017

View reviewed changes

toslunar added 3 commits September 4, 2017 12:03

Use scipy.stats

fdd49b6

Make sample method public

a310b4a

Python 2 support

3b0b0ac

Bugfix

eba0baa

muupan approved these changes Oct 13, 2017

View reviewed changes

muupan merged commit e84ea5f into chainer:master Oct 13, 2017

toslunar deleted the replace-deque branch October 16, 2017 01:44

muupan added the enhancement label Nov 30, 2017

muupan added this to the v0.3 milestone Nov 30, 2017

toslunar mentioned this pull request Oct 9, 2018

Make random access queue sampling code cleaner #309

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use faster queue for replay buffers #131

Use faster queue for replay buffers #131

toslunar commented Aug 9, 2017 •

edited

coveralls commented Aug 9, 2017 •

edited

coveralls commented Aug 10, 2017 •

edited

toslunar commented Aug 10, 2017

toslunar commented Aug 10, 2017

muupan commented Aug 11, 2017

toslunar commented Aug 14, 2017

toslunar commented Aug 17, 2017

coveralls commented Aug 17, 2017 •

edited

coveralls commented Aug 17, 2017 •

edited

coveralls commented Aug 17, 2017

coveralls commented Aug 18, 2017 •

edited

coveralls commented Aug 18, 2017 •

edited

coveralls commented Aug 22, 2017 •

edited

toslunar commented Aug 24, 2017

muupan left a comment

muupan Sep 1, 2017 •

edited

muupan Sep 2, 2017 •

edited

muupan Sep 2, 2017

okuta Sep 28, 2017

toslunar commented Sep 4, 2017

coveralls commented Sep 4, 2017 •

edited

coveralls commented Sep 4, 2017 •

edited

Use faster queue for replay buffers #131

Use faster queue for replay buffers #131

Conversation

toslunar commented Aug 9, 2017 • edited

coveralls commented Aug 9, 2017 • edited

coveralls commented Aug 10, 2017 • edited

toslunar commented Aug 10, 2017

toslunar commented Aug 10, 2017

muupan commented Aug 11, 2017

toslunar commented Aug 14, 2017

toslunar commented Aug 17, 2017

coveralls commented Aug 17, 2017 • edited

coveralls commented Aug 17, 2017 • edited

coveralls commented Aug 17, 2017

coveralls commented Aug 18, 2017 • edited

coveralls commented Aug 18, 2017 • edited

coveralls commented Aug 22, 2017 • edited

toslunar commented Aug 24, 2017

muupan left a comment

Choose a reason for hiding this comment

muupan Sep 1, 2017 • edited

Choose a reason for hiding this comment

muupan Sep 2, 2017 • edited

Choose a reason for hiding this comment

muupan Sep 2, 2017

Choose a reason for hiding this comment

okuta Sep 28, 2017

Choose a reason for hiding this comment

toslunar commented Sep 4, 2017

coveralls commented Sep 4, 2017 • edited

coveralls commented Sep 4, 2017 • edited

toslunar commented Aug 9, 2017 •

edited

coveralls commented Aug 9, 2017 •

edited

coveralls commented Aug 10, 2017 •

edited

coveralls commented Aug 17, 2017 •

edited

coveralls commented Aug 17, 2017 •

edited

coveralls commented Aug 18, 2017 •

edited

coveralls commented Aug 18, 2017 •

edited

coveralls commented Aug 22, 2017 •

edited

muupan Sep 1, 2017 •

edited

muupan Sep 2, 2017 •

edited

coveralls commented Sep 4, 2017 •

edited

coveralls commented Sep 4, 2017 •

edited