A question about the `sample_actions()` #5

fuyw · 2022-03-10T02:40:54Z

Line 66 in 09d7002

@functools.partial(jax.jit, static_argnames=('actor_def', 'distribution'))

Hi Ilya,

Many thanks for the nice work. I have a question about the sample_actions() function: why do we need _sample_actions()? Isn't it redundant?

Maybe we can simply write:

@functools.partial(jax.jit, static_argnames=('actor_def',))
def sample_actions(rng, actor_def, actor_params, observations, temperature):
    dist = actor_def.apply({'params': actor_params}, observations, temperature)
    rng, key = jax.random.split(rng)
    return rng, dist.sample(seed=key)
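For readers who want to try the single-function variant in isolation, here is a minimal, self-contained sketch. `ToyActor` and `ToyNormal` are hypothetical stand-ins for the repository's Flax actor and its tfd/distrax distribution, not the actual classes; only the `sample_actions` body mirrors the snippet above.

```python
import functools
from typing import NamedTuple

import jax
import jax.numpy as jnp


class ToyNormal(NamedTuple):
    """Hypothetical stand-in for a tfd/distrax distribution object."""
    loc: jnp.ndarray
    scale: jnp.ndarray

    def sample(self, seed):
        # Reparameterized Gaussian sample: loc + scale * noise.
        noise = jax.random.normal(seed, self.loc.shape)
        return self.loc + self.scale * noise


class ToyActor:
    """Hypothetical stand-in for a Flax module.

    Instances are hashable by identity, so one can be passed as a static
    argument to jax.jit.
    """

    def apply(self, variables, observations, temperature=1.0):
        params = variables['params']
        loc = observations @ params['w'] + params['b']
        scale = jnp.ones_like(loc) * temperature
        return ToyNormal(loc, scale)


@functools.partial(jax.jit, static_argnames=('actor_def',))
def sample_actions(rng, actor_def, actor_params, observations, temperature=1.0):
    dist = actor_def.apply({'params': actor_params}, observations, temperature)
    rng, key = jax.random.split(rng)
    return rng, dist.sample(seed=key)


actor = ToyActor()
params = {'w': jnp.zeros((3, 2)), 'b': jnp.zeros((2,))}
obs = jnp.ones((4, 3))
rng = jax.random.PRNGKey(0)
rng, actions = sample_actions(rng, actor, params, obs, 1.0)
```

The function returns the advanced `rng` together with the sampled actions, so the caller threads the key through successive calls instead of reusing it.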

I also tried to reimplement IQL with TrainState, and found that using TrainState is slower than this implementation (~100-200 fps).

ikostrikov · 2022-03-10T02:44:49Z

@fuyw I think there is some bug on Windows otherwise: ikostrikov/jaxrl#18

That's cool! Is it this implementation? I will take a look.

fuyw · 2022-03-10T03:02:14Z

Thanks for the reply. Yes, it is; I just refactored the code following the official Flax examples.

For simplicity, I replaced tfd with distrax; this makes no difference in my experiments.

fuyw · 2022-03-10T03:56:27Z

Sorry Ilya, I found a bug in my previous implementation. I called jax.device_put() when sampling from the buffer, which wastes time. After fixing this bug, the throughput is now close to that of this implementation.
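As an illustration of the fix described above, the sketch below samples a minibatch as a plain NumPy array and passes it straight to a jitted function, with no explicit jax.device_put() in the sampling path. The buffer layout, `sample_batch`, and `update` are hypothetical placeholders, not code from either implementation, and no timing is claimed here.

```python
import jax
import jax.numpy as jnp
import numpy as np

# Hypothetical replay buffer stored as host-side NumPy arrays.
np_rng = np.random.default_rng(0)
buffer_obs = np_rng.normal(size=(1000, 3)).astype(np.float32)


def sample_batch(batch_size):
    # Index on the host and return a plain NumPy array;
    # no jax.device_put() is called here.
    idx = np_rng.integers(0, len(buffer_obs), size=batch_size)
    return buffer_obs[idx]


@jax.jit
def update(batch):
    # Placeholder for a training step; returns a scalar statistic.
    return jnp.mean(batch ** 2)


batch = sample_batch(256)
# JAX transfers NumPy inputs to the device when the jitted function is called.
loss = update(batch)
```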

fuyw changed the title from "A question about the ``" to "A question about the sample_actions()" on Mar 10, 2022

fuyw closed this as completed Mar 10, 2022
