[Feature Request] Flax GeneralizedModule should be able to pass rngs dict to module.init during summary #154

Closed
sooheon opened this issue Feb 3, 2021 · 5 comments · Fixed by #185
Labels: enhancement (New feature or request)

Comments

sooheon (Contributor) commented Feb 3, 2021

Currently, the low-level API works for toy linen modules, but it does not allow passing in multiple RNG keys, which Flax modules require for e.g. dropout.

I'm not sure about the API design, but the hardcoded init happens in LinenModule.init. Somehow it should be possible to pass it a set of stream names to associate with rng.next() values.
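
Something along these lines, perhaps (purely illustrative; the rng_names argument does not exist in elegy today, and rng is assumed to be an elegy.RNGSeq):

import flax.linen as nn

class LinenModuleSketch:
    # Hypothetical wrapper: the user declares which rng stream names to create.
    def __init__(self, module: nn.Module, rng_names=("params",)):
        self.module = module
        self.rng_names = rng_names

    def init(self, rng, *args):
        # draw one rng.next() per declared stream instead of
        # hardcoding {"params": rng.next()}
        rngs = {name: rng.next() for name in self.rng_names}
        return self.module.init(rngs, *args)

# e.g. LinenModuleSketch(MLP(), rng_names=("params", "dropout"))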

Minimal repro:

import dataget
import elegy
import flax.linen as nn
import jax
import jax.numpy as jnp
import optax

X_train, y_train, X_test, y_test = dataget.image.mnist(global_cache=True).get()

print("X_train:", X_train.shape, X_train.dtype)
print("y_train:", y_train.shape, y_train.dtype)
print("X_test:", X_test.shape, X_test.dtype)
print("y_test:", y_test.shape, y_test.dtype)


# %%
class MLP(nn.Module):
    @nn.compact
    def __call__(self, x):
        x = nn.Dense(300)(x)
        x = nn.relu(x)
        x = nn.Dropout(0.1)(x)
        x = nn.Dense(10)(x)
        return x


class FlaxLinearClassifier(elegy.Model):
    def test_step(
        self, x, y_true, states: elegy.States, initializing: bool, rng: elegy.RNGSeq
    ):
        x = jnp.reshape(x, (x.shape[0], -1)) / 255
        if initializing:
            variables = self.module.init(
                {"params": rng.next(), "dropout": rng.next()}, x
            )
            params = variables["params"]
        else:
            params = states.net_params

        logits = self.module.apply({"params": params}, x, rngs={"dropout": rng.next()})
        labels = jax.nn.one_hot(y_true, 10)
        loss = jnp.mean(-jnp.sum(labels * jax.nn.log_softmax(logits), axis=-1))
        accuracy = jnp.mean(jnp.argmax(logits, axis=-1) == y_true)

        logs = dict(accuracy=accuracy, loss=loss)
        return loss, logs, states.update(rng=rng, net_params=params)


model = FlaxLinearClassifier(module=MLP(), optimizer=optax.adamw(1e-3))

model.summary(X_test[:64])

AssertionError: Need PRNG for "dropout"

sooheon added the enhancement label Feb 3, 2021
cgarciae (Collaborator) commented Feb 3, 2021

Hey @sooheon! This is a good point.

On one hand, we can improve our LinenModule implementation. Currently, the calls to linen.Module.init and linen.Module.apply are implemented like this:

https://github.com/poets-ai/elegy/blob/master/elegy/generalized_module/linen_module.py#L24
https://github.com/poets-ai/elegy/blob/master/elegy/generalized_module/linen_module.py#L61-L67

As you point out, rng values are only given for params; the problem is that we don't know a priori what names the user might use. I think we could just add the most common names even if users won't need them, but I don't know if this solves the problem in general.
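
In code, that idea would look roughly like this (just a sketch over the lines linked above; DEFAULT_RNG_NAMES does not exist yet):

def init(self, rng, *args, **kwargs):
    # sketch: overcompensate with a few common stream names; Flax simply
    # ignores rng entries that the module never asks for
    DEFAULT_RNG_NAMES = ("params", "dropout")  # hypothetical default list
    rngs = {name: rng.next() for name in DEFAULT_RNG_NAMES}
    return self.module.init(rngs, *args, **kwargs)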

On the other hand, summary calls pred_step (not test_step), so you can refactor your code like this to fix the issue:

import dataget
import elegy
import flax.linen as nn
import jax
import jax.numpy as jnp
import optax

X_train, y_train, X_test, y_test = dataget.image.mnist(global_cache=True).get()

print("X_train:", X_train.shape, X_train.dtype)
print("y_train:", y_train.shape, y_train.dtype)
print("X_test:", X_test.shape, X_test.dtype)
print("y_test:", y_test.shape, y_test.dtype)


class MLP(nn.Module):
    @nn.compact
    #@elegy.flax_summarize # use decorators to report module summaries
    def __call__(self, x):
        x = nn.Dense(300)(x)  # core modules don't report summaries :(
        x = nn.relu(x)
        x = nn.Dropout(0.1)(x)
        x = nn.Dense(10)(x)
        return x


class FlaxLinearClassifier(elegy.Model):
    def pred_step(
        self, x, states: elegy.States, initializing: bool, rng: elegy.RNGSeq
    ) -> elegy.PredStep:
        x = jnp.reshape(x, (x.shape[0], -1)) / 255
        if initializing:
            variables = self.module.init(
                {"params": rng.next(), "dropout": rng.next()}, x
            )
            params = variables["params"]
        else:
            params = states.net_params

        logits = self.module.apply({"params": params}, x, rngs={"dropout": rng.next()})

        return elegy.PredStep.simple(logits, states.update(rng=rng, net_params=params))

    def test_step(self, x, y_true, states, mode, initializing):
        # call_pred_step is the recommended way of invoking pred_step
        logits, states, _, _, _ = self.call_pred_step(x, mode, states, initializing)

        labels = jax.nn.one_hot(y_true, 10)
        loss = jnp.mean(-jnp.sum(labels * jax.nn.log_softmax(logits), axis=-1))
        accuracy = jnp.mean(jnp.argmax(logits, axis=-1) == y_true)

        logs = dict(accuracy=accuracy, loss=loss)
        return loss, logs, states


model = FlaxLinearClassifier(module=MLP(), optimizer=optax.adamw(1e-3))

model.summary(X_test[:64])

I found a bug, so @elegy.flax_summarize (which creates the summaries for the Module's outputs) is commented out above; it can be uncommented once #155 is merged.

cgarciae (Collaborator) commented Feb 3, 2021

I'll be adding guides for the low-level API so it becomes a bit clearer what the different methods you can override do (pred_step, test_step, grad_step, train_step) and how you can compose them.

sooheon (Contributor, Author) commented Feb 4, 2021

Yeah, an example using all of the canonical user-facing API would definitely go a long way.

cgarciae (Collaborator) commented:

@sooheon, is there a way to extract all the possible names that might need an rng from the variables? I am thinking of overcompensating (giving more names than required) just to keep Flax happy.

sooheon (Contributor, Author) commented Feb 22, 2021

There's no static way to know ahead of time, AFAICT. Submodules can call self.make_rng(name='foo'), and it's pretty much up to you to provide the 'foo' rng. Hopefully this clunky API gets improved in the future. OTOH, just adding dropout would give you 99% coverage, I think (I've yet to see a different rng key required).
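
For instance (a toy sketch, unrelated to the repro above), nothing stops a submodule from asking for an arbitrarily named stream:

import flax.linen as nn
import jax
import jax.numpy as jnp

class NoisyDense(nn.Module):
    features: int = 8

    @nn.compact
    def __call__(self, x):
        # the "noise" stream name exists only because this module asked for it
        noise = jax.random.normal(self.make_rng("noise"), x.shape)
        return nn.Dense(self.features)(x + 0.01 * noise)

x = jnp.ones((1, 4))
module = NoisyDense()
# init and apply only succeed if the caller provides the "noise" stream
variables = module.init({"params": jax.random.PRNGKey(0), "noise": jax.random.PRNGKey(1)}, x)
y = module.apply(variables, x, rngs={"noise": jax.random.PRNGKey(2)})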
