
Provide deterministic builds #427

Merged (11 commits, Dec 21, 2022)
Conversation

josevalim (Contributor) commented on Dec 13, 2022:

  • Parameter IDs were removed
  • Dropouts are completely removed from the network via a new :mode option (see the sketch after this list)
  • Freezing traverses the nodes directly without relying on IDs
  • Removed almost all usage of backend_copy(Nx.Defn.Expr)
  • We use integers as cache keys after the cache is built
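
A minimal sketch of the new :mode option (hedged against the Axon docs, not copied from this PR's diff): when a model is built for inference, dropout layers are pruned from the graph entirely.

```elixir
model =
  Axon.input("features", shape: {nil, 8})
  |> Axon.dense(16, activation: :relu)
  |> Axon.dropout(rate: 0.5)
  |> Axon.dense(1)

# Build for training: dropout stays in the computation.
{init_fn, train_predict_fn} = Axon.build(model, mode: :train)

# Build for inference (the default): dropout is removed from the
# graph, so the build is deterministic and dropout costs nothing.
{_init_fn, predict_fn} = Axon.build(model, mode: :inference)
```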

lib/axon.ex: outdated review thread (resolved)
josevalim changed the title from "Initial work on deterministic builds" to "Provide deterministic builds" on Dec 13, 2022
```elixir
)

# Names are computed lazily, so compute name from current
# op and aggregate op_counts.
name = name_fn.(op_name, op_counts)
op_counts = Map.update(op_counts, op_name, 1, fn x -> x + 1 end)

# TODO: Hack for dropout with key, fix with a better implementation
```
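
For illustration, a hypothetical name_fn in the spirit of the excerpt above (this is a stand-in, not the PR's actual implementation): deriving the name from the aggregate op counts makes naming deterministic for a fixed traversal order.

```elixir
# Hypothetical default name function: op name plus the count seen so far.
name_fn = fn op_name, op_counts ->
  "#{op_name}_#{Map.get(op_counts, op_name, 0)}"
end

op_counts = %{}
name = name_fn.(:dense, op_counts)
# => "dense_0"

op_counts = Map.update(op_counts, :dense, 1, fn x -> x + 1 end)
name = name_fn.(:dense, op_counts)
# => "dense_1"
```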
josevalim (Contributor, Author) commented:

Perhaps it should not be based on the name but I would say it should be based on the key. Thoughts?

```diff
  # Compute arguments to be forwarded and ensure `:mode` is included
  # for inference/training behavior dependent functions
- args = Enum.reverse(tensor_inputs) ++ [Keyword.put(opts, :mode, mode)]
+ args = Enum.reverse(tensor_inputs, [Keyword.put(opts, :mode, mode)])
```
josevalim (Contributor, Author) commented:

Enum.reverse(list, tail) is an efficient version of Enum.reverse(list) ++ tail.
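
For example (standard Elixir; both forms build the same list, but reverse/2 prepends onto the tail directly instead of reversing and then copying the left operand for ++):

```elixir
iex> Enum.reverse([1, 2, 3]) ++ [4, 5]
[3, 2, 1, 4, 5]

iex> Enum.reverse([1, 2, 3], [4, 5])
[3, 2, 1, 4, 5]
```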

```diff
-      {_, out, :train} ->
-        out
-    end)
+    Nx.select(mask, input / keep_prob, Nx.tensor(0, type: Nx.type(input)))
```
josevalim (Contributor, Author) commented:

I removed the mode check from dropout because it is no longer relevant.
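
For reference, a minimal sketch of inverted dropout outside of Axon (plain Nx; the names and key handling here are illustrative, not this PR's implementation):

```elixir
key = Nx.Random.key(42)
rate = 0.5
keep_prob = 1.0 - rate
input = Nx.iota({2, 4}, type: :f32)

# Draw uniform noise and keep each element with probability keep_prob.
{noise, _key} = Nx.Random.uniform(key, shape: Nx.shape(input))
mask = Nx.less(noise, keep_prob)

# Scale kept activations by 1 / keep_prob so the expected value of the
# output matches the input ("inverted" dropout).
out = Nx.select(mask, Nx.divide(input, keep_prob), Nx.tensor(0, type: Nx.type(input)))
```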

lib/axon/losses.ex: outdated review thread (resolved)
```diff
@@ -114,7 +114,7 @@ defmodule CompilerTest do
   x2 = Axon.dense(input, 64)
   model = Axon.add(x1, x2)

-  {init_fn, _predict_fn} = Axon.build(model)
+  {init_fn, _predict_fn} = Axon.build(model, debug: true)
```
josevalim (Contributor, Author) commented:
We only get fn stacktraces with debug: true now.

lib/axon.ex (outdated diff):
```diff
@@ -2722,7 +2721,8 @@ defmodule Axon do

   defp rnn_state(x, units, rnn_type, parent_name, state_name, initializer) do
     initializer = initializer || :glorot_uniform
     key = Nx.Random.key(:erlang.system_time()) |> Nx.backend_copy(Nx.Defn.Expr)
+    # TODO: This key should be managed by the compiler
```
josevalim (Contributor, Author) commented:

Similar to dropout.
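
For context, a minimal sketch of what compiler-managed keys could look like (hypothetical, assuming one root key supplied by the caller rather than :erlang.system_time/0): each stateful layer gets its own subkey via Nx.Random.split, so the same seed reproduces the same initial state on every build.

```elixir
# Hypothetical: one deterministic root key for the whole build.
root_key = Nx.Random.key(1701)

# Split into per-layer subkeys instead of reading the system clock.
keys = Nx.Random.split(root_key, parts: 2)
rnn_key = keys[0]
dropout_key = keys[1]
```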

josevalim (Contributor, Author) commented:

This is good to go!

lib/axon/losses.ex: outdated review threads (resolved)