Training API uses streams #60

Merged: 3 commits merged into elixir-nx:main on May 3, 2021
Conversation

t-rutten (Contributor):
Per discussion in #25, changes here adapt the training API to accept inputs and labels that are streams.
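A minimal sketch of what this enables (plain Elixir, with tensor construction elided; the tuple shapes are illustrative assumptions, not the PR's exact types): the dataset can be a lazy `Stream` of `{input, label}` pairs, so nothing is materialized until the training loop consumes it.

```elixir
# Sketch of a streamed dataset: each element is an {input_batch, label_batch}
# pair, generated on demand rather than held in memory up front.
inputs = Stream.map(1..1_000_000, fn i -> {:input_batch, i} end)
labels = Stream.map(1..1_000_000, fn i -> {:label_batch, i} end)
dataset = Stream.zip(inputs, labels)

# Only the elements actually consumed are ever produced:
Enum.take(dataset, 2)
# => [{{:input_batch, 1}, {:label_batch, 1}},
#     {{:input_batch, 2}, {:label_batch, 2}}]
```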

@seanmor5 (Contributor) left a review:
Thanks for putting this together! I left a comment to see if we can address that before we merge. BTW, make sure to run the formatter so it passes the CI!

```elixir
x when is_integer(x) ->
  x

{model_state, avg_loss, total_batches} =
  for {{inp, tar}, i} <- dataset, reduce: {model_state, Nx.tensor(0.0), 0} do
```
@seanmor5 (Contributor):
I'm guessing this runs to the end of the stream? I think we'll want an option to terminate before the end of a stream. I'm trying to think of the best way to do that without loading n steps into memory all at once with something like Enum.take. Any thoughts?

Contributor:
You can do a `Stream.take(n)` or a `throw` with `try`/`catch`.
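For reference, `Stream.take/2` composes lazily, so it bounds the number of batches without materializing them up front. A small sketch (the counter side effect is just there to show how many elements the source actually produces):

```elixir
# Stream.take/2 returns a new lazy stream; the counter incremented inside
# Stream.map shows that only the first n elements are ever generated,
# even though the source stream is infinite.
counter = :counters.new(1, [])

stream =
  Stream.repeatedly(fn -> :batch end)
  |> Stream.map(fn b ->
    :counters.add(counter, 1, 1)
    b
  end)
  |> Stream.take(3)

Enum.to_list(stream)
:counters.get(counter, 1)
# => 3 (only three batches were produced from the infinite source)
```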

@t-rutten (Contributor, Author) commented on Apr 29, 2021:
Can we go with Enum.reduce_while/3? We only need the accumulator when processing each batch, but does that function keep the accumulator as well as the first n elements of the stream in memory at once? With the continue/halt function of reduce_while it would be easy to incorporate different early-stopping criteria.

@t-rutten (Contributor, Author):
@seanmor5 @josevalim, do you want me to add options for early stopping in this PR? Otherwise it'll be a straightforward addition to convert the comprehension to reduce_while later on.

Early stopping might be specified by a function that accepts a combination of current batch/step loss, previous batch loss, average loss, and batch index and returns a boolean indicating whether training should continue.
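The shape described above could look something like this sketch (the `train_batch` function and the 0.05 loss threshold are made-up stand-ins, not the PR's code; `Enum.reduce_while/3` keeps only the accumulator, and halting stops consumption of the stream):

```elixir
# Early stopping with Enum.reduce_while/3 over an infinite stream.
# `train_batch` is a hypothetical stand-in that pretends the loss
# halves on every batch; real code would run a training step here.
train_batch = fn {_inp, _tar}, state ->
  %{state | loss: state.loss / 2, batches: state.batches + 1}
end

dataset = Stream.repeatedly(fn -> {:inp, :tar} end)

result =
  Enum.reduce_while(dataset, %{loss: 1.0, batches: 0}, fn batch, state ->
    state = train_batch.(batch, state)

    if state.loss < 0.05 do
      # Stop early; the rest of the stream is never consumed.
      {:halt, state}
    else
      {:cont, state}
    end
  end)

result.batches
# => 5 (loss reaches 1/32 = 0.03125 < 0.05 after five batches)
```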

@seanmor5 (Contributor):
@t-rutten I would hold off on that for now; I'd like to implement it as a callback once callbacks are added to the training API.

@t-rutten (Contributor, Author):
Makes sense @seanmor5. Do you have any more suggestions for the changes here?

@seanmor5 (Contributor):
No changes; I'm happy with where this is. Unless you have anything else you'd like to add, I'll merge :)

@t-rutten (Contributor, Author):
I don't have anything else to add :) Thanks!

@seanmor5 (Contributor):

Ahh, I just realized the formatting failure is probably from my recent commit. In any case, feel free to run the formatter on everything so it passes before we merge :)

@seanmor5 merged commit 1371b36 into elixir-nx:main on May 3, 2021.