
tutorial: adversarial training seems slow. maybe i'm wrong #5

Closed
goodfeli opened this issue Sep 15, 2016 · 8 comments · Fixed by #8

Comments

@goodfeli (Contributor)

We should benchmark it and make sure the runtime is correct.

@goodfeli

without looking at the code, I bet we're missing a stop_gradient somewhere
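
Roughly what I mean (just an illustrative TF 1.x graph-mode sketch, not the tutorial's actual code; `fgsm_adv_x` and `eps` are made-up names):

```python
import tensorflow as tf  # TF 1.x graph-mode API assumed

def fgsm_adv_x(x, loss, eps=0.3):
    """FGSM-style adversarial example for input tensor x and scalar loss."""
    grad, = tf.gradients(loss, x)       # gradient of the loss w.r.t. the input
    adv_x = x + eps * tf.sign(grad)
    # stop_gradient keeps the training gradient from flowing back through the
    # attack step; without it we pay for (and optimize through) an unnecessary
    # second differentiation of the graph.
    return tf.stop_gradient(adv_x)
```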

@goodfeli

3.7 seconds per 100 batches for naive training

@goodfeli

52 sec per 100 batches for adv training

@goodfeli

in pylearn2, my result with adversarial training takes 3 sec per full epoch

@goodfeli

in pylearn2, without adversarial training, my code runs in 1 sec per full epoch

@goodfeli

Naive training is forward-back, i.e. 2 passes per batch.
Adversarial training is forward-back, then back with different targets (to craft the adversarial examples), then forward-back on those examples, i.e. 5 passes if none of the steps can be parallelized. So in theory it should be roughly 2.5x slower than naive training.
The pylearn2 implementation is 3x slower than naive training (3 sec vs 1 sec per epoch), so apparently in practice we can expect some extra overhead.
That still doesn't explain why this tutorial is more than 10x slower (52 sec vs 3.7 sec per 100 batches, roughly 14x).
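
As a quick sanity check on the arithmetic (illustrative only, assuming every forward or backward pass costs about the same):

```python
naive_passes = 2.0   # forward + backward
adv_passes = 5.0     # forward + backward, backward for the attack,
                     # then forward + backward on the adversarial batch
print(adv_passes / naive_passes)  # 2.5   theoretical slowdown
print(3.0 / 1.0)                  # 3.0   observed in pylearn2 (sec per epoch)
print(52.0 / 3.7)                 # ~14   observed here (sec per 100 batches)
```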

@goodfeli

whoa, actually something is seriously weird.
1st 100 batches with adv training take 54 seconds
2nd 100 batches take 102 seconds
3rd 100 batches take 153 seconds

@npapernot (Member)

You are right: the issue was due to my naive implementation, which redefined the adversarial loss in the TF graph at each iteration (each batch). I fixed it by introducing a new function that adds the loss to the graph once and returns the TF variable to be evaluated at each iteration: d7a95d3
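
In other words (a hedged sketch of the pattern, not the exact code in d7a95d3; `model_fn`, `add_adversarial_loss`, and `eps` are made-up names): the graph-building call happens once, outside the batch loop, and the loop only evaluates the returned tensor.

```python
import tensorflow as tf  # TF 1.x graph-mode API assumed

def add_adversarial_loss(model_fn, x, y, eps=0.3):
    """Add the clean + adversarial loss to the graph ONCE and return the tensor."""
    clean_loss = tf.reduce_mean(
        tf.nn.softmax_cross_entropy_with_logits(labels=y, logits=model_fn(x)))
    grad, = tf.gradients(clean_loss, x)
    adv_x = tf.stop_gradient(x + eps * tf.sign(grad))
    adv_loss = tf.reduce_mean(
        tf.nn.softmax_cross_entropy_with_logits(labels=y, logits=model_fn(adv_x)))
    return clean_loss + adv_loss

# Build once...
#   total_loss = add_adversarial_loss(model_fn, x, y)
#   train_op = tf.train.AdamOptimizer().minimize(total_loss)
# ...then only run the existing ops inside the loop. Calling a graph-building
# function like add_adversarial_loss inside the loop keeps adding new ops, so
# every sess.run works on a bigger graph and batches get slower and slower.
#   for x_batch, y_batch in batches:
#       sess.run(train_op, feed_dict={x: x_batch, y: y_batch})
```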
