Benchmark eager vs. graph? #2
Ok, I guess my intuition is not that great. I just ran an actual benchmark, training the full yolov3-tiny network for one epoch of the VOC2012 dataset on a Tesla M60 (comparable to a GTX 1060).

keras.fit eager: train: 157s

Eager is slower than graph mode under keras.fit, but GradientTape by itself is not slower. Worth noting that the GradientTape loop didn't have any of the Keras metrics or callbacks, which might contribute to the speed-up. I tried compiling the GradientTape step with @tf.function, but did not see any performance change.

Now for the maintainability point. If you want batteries-included callbacks and metrics, use keras.fit and adjust the eager flag as needed. For more customized training control, use GradientTape. Either way, there is never a case where you have to keep both keras.fit and GradientTape; I did it here only to showcase the different ways to train models in TF 2.0.

However, for inference there is a pretty big difference between eager and graph mode: YOLOv3 on 608x608 input takes about 200ms in eager mode but only 120ms in graph mode.
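The two training paths being compared above can be sketched roughly as follows. This is a minimal toy example, not the repo's actual code: the model, dataset, and loss here are placeholder stand-ins for yolov3-tiny and VOC2012.

```python
import tensorflow as tf

# Toy stand-ins for the real YOLOv3 model and VOC dataset.
model = tf.keras.Sequential([tf.keras.layers.Dense(10, input_shape=(4,))])
loss_fn = tf.keras.losses.MeanSquaredError()
optimizer = tf.keras.optimizers.Adam()
x = tf.random.normal((32, 4))
y = tf.random.normal((32, 10))
dataset = tf.data.Dataset.from_tensor_slices((x, y)).batch(8)

# Path 1: keras.fit -- the run_eagerly flag toggles eager vs. graph
# execution of the built-in training step.
model.compile(optimizer=optimizer, loss=loss_fn, run_eagerly=False)
model.fit(dataset, epochs=1, verbose=0)

# Path 2: a manual GradientTape loop, with no Keras metrics or
# callbacks. Wrapping the step in @tf.function traces it into a graph.
@tf.function
def train_step(xb, yb):
    with tf.GradientTape() as tape:
        pred = model(xb, training=True)
        loss = loss_fn(yb, pred)
    grads = tape.gradient(loss, model.trainable_variables)
    optimizer.apply_gradients(zip(grads, model.trainable_variables))
    return loss

for xb, yb in dataset:
    loss = train_step(xb, yb)
```

Timing these two loops on the real model is essentially what the numbers above measure; the GradientTape path skips the metric/callback bookkeeping that keras.fit does every batch.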
Thanks for the details!
@zzh8829 what changes did you make to switch from eager mode to graph mode? I tried to do this by adding
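For reference, the usual TF 2.0 way to move a model call from eager to graph execution is to wrap it in `tf.function` (a general pattern, not necessarily the exact change made in this repo):

```python
import tensorflow as tf

# Placeholder model standing in for YOLOv3.
model = tf.keras.Sequential([tf.keras.layers.Dense(10, input_shape=(4,))])

# Eager: each call runs op-by-op in Python.
eager_out = model(tf.random.normal((1, 4)))

# Graph: tf.function traces the call into a graph on first use,
# then reuses the compiled graph for subsequent calls.
graph_model = tf.function(model)
graph_out = graph_model(tf.random.normal((1, 4)))
```

The first call to the wrapped model pays a one-time tracing cost; the speed-up shows up on repeated calls with the same input shape.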
Thanks for sharing this work! It's definitely interesting to see an example of what TF 2.0 is going to be like to work with.
I have one question: in your readme you mention:
Could you expand on this? Maybe share some numbers?
I'm interested because the promise of eager execution has always been to have imperative programming (for an easier workflow) without losing too much performance. If it turns out, however, that for practical purposes it's not feasible to train in eager mode, one would have to maintain separate training loops, as you've done in train.py. It seems to me this would be detrimental to the maintainability of TF2 repositories. Do you have a view on this?