
fix(aten::batch_norm): A new batch norm implementation that hopefully doesn't have the same performance cost #55


Merged: 1 commit merged into master on May 14, 2020

Conversation

narendasan
Collaborator

Signed-off-by: Naren Dasan naren@narendasan.com
Signed-off-by: Naren Dasan narens@nvidia.com

Description

Addresses the performance issues seen with large input sizes under the conv-based batch norm implementation. The new implementation uses scale layers instead of convolutions.
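The scale-layer approach works because inference-time batch norm is a per-channel affine transform, so its parameters can be folded into a single scale and shift rather than expressed as a 1x1 convolution. A minimal numpy sketch of that folding, assuming the usual batch norm formulation (function and variable names here are illustrative, not taken from the PR):

```python
import numpy as np

def batch_norm_as_scale(x, gamma, beta, mean, var, eps=1e-5):
    """Apply batch norm as a fused per-channel scale + shift.

    Equivalent to gamma * (x - mean) / sqrt(var + eps) + beta,
    rearranged into the y = x * scale + shift form that a
    scale layer applies in a single pass.
    """
    scale = gamma / np.sqrt(var + eps)   # fold gamma and variance
    shift = beta - mean * scale          # fold beta and running mean
    return x * scale + shift
```

Since `scale` and `shift` depend only on the frozen batch norm parameters, they can be precomputed once at conversion time, avoiding the per-inference overhead of a convolution kernel.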

Type of change

  • Bug fix (non-breaking change which fixes an issue)

Checklist:

  • My code follows the style guidelines of this project
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas and hacks
  • I have made corresponding changes to the documentation and have regenerated the documentation (make html in docsrc)
  • I have added tests to verify my fix or my feature
  • New and existing unit tests pass locally with my changes

doesn't have the same performance cost

Signed-off-by: Naren Dasan <naren@narendasan.com>
Signed-off-by: Naren Dasan <narens@nvidia.com>
@narendasan narendasan added the component: converters Issues re: Specific op converters label May 8, 2020
@narendasan narendasan marked this pull request as draft May 8, 2020 07:21
@narendasan
Collaborator Author

Fix was confirmed:

[JIT]: batch_size: 1
    Average latency: 61.2975 ms
    Average FPS: 16.3139 fps
    Latency Standard Deviation: 0.805419
    FPS Standard Deviation: 0.24558
(excluding initial warmup runs)
[JIT/TRT]: batch_size: 1
    Average latency: 31.1957 ms
    Average FPS: 32.0557 fps
    Latency Standard Deviation: 0.107248
    FPS Standard Deviation: 0.110173

@narendasan narendasan marked this pull request as ready for review May 14, 2020 23:15
@narendasan narendasan merged commit 227dea3 into master May 14, 2020
@narendasan narendasan deleted the batch_norm_alt branch May 14, 2020 23:15
frank-wei pushed a commit that referenced this pull request Jun 4, 2022
Summary:
Pull Request resolved: https://github.com/pytorch/fx2trt/pull/55

Apply pass manager to lower flow

Reviewed By: khabinov

Differential Revision: D35518483

fbshipit-source-id: 48bc9c364cd006cc5a2c1b04d667987827f0a4d4