[Op] Add FP32 fused l2 normalize op and grad op. #291

Duyi-Wang · 2022-07-01T08:30:16Z

No description provided.

liutongxuan · 2022-07-01T11:27:53Z

tensorflow/python/ops/nn_impl.py

 @tf_export(v1=["math.l2_normalize", "linalg.l2_normalize", "nn.l2_normalize"])
 @deprecated_args(None, "dim is deprecated, use axis instead", "dim")
-def l2_normalize(x, axis=None, epsilon=1e-12, name=None, dim=None):
+def l2_normalize(x, axis=None, epsilon=1e-12, do_fusion=True, name=None, dim=None):


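I suggest not adding the parameter in the middle of the signature here, since that forces users to change their code. Add the new parameter after dim.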
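To illustrate the concern about parameter position, here is a minimal sketch using stub functions. The stubs only record how arguments bind; their names (`l2_normalize_old`, `l2_normalize_mid`, `l2_normalize_end`) are hypothetical and are not part of the PR or of TensorFlow.

```python
# Hypothetical stubs: each returns the bound arguments so the binding is visible.

def l2_normalize_old(x, axis=None, epsilon=1e-12, name=None, dim=None):
    """Signature before the change."""
    return {"axis": axis, "epsilon": epsilon, "name": name, "dim": dim}


def l2_normalize_mid(x, axis=None, epsilon=1e-12, do_fusion=True,
                     name=None, dim=None):
    """New parameter inserted in the middle (the variant under review)."""
    return {"axis": axis, "epsilon": epsilon, "do_fusion": do_fusion,
            "name": name, "dim": dim}


def l2_normalize_end(x, axis=None, epsilon=1e-12, name=None, dim=None,
                     do_fusion=True):
    """New parameter appended after dim (the reviewer's suggestion)."""
    return {"axis": axis, "epsilon": epsilon, "do_fusion": do_fusion,
            "name": name, "dim": dim}


# An existing caller that passes `name` positionally.
args = ([3.0, 4.0], 0, 1e-12, "norm")

before = l2_normalize_old(*args)
mid = l2_normalize_mid(*args)
end = l2_normalize_end(*args)

# With the mid-signature variant, "norm" silently binds to do_fusion
# and name falls back to None. Appending keeps the old binding intact.
print(before["name"], mid["name"], mid["do_fusion"], end["name"])
```

Callers that pass every argument by keyword are unaffected either way; only positional call sites change meaning.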

I suggest we leave the original l2_normalize interface unchanged. Since we have added a v2, let v2 enable fusion by default and keep do_fusion as a configurable parameter on v2.

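If do_fusion is removed from v1, fusion can only be hard-wired on or off there. When v1 calls the v2 interface, should fusion be enabled or disabled by default?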

For v1, fusion is disabled.
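The outcome of this thread can be sketched as follows: v1 keeps its original signature and calls v2 with fusion off, while v2 exposes `do_fusion` and defaults it to on. This is a pure-Python stand-in for 1-D inputs, not the PR's implementation; the function names and the returned `path` label are assumptions for illustration. The arithmetic follows the documented `tf.nn.l2_normalize` formula, `x / sqrt(max(sum(x**2), epsilon))`.

```python
import math


def _l2_normalize_reference(values, epsilon):
    """Reference math for a 1-D list: x / sqrt(max(sum(x**2), epsilon))."""
    square_sum = sum(v * v for v in values)
    inv_norm = 1.0 / math.sqrt(max(square_sum, epsilon))
    return [v * inv_norm for v in values]


def l2_normalize_v2(values, epsilon=1e-12, do_fusion=True):
    """Hypothetical v2: fusion is configurable and on by default.

    Both paths compute the same result here; `path` only records which
    kernel a real implementation would have dispatched to.
    """
    path = "fused" if do_fusion else "unfused"
    return _l2_normalize_reference(values, epsilon), path


def l2_normalize_v1(values, epsilon=1e-12):
    """Hypothetical v1: signature unchanged, always calls v2 with fusion off."""
    return l2_normalize_v2(values, epsilon=epsilon, do_fusion=False)


v1_result, v1_path = l2_normalize_v1([3.0, 4.0])
v2_result, v2_path = l2_normalize_v2([3.0, 4.0])
print(v1_path, v2_path)
```

Keeping the switch out of v1 means existing v1 callers see no behavior change, and opting in to the fused kernel is an explicit move to v2.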

bazel test --action_env=TF_CPP_MIN_VLOG_LEVEL=1 --action_env=TF_CPP_MIN_LOG_LEVEL=0 --flaky_test_attempts 1 --test_output=all --nocache_test_results --cxxopt=-D_GLIBCXX_USE_CXX11_ABI=0 --copt=-march=skylake-avx512 -- //tensorflow/core/kernels:fused_l2_normalize_ops_test

bazel test --action_env=TF_CPP_MIN_VLOG_LEVEL=1 --action_env=TF_CPP_MIN_LOG_LEVEL=0 --flaky_test_attempts 1 --test_output=all --nocache_test_results --cxxopt=-D_GLIBCXX_USE_CXX11_ABI=0 --copt=-march=skylake-avx512 -- //tensorflow/python:nn_test

liutongxuan reviewed Jul 1, 2022


changqi1 and others added 16 commits July 11, 2022 16:54

[Op] Add fused l2 normalize op and grad op.

1306a18

[UT] python API implement.

3f32097

bazel test --action_env=TF_CPP_MIN_VLOG_LEVEL=1 --action_env=TF_CPP_MIN_LOG_LEVEL=0 --flaky_test_attempts 1 --test_output=all --nocache_test_results --cxxopt=-D_GLIBCXX_USE_CXX11_ABI=0 --copt=-march=skylake-avx512 -- //tensorflow/python:nn_test

[Op] Add handling of 128 remainders in fused l2 norm.

0caf45f

[Ops] Fix bug in store remainder output.

82d73a2

[Op] Fix array out of bounds in store output.

9df47ff

[Op] Enable fused l2 norm in tf.nn.l2_normalize.

82bcbcb

[Op] Add l2 norm grad test.

9592612

[Op] Add l2 norm called log info.

d9b0f8a

[Op] Add performance benchmarks.

e4408de

[Op] Set fused op to default enabled.

aa54aa0

[Op] Optimize AVX512 perf.

f1581db

fix api_test issue.

72ae2f8

[Op] Add annotations in fused l2n.

abd8b8e

[Op] Change l2n do_fusion parameter position.

a6b7412

[Op] disable fusion in l2n v1.

dd05512

Duyi-Wang force-pushed the features/fused_l2n branch from 1f08039 to dd05512 Compare July 11, 2022 08:58

liutongxuan approved these changes Jul 11, 2022


[Op] Fix nn_test failed.

dc882bf

liutongxuan approved these changes Jul 12, 2022


liutongxuan merged commit c7ba50c into DeepRec-AI:main Jul 13, 2022

Duyi-Wang deleted the features/fused_l2n branch July 15, 2022 02:07
