
Use ops.rsqrt, improve normalization layers and enable ops fusion in tflite #892

Merged
merged 6 commits into keras-team:main from improve-normalization-layers
Sep 16, 2023

Conversation

@james77777778 (Contributor) commented Sep 15, 2023

Fixes #824

This PR accomplishes the following:

  1. Add support for rsqrt in the numpy backend (using jax's implementation)
  2. Replace 1 / ops.sqrt(x) with ops.rsqrt for improved speed
  3. Reorder the ops in the normalization layers to unify the implementations and match the expression of tf.nn.batch_normalization (link); see the sketch below
  4. Ensure 100% unit test coverage for all normalization layers

After step 3, tflite recognizes the CONV+BN+ReLU pattern and fuses the ops successfully.
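
A minimal sketch of the reordered expression (hypothetical helper and variable names, not the actual layer code): the scale and shift are folded into two tensors so the normalization reduces to a single multiply followed by a single add, which is the pattern the TFLite converter can fold into the preceding convolution.

from keras_core import ops

def batch_norm_sketch(x, mean, variance, gamma, beta, epsilon=1e-3):
    # Fold gamma into the reciprocal square root (ops.rsqrt) ...
    inv = ops.rsqrt(variance + epsilon) * gamma
    # ... and fold mean/beta into a single shift term.
    shift = -mean * inv + beta
    # Final pattern: one multiply + one add, matching tf.nn.batch_normalization.
    return x * inv + shift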

standalone MobileNetV3 export script
import tensorflow as tf

from keras_core.applications.mobilenet_v3 import MobileNetV3Small

keras_core_model = MobileNetV3Small(
    input_shape=(224, 224, 3), minimalistic=True
)

# Wrap the model's call in a tf.function with a fixed input signature.
tf_callable = tf.function(
    keras_core_model.call,
    input_signature=[tf.TensorSpec((1, 224, 224, 3), tf.float32)],
    autograph=True,
    jit_compile=True,
)
tf_concrete_function = tf_callable.get_concrete_function()

# Convert the concrete function to a tflite flatbuffer with default optimizations.
converter = tf.lite.TFLiteConverter.from_concrete_functions(
    [tf_concrete_function], tf_callable
)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
tflite_model = converter.convert()
with open("model.tflite", "wb") as f:
    f.write(tflite_model)

The visualization from netron: (before this PR vs. after this PR)
(netron screenshot omitted)
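
As an alternative to netron, one way to inspect the op list of the converted model is TensorFlow's model analyzer (not part of this PR; assumes a TF version where tf.lite.experimental.Analyzer is available, roughly 2.7+):

import tensorflow as tf

# Print the op-level structure of model.tflite; after this PR the CONV_2D ops
# should carry fused activations instead of separate MUL/ADD/RELU ops.
tf.lite.experimental.Analyzer.analyze(model_path="model.tflite")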

benchmark script
from keras_core import layers
from keras_core import mixed_precision
from keras_core import models
from keras_core import ops

# "float32"
# "mixed_float16"
# "mixed_bfloat16"
dtype_policy = "float32"
mixed_precision.set_dtype_policy(dtype_policy)

x_train = ops.random.uniform(shape=(512, 64, 64, 64))
y_train = ops.random.uniform(shape=(512, 64, 64, 64))

# layers.BatchNormalization
# layers.GroupNormalization
# layers.LayerNormalization
normalization_cls = layers.LayerNormalization
normalization_args = {}
if normalization_cls is layers.GroupNormalization:
    normalization_args = {"groups": -1}

model = models.Sequential(
    [
        layers.InputLayer(shape=(64, 64, 64)),
        normalization_cls(**normalization_args),
        normalization_cls(**normalization_args),
        normalization_cls(**normalization_args),
    ]
)
model.compile(loss="mse", optimizer="adam")
model.fit(x_train, y_train, batch_size=128, epochs=3)

And the improvement:

Backend    | Layer              | Before this PR | After this PR
tensorflow | BatchNormalization | 48ms/step      | 46ms/step
jax        | BatchNormalization | 49ms/step      | 48ms/step
torch      | BatchNormalization | 127ms/step     | 127ms/step
tensorflow | GroupNormalization | 50ms/step      | 49ms/step
jax        | GroupNormalization | 51ms/step      | 50ms/step
torch      | GroupNormalization | 129ms/step     | 129ms/step
tensorflow | LayerNormalization | 54ms/step      | 53ms/step
jax        | LayerNormalization | 55ms/step      | 54ms/step
torch      | LayerNormalization | 165ms/step     | 122ms/step

codecov bot commented Sep 15, 2023

Codecov Report

Patch coverage: 100.00% and project coverage change: +0.25% 🎉

Comparison is base (94b5361) 76.56% compared to head (10e4a03) 76.82%.
Report is 4 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #892      +/-   ##
==========================================
+ Coverage   76.56%   76.82%   +0.25%     
==========================================
  Files         329      329              
  Lines       31429    31426       -3     
  Branches     6114     6111       -3     
==========================================
+ Hits        24064    24143      +79     
+ Misses       5786     5719      -67     
+ Partials     1579     1564      -15     
Flag Coverage Δ
keras_core 76.72% <100.00%> (+0.25%) ⬆️

Flags with carried forward coverage won't be shown.

Files Changed Coverage Δ
keras_core/backend/numpy/math.py 82.43% <100.00%> (+0.24%) ⬆️
...s_core/layers/normalization/batch_normalization.py 100.00% <100.00%> (ø)
...s_core/layers/normalization/group_normalization.py 97.64% <100.00%> (+8.63%) ⬆️
...s_core/layers/normalization/layer_normalization.py 100.00% <100.00%> (+2.59%) ⬆️
...as_core/layers/normalization/unit_normalization.py 100.00% <100.00%> (+7.69%) ⬆️

... and 11 files with indirect coverage changes


@fchollet (Member) left a comment

Thank you for the PR!



def rsqrt(x):
    return np.array(jax_rsqrt(x))
@fchollet (Member) commented:

Why not 1. / sqrt(x)? It's numpy native, and we're not worried about performance for the numpy backend.

@james77777778 (Contributor, Author) replied:

I was thinking of being consistent with the other backends, but it should be okay to use 1. / sqrt(x).
Fixed.
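
For reference, a numpy-native version along the lines of the suggestion would presumably look like this (a sketch, not necessarily the exact code that landed):

import numpy as np

def rsqrt(x):
    # Reciprocal square root using plain numpy, as suggested above.
    return 1.0 / np.sqrt(x)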

res = res + beta

# Note: Folding BatchNormalization depends on the precise order of ops
# that are generated by the expression below
@fchollet (Member) commented:

Good comment!

@fchollet (Member) left a comment

LGTM -- thank you for the great contribution!

@fchollet merged commit c663efd into keras-team:main on Sep 16, 2023 (8 checks passed).
@james77777778 deleted the improve-normalization-layers branch on September 17, 2023.
Linked issue: tflite cannot fuse BatchNormalization in Keras Core as effectively as in the original Keras