StableDiffusion Tensorflow to TF Lite #1033
Thanks @charbull for the report! Will take a look at this. TFLite conversion would be an awesome addition. |
Leaving another conversion issue here, related to the TFLite 2 GB max size limit from protobuf: divamgupta/stable-diffusion-tensorflow#58 (comment) |
Looking forward to having a TFLite StableDiffusion on the Coral :) |
I was able to convert

def _create_broadcast_shape(self, input_shape):
    broadcast_shape = [1] * len(input_shape)
    broadcast_shape[self.axis] = input_shape[self.axis] // self.groups
    broadcast_shape.insert(self.axis, self.groups)
    return broadcast_shape

to

def _create_broadcast_shape(self, input_shape):
    broadcast_shape = [1] * input_shape.shape.rank
    broadcast_shape[self.axis] = input_shape[self.axis] // self.groups
    broadcast_shape.insert(self.axis, self.groups)
    return broadcast_shape
|
Thank you @freedomtan, I am trying those now. |
Hi @freedomtan
|
@charbull batch_size isn't configured at a StableDiffusion-wide level in our implementation, but rather passed as a parameter to specific SD flows (e.g. the |
@ianstenbit oh I see ! thank you for the clarification. I am not there yet. |
I wonder why the 2 GB limitation applies here, since the weights are not part of the proto? |
Yup, you have to modify the source code; either adding an argument or directly modifying the Input layers works :-) |
Yup, that's not trivial. In TensorFlow 2.10 and before (I didn't check if it is changed in 2.11 or later), it seems that when you try to convert a model to tflite from a concrete function, it is converted to a frozen GraphDef first. Thus, the protobuf 2 GiB limitation applies. |
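For reference, a minimal sketch of the concrete-function conversion path under discussion (a toy Dense model stands in for the real graph; none of these names come from the thread). When convert() runs, the traced function is frozen to a GraphDef, which is where the protobuf 2 GiB limit bites for a model the size of Stable Diffusion:

import tensorflow as tf

# Toy model standing in for a large graph (illustrative only).
model = tf.keras.Sequential([tf.keras.layers.Dense(8, input_shape=(4,))])

# Trace the model into a concrete function with a fixed input signature.
run_model = tf.function(lambda x: model(x))
concrete_fn = run_model.get_concrete_function(tf.TensorSpec([1, 4], tf.float32))

# Conversion from a concrete function freezes the graph to a GraphDef first,
# so models whose frozen graph exceeds 2 GiB fail at this step.
converter = tf.lite.TFLiteConverter.from_concrete_functions([concrete_fn], model)
tflite_bytes = converter.convert()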
I see, that makes sense. Though in the future, I'd really hope this part of the logic gets done at TFLite runtime, instead of at model saving. |
you mean the 2GB limit is still true across 2.10 and master, correct? |
Yes, that link is pointing to master. Then, if we look at the conversion workflow: https://www.tensorflow.org/lite/models/convert?hl=en#model_conversion |
Nope, there are two 2 GiB limitations: one is protobuf, the other is FlatBuffers, which is the file format of TFLite. |
Right, I think this is suboptimal; freezing can be done at runtime. I will try to figure out how to push this forward with TFLite separately. |
But I think that we could probably still solve this using something like https://github.com/tensorflow/hub/blob/master/examples/text_embeddings/export.py#L204-L219. What do you think? |
As I saw another case pointing to that workaround (tensorflow/tensorflow#47326 (comment)) |
Hmm... that's basically rewriting the graph and using a placeholder instead... is this even doable in TF2? |
@abattery What do you think? |
I was able to make some progress based on advice from @costiash here: divamgupta/stable-diffusion-tensorflow#58 (comment)
It generates a float16 TFLite file of ~1.6 GB. However, since I am running TFLite on the Edge TPU, it seems I need to go to full int8 quantization. According to the reference, a https://www.tensorflow.org/api_docs/python/tf/lite/RepresentativeDataset is needed for full quantization:
Are there samples I can use from this project, or do I need to generate some images from TF Stable Diffusion and use them? |
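For context, a minimal sketch of the kind of float16 post-training conversion described above. It assumes conversion straight from the keras_cv model (the linked comment used a different workaround), and the output file name is illustrative:

import tensorflow as tf
import keras_cv

model = keras_cv.models.StableDiffusion(img_width=512, img_height=512)

# Float16 post-training quantization: no representative dataset required.
converter = tf.lite.TFLiteConverter.from_keras_model(model.diffusion_model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.target_spec.supported_types = [tf.float16]
tflite_fp16 = converter.convert()

with open("diffusion_model_fp16.tflite", "wb") as f:
    f.write(tflite_fp16)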
@charbull, my 2 cents: DO NOT use it. With a recent TF master + keras_cv (0.3.4 + the group norm patch), converting from the Keras model works like a charm.
The TF master I tested was built from tensorflow/tensorflow@680a9b2 |
@freedomtan thank you! We are getting closer. It turns out I need to run the following for the Edge TPU quantization: https://www.tensorflow.org/lite/performance/post_training_quantization#integer_only
I am trying to figure out how to get the representative dataset.
Are there samples I can use from this project, or do I need to generate some images from TF Stable Diffusion and use them? Cheers, |
I don't remember what checkpoint we have used. Probably for the calibration you can pass prompt samples from: |
@bhack thank you ! will give it a try :) |
Hi, I tried the following so far with @ianstenbit.
I. I generated images from prompts and put them in a csv file so that I can prepare the representative_dataset.
Then the conversion:
Getting the following error, not sure what is the issue:
II. Tried to cut the conversion for the encoder alone:
which also produced the same error:
Any ideas what is going wrong? Thank you |
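In case it helps others reading along, here is a minimal sketch of a representative dataset built from a CSV of image file paths. The file name images.csv, the "path" column, the 512x512 size, the [0, 1] scaling, and the choice of the image_encoder sub-model are all assumptions for illustration, not taken from the original (elided) code:

import csv

import tensorflow as tf
import keras_cv

model = keras_cv.models.StableDiffusion(img_width=512, img_height=512)

def representative_dataset():
    # Assumes a single float32 image input of shape (1, 512, 512, 3);
    # the exact preprocessing must match what the converted sub-model expects.
    with open("images.csv") as f:
        for row in csv.DictReader(f):
            image = tf.io.decode_image(
                tf.io.read_file(row["path"]), channels=3, expand_animations=False
            )
            image = tf.image.resize(image, (512, 512))
            image = tf.cast(image, tf.float32) / 255.0
            yield [tf.expand_dims(image, axis=0)]

converter = tf.lite.TFLiteConverter.from_keras_model(model.image_encoder)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_dataset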
As discussed, I think what you stored in In order to convert the full StableDiffusion model, we'll need to either
|
@charbull
|
@freedomtan @ianstenbit What would be the best approach to get to TFLite int8 in this case? Would the Edge TPU "know" how to handle this? There is also the TFLite interpreter. I am not sure exactly what would be the best way going forward :) If you can highlight the steps and the libraries/tools, I am happy to give it a shot. Cheers |
@charbull FYR, I gave it a try. Ideally, if we feed input tensors with the correct ranges, we should be able to get a quantized model.
import math

import tensorflow as tf
import keras_cv

model = keras_cv.models.StableDiffusion(img_width=512, img_height=512)

prompt_1 = "A watercolor painting of a Golden Retriever at the beach"
encoding_1 = model.encode_text(prompt_1)

def get_timestep_embedding(timestep, batch_size, dim=320, max_period=10000):
    half = dim // 2
    freqs = tf.math.exp(
        -math.log(max_period) * tf.range(0, half, dtype=tf.float32) / half
    )
    args = tf.convert_to_tensor([timestep], dtype=tf.float32) * freqs
    embedding = tf.concat([tf.math.cos(args), tf.math.sin(args)], 0)
    embedding = tf.reshape(embedding, [1, -1])
    return tf.repeat(embedding, batch_size, axis=0)

def representative_data_gen():
    for i in range(1):
        em = get_timestep_embedding(i, 1)
        noise = tf.random.normal((1, 64, 64, 4))  # shape must be passed as a single tuple
        yield [encoding_1, em, noise]

converter = tf.lite.TFLiteConverter.from_keras_model(model.diffusion_model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.uint8  # or tf.int8
converter.inference_output_type = tf.uint8  # or tf.int8
converter.representative_dataset = representative_data_gen
tflite_model_int8 = converter.convert()

Unfortunately, I got the following message. Dunno how to solve it yet.
|
It turns out an important message escaped my attention:
But the quantized model isn't supposed to be bigger than 2 GB :-( Checking the converter source code, it seems that for TFLite full-integer quantization, a TF model is converted to a TFLite model and an intermediate tflite model file is saved. Then the converter tries to optimize the intermediate TFLite model. But since the intermediate tflite model is > 2 GB, the conversion failed. |
@freedomtan, thank you for updating here. Should we save each model separately and then convert them to tflite/int8? |
@charbull |
hey @charbull so are all TFLite models converted? |
Hey @LukeWood @freedomtan, I wasn't able to make the TFLite/int8 conversion work yet, as the int8 conversion requires data samples. |
@charbull FYR. I wrote a script that could be used to split the diffusion model into two chunks, so that it will be trivial to convert the two split models into (fp32) tflite models and easier to generate post-training quantization models. https://github.com/freedomtan/keras_cv_stable_diffusion_to_tflite |
@charbull Generating quantized int8 models also works, see https://github.com/freedomtan/keras_cv_stable_diffusion_to_tflite/blob/main/convert_keras_diffusion_model_into_two_tflite_models_qint8.ipynb |
@freedomtan thank you, I ran the qint8 version, but for some reason it is crashing my notebook. Maybe I need to upgrade :) |
@charbull you mean you tried my conversion Jupyter notebook? my guess is, mostly memory problem (out of memory, etc.) |
how does the tokenizer work in this conversion? |
I don't really know how to generate representative dataset, which is essential for getting reasonable accuracy. I just showed that quantization is possible :-) |
Added a Jupyter notebook showing that the converted tflite models work: https://github.com/freedomtan/keras_cv_stable_diffusion_to_tflite/blob/main/text_to_image_using_converted_tflite_models.ipynb. Most of the code is from the keras_cv implementation; I replaced the Keras model inference code with tflite model inference. It's not a complete implementation, just a test per the request from @sayakpaul. |
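For anyone following along, a minimal sketch of the TFLite-interpreter side of that swap. The file name and the dummy inputs are illustrative, not taken from the notebook:

import numpy as np
import tensorflow as tf

# Load one of the converted .tflite files and run a single inference.
interpreter = tf.lite.Interpreter(model_path="diffusion_model_fp16.tflite")
interpreter.allocate_tensors()

input_details = interpreter.get_input_details()
output_details = interpreter.get_output_details()

# Feed dummy data matching each declared input; real code would feed the
# text encoding, timestep embedding, and latent noise instead.
for detail in input_details:
    dummy = np.zeros(detail["shape"], dtype=detail["dtype"])
    interpreter.set_tensor(detail["index"], dummy)

interpreter.invoke()
output = interpreter.get_tensor(output_details[0]["index"])
print(output.shape)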
Here's an end to end Colab notebook: https://github.com/sayakpaul/Adventures-in-TensorFlow-Lite/blob/master/Stable_Diffusion_to_TFLite.ipynb building on top of @freedomtan's awesome work. |
@freedomtan May I ask how you got this error message for integer-only quantization?
I can only get the original error message which is
|
Error Message:
Model_Id: https://huggingface.co/kadirnar/dreambooth_diffusion_model_v5 |
This issue is stale because it has been open for 180 days with no activity. It will be closed if no further activity occurs. Thank you. |
Up |
Hi @LukeWood,
For fun, I tried converting the Stable Diffusion model from TensorFlow to TF Lite, so that I can run it on the Coral Edge TPU.
I tried two approaches:
I- Saved model approach
II- Go through h5
I will try to document them as much as possible (sorry in advance for the long traces).
For both, I am using:
!pip install --upgrade keras-cv
I was not able to save the model with either approach.

I- Saved model approach:

II- Go through h5:
TF 2.11.0 does not load h5 files anymore, so I removed tf 2.11.0 and installed tf2.1.0. The load_model call throws the following error:
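For completeness, a minimal sketch of the two export paths described above. The choice of the diffusion_model sub-model and the file names are illustrative; this is not the exact code from the issue:

import tensorflow as tf
import keras_cv

model = keras_cv.models.StableDiffusion(img_width=512, img_height=512)

# I- Saved model approach: export a sub-model as a TF SavedModel directory.
tf.saved_model.save(model.diffusion_model, "diffusion_model_savedmodel")

# II- H5 approach: save the same sub-model as a single .h5 file.
# Loading it back requires a TF/Keras version that still supports h5, and
# custom layers may need to be passed via custom_objects=... on load.
model.diffusion_model.save("diffusion_model.h5")
loaded = tf.keras.models.load_model("diffusion_model.h5")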