
WIP Add custom KL loss layer HLS implementation #606

Merged
merged 20 commits into main on Feb 10, 2023

Conversation

katyagovorkova (Contributor)

Adds an implementation of the KL loss layer used for CMS anomaly detection at L1.
An example of usage of the KL layer is in contrib/kl_layer.py, and the HLS part is in hls4ml/templates/vivado/nnet_utils/nnet_distance.h.
The original implementation of the KL layer is available on the AE_L1_paper branch; this PR updates the implementation for the new layer API.
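
For context, a minimal sketch of the standard VAE-style KL divergence term that such a layer computes from the latent mean and log-variance; the names below are illustrative and not the PR's exact code, but the op sequence matches the computation graph shown later in this thread:

import tensorflow as tf

def kl_divergence(z_mean, z_log_var):
    # KL(N(mean, var) || N(0, 1)) = -0.5 * mean(1 + log_var - mean^2 - exp(log_var))
    kl = 1.0 + z_log_var - tf.math.square(z_mean) - tf.math.exp(z_log_var)
    return -0.5 * tf.math.reduce_mean(kl, axis=-1, keepdims=True)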

Type of change

  • Documentation update
  • New feature (non-breaking change which adds functionality)

Tests

The test creates a dummy Keras model which includes the KL loss layer, converts it to an hls4ml model
and synthesises it.

Test Configuration:
To run the test, execute: python contrib/kl_layer.py
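
For orientation, a minimal sketch of the conversion and synthesis flow such a test drives through the hls4ml Python API, assuming the custom KLLoss layer has already been registered via the extensions API as in contrib/kl_layer.py; the stand-in model and output directory below are illustrative:

import tensorflow as tf
import hls4ml

# Illustrative stand-in for the dummy model built in the test; the real test
# inserts the custom KLLoss layer registered through the extensions API.
model = tf.keras.Sequential([tf.keras.layers.Dense(10, input_shape=(8,))])

config = hls4ml.utils.config_from_keras_model(model, granularity='name')
hls_model = hls4ml.converters.convert_from_keras_model(
    model,
    hls_config=config,
    output_dir='hls4mlprj_kl_layer',  # illustrative project directory
    backend='Vivado',
)
hls_model.compile()                      # C simulation build
hls_model.build(csim=False, synth=True)  # run Vivado HLS synthesis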

Checklist

  • I have read the guidelines for contributing.
  • I have commented my code, particularly in hard-to-understand areas.
  • I have made corresponding changes to the documentation.
  • My changes generate no new warnings.
  • I have added tests that prove my fix is effective or that my feature works.

@jmitrevs requested a review from gabhijith on August 4, 2022 at 22:36.
@jmitrevs added the "please test" label (Trigger testing by creating local PR branch) on Jan 18, 2023.
@jmitrevs (Contributor) commented Jan 18, 2023

I think maybe all of this should go into contrib, since it's not used directly but rather through the extensions API. Potentially make a kl_loss directory in there with these two files plus a readme to explain how to use it. I think that would be useful.

@jmitrevs readme updated!
remove trailing whitespace
@jmitrevs (Contributor)

When I run python kl_layer.py, I get:

Interpreting Model
Traceback (most recent call last):
  File "/Users/jmitrevs/work/hls4ml/contrib/kl_layer/kl_layer.py", line 189, in <module>
    test_extensions(test_root_path)
  File "/Users/jmitrevs/work/hls4ml/contrib/kl_layer/kl_layer.py", line 168, in test_extensions
    hmodel = hls4ml.converters.convert_from_keras_model(
  File "/Users/jmitrevs/work/hls4ml/hls4ml/converters/__init__.py", line 241, in convert_from_keras_model
    return keras_to_hls(config)
  File "/Users/jmitrevs/work/hls4ml/hls4ml/converters/keras_to_hls.py", line 384, in keras_to_hls
    layer_list, input_layers, output_layers = parse_keras_model(model_arch, reader)
  File "/Users/jmitrevs/work/hls4ml/hls4ml/converters/keras_to_hls.py", line 298, in parse_keras_model
    raise Exception('ERROR: Unsupported layer type: {}'.format(keras_layer['class_name']))
Exception: ERROR: Unsupported layer type: TFOpLambda

Does it succeed for you?

@vloncar (Contributor) commented Jan 23, 2023

@jmitrevs You're probably using TF 2.8 or newer, where the information about the custom layer is not embedded in the model when saving it to disk; rather, its computation graph is embedded. So when loading the model back you get these lambda ops. You can try saving and then loading the model back and printing its summary() to see if this is the cause. I found no solution for this and was hoping TF would revert to the old behaviour (because this one is of dubious utility) in a newer release, but I didn't check whether the latest TF did so.
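
A minimal sketch of the round-trip check suggested above; the function name and file path are illustrative, not part of the PR:

import tensorflow as tf

def check_custom_layer_survives(model, path='kl_model_check.h5'):
    # Save the model containing the custom layer, load it back, and print the summary.
    # If the custom layer shows up as TFOpLambda entries after reloading, TF stored
    # the layer's computation graph rather than the layer itself, which is the
    # failure mode described above.
    model.save(path)
    reloaded = tf.keras.models.load_model(path)
    reloaded.summary()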

@jmitrevs (Contributor)

The model that the test seems to parse is:

 Layer (type)                   Output Shape         Param #     Connected to                     
==================================================================================================
 input_1 (InputLayer)           [(None, 19, 3, 1)]   0           []                               
                                                                                                  
 dense_1 (Dense)                (None, 19, 3, 10)    20          ['input_1[0][0]']                
                                                                                                  
 dense (Dense)                  (None, 19, 3, 10)    20          ['input_1[0][0]']                
                                                                                                  
 tf.__operators__.add (TFOpLamb  (None, 19, 3, 10)   0           ['dense_1[0][0]']                
 da)                                                                                              
                                                                                                  
 tf.math.square (TFOpLambda)    (None, 19, 3, 10)    0           ['dense[0][0]']                  
                                                                                                  
 tf.math.subtract (TFOpLambda)  (None, 19, 3, 10)    0           ['tf.__operators__.add[0][0]',   
                                                                  'tf.math.square[0][0]']         
                                                                                                  
 tf.math.exp (TFOpLambda)       (None, 19, 3, 10)    0           ['dense_1[0][0]']                
                                                                                                  
 tf.math.subtract_1 (TFOpLambda  (None, 19, 3, 10)   0           ['tf.math.subtract[0][0]',       
 )                                                                'tf.math.exp[0][0]']            
                                                                                                  
 tf.math.reduce_mean (TFOpLambd  (None, 19, 3, 1)    0           ['tf.math.subtract_1[0][0]']     
 a)                                                                                               
                                                                                                  
 tf.math.multiply (TFOpLambda)  (None, 19, 3, 1)     0           ['tf.math.reduce_mean[0][0]']    
                                                                                                  
==================================================================================================

@jmitrevs (Contributor)

So I think it is as @vloncar says. Let's see what options we have to proceed.

@vloncar (Contributor) commented Jan 23, 2023

Maybe add get_config() to the Keras implementation and try it like that? Also try decorating the layer with tf.keras.utils.register_keras_serializable.
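
A hedged sketch of both suggestions applied to a custom layer; the class body below is illustrative and not the PR's actual KLLoss implementation:

import tensorflow as tf

@tf.keras.utils.register_keras_serializable(package='hls4ml_contrib')
class KLLoss(tf.keras.layers.Layer):
    # Illustrative custom layer carrying serialization support.
    def call(self, inputs):
        z_mean, z_log_var = inputs
        kl = 1.0 + z_log_var - tf.math.square(z_mean) - tf.math.exp(z_log_var)
        return -0.5 * tf.math.reduce_mean(kl, axis=-1, keepdims=True)

    def get_config(self):
        # Returning a proper config lets Keras reconstruct the layer on load
        # instead of falling back to the raw op graph.
        return super().get_config()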

@jmitrevs (Contributor)

Is there anything we can learn from test_extensions.py, which still works?

@jmitrevs (Contributor) commented Jan 23, 2023

My quick attempts with get_config and tf.keras.utils.register_keras_serializable didn't really do anything, though I don't really know what I'm doing, so I could easily have done it wrong.

@vloncar (Contributor) commented Jan 23, 2023

Upon digging a bit more, it turns out this implementation is problematic, not the TF version itself. Apparently we shouldn't use tensorflow.python.keras.* anymore and should go with tensorflow.keras.* instead. I don't even remember why we used the former. With that change the issue is gone. But now the issue is how to import the required base class _Merge, which has changed location sometime between 2.7 and the current version (too lazy to investigate exactly when). So I propose we do something like:

try:
    from keras.layers.merge import _Merge as Merge
except Exception:
    from keras.layers.merging.base_merge import _Merge

Resolved (outdated) review threads on: contrib/kl_layer/README.md, contrib/kl_layer/kl_layer.py (three threads).

// Internal info
static const unsigned table_size = 1024;
static constexpr float exp_range = 1024;
Contributor:

Does this really need to be float? It would likely have bad QoR if it is not a power-of-two integer.

Contributor Author:

what should it be instead of float?

@katyagovorkova changed the title from "Add custom KL loss layer HLS implementation" to "WIP Add custom KL loss layer HLS implementation" on Jan 25, 2023.
@jmitrevs (Contributor)

I think you need these changes:

(fastml39) mac-137349:hls4ml jmitrevs$ git diff
diff --git a/contrib/kl_layer/kl_layer.py b/contrib/kl_layer/kl_layer.py
index 198fb012..318c2f46 100644
--- a/contrib/kl_layer/kl_layer.py
+++ b/contrib/kl_layer/kl_layer.py
@@ -14,7 +14,7 @@ import tensorflow as tf
 try:
     from keras.layers.merge import _Merge as Merge
 except Exception:
-    from keras.layers.merging.base_merge import _Merge
+    from keras.layers.merging.base_merge import _Merge as Merge
     
 from tensorflow.python.keras.utils import tf_utils
 from tensorflow.python.ops import math_ops
@@ -110,7 +110,7 @@ class HKLLossFunctionTemplate(hls4ml.backends.template.FunctionCallTemplate):
 
 
 # Parser for converter
-def parse_klloss_layer(keras_layer, input_names, input_shapes, data_reader, config):
+def parse_klloss_layer(keras_layer, input_names, input_shapes, data_reader):
     assert 'KLLoss' in keras_layer['class_name']
 
     layer = parse_default_keras_layer(keras_layer, input_names)

However, I still get a KeyError: 'accum_t' in this format method:

    def format(self, node):
        params = self._default_config_params(node)
        params['n_in'] = node.get_input_variable(node.inputs[0]).shape[0]
        params['n_out'] = 1
        return self.template.format(**params)

around line 90 of kl_layer.py.
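
One possible (unverified) workaround sketch for that missing key: supply a fallback accumulator type before formatting the template. The use of setdefault and the 'model_default_t' name are illustrative assumptions, not what the PR ultimately did:

    def format(self, node):
        params = self._default_config_params(node)
        params['n_in'] = node.get_input_variable(node.inputs[0]).shape[0]
        params['n_out'] = 1
        # Hypothetical fix: if the template string references {accum_t} but the
        # layer no longer defines that attribute, fall back to a default type.
        params.setdefault('accum_t', 'model_default_t')
        return self.template.format(**params)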

@katyagovorkova (Contributor, Author)

I think you need these changes:

Fixed, thanks!

However, I still get a KeyError: 'accum_t' in this format method:

    def format(self, node):
        params = self._default_config_params(node)
        params['n_in'] = node.get_input_variable(node.inputs[0]).shape[0]
        params['n_out'] = 1
        return self.template.format(**params)

around line 90 of kl_layer.py.

Ah, I see. That's most likely because I removed the Distance class as Vladimir suggested, but it seems like it required more changes than just removing the class. Also, I cannot test it locally since I get a different error when running: Exception: Optimization pass vivado:clone_output already registered

@vloncar (Contributor) commented Feb 10, 2023

I fixed the outstanding issues. Unfortunately, pre-commit broke for me so I couldn't run it, hence the noise until I manually made it compliant. So annoying...

Anyway, it is ready now.

@vloncar (Contributor) left a comment

Looks good, thanks Katya!

@vloncar merged commit 85b9531 into fastmachinelearning:main on Feb 10, 2023.
calad0i pushed a commit to calad0i/hls4ml that referenced this pull request on Jul 1, 2023:

* add kl layer

* separate hls part; clean up and add docs

* create KL layer folder in contrib and move the files there

* pass pre-commit check

* README and fix pre-commit issue

* update readme

* fix formatting

* add readme

* Update README.md

@jmitrevs readme updated!

* Update README.md

remove trailing whitespace

* Update kl_layer.py

* Rename nnet_distance.h to kl_layer.h

* Update README.md

* Update kl_layer.py

* Update kl_layer.h

* fix pre-commit

* Fix KLLoss layer example

---------

Co-authored-by: Jovan Mitrevski <jmitrevs@fnal.gov>
Co-authored-by: Vladimir Loncar <vloncar@users.noreply.github.com>