Add RNN support for Pytorch #850


Merged: 21 commits into fastmachinelearning:main on Jul 23, 2024
Conversation

@JanFSchulte (Contributor)

Adds support for RNN layers (GRU, LSTM, RNN) to the PyTorch parser.

Caveat 1: We currently lack an implementation for getitem operations, so we cannot yet return the hidden state after the calculation.

Caveat 2: We currently support only a single recurrent layer, whereas PyTorch supports multiple layers within the same RNN instance.

Caveat 3: We currently don't support passing non-zero initial values for the hidden states to the RNN.

So this implementation is slightly hacky at the moment, but it might serve as a starting point for discussion and can be used by interested parties if they can live with the current limitations.
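
For illustration, a minimal sketch of a model that fits these constraints (the module name and layer sizes are made up, not taken from this PR):

```python
import torch.nn as nn


class SingleLSTM(nn.Module):
    def __init__(self):
        super().__init__()
        # A single recurrent layer (num_layers=1), per caveat 2.
        self.lstm = nn.LSTM(input_size=10, hidden_size=20, num_layers=1, batch_first=True)

    def forward(self, x):
        # No initial hidden state is passed in (caveat 3), and only the
        # output sequence is returned, not the hidden state (caveat 1).
        output, _ = self.lstm(x)
        return output
```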

Also, this contains parts of #848 because I was inattentive.

Type of change


  • New feature (non-breaking change which adds functionality)

Tests

Added pytests to confirm that the layers work.

Checklist

  • I have read the guidelines for contributing.
  • I have commented my code, particularly in hard-to-understand areas.
  • I have made corresponding changes to the documentation.
  • My changes generate no new warnings.
  • I have installed and run pre-commit on the files I edited or added.
  • I have added tests that prove my fix is effective or that my feature works.

@JanFSchulte JanFSchulte marked this pull request as ready for review August 17, 2023 14:15
@vloncar vloncar added the please test Trigger testing by creating local PR branch label Aug 17, 2023
@vloncar (Contributor) commented Aug 17, 2023

pre-commit.ci autofix

@jmitrevs jmitrevs added please test Trigger testing by creating local PR branch and removed please test Trigger testing by creating local PR branch labels Oct 20, 2023
@jmitrevs jmitrevs added this to the v1.0.0 milestone Oct 20, 2023
@jmitrevs (Contributor)

The tests fail with:

FAILED test_pytorch_api.py::test_skipped_layers[io_parallel-Vivado] - TypeError: config_from_pytorch_model() got an unexpected keyword argument 'inputs_channel_last'
FAILED test_pytorch_api.py::test_skipped_layers[io_parallel-Quartus] - TypeError: config_from_pytorch_model() got an unexpected keyword argument 'inputs_channel_last'
FAILED test_pytorch_api.py::test_skipped_layers[io_stream-Vivado] - TypeError: config_from_pytorch_model() got an unexpected keyword argument 'inputs_channel_last'
FAILED test_pytorch_api.py::test_skipped_layers[io_stream-Quartus] - TypeError: config_from_pytorch_model() got an unexpected keyword argument 'inputs_channel_last'

@jmitrevs jmitrevs added please test Trigger testing by creating local PR branch and removed please test Trigger testing by creating local PR branch labels May 3, 2024
@JanFSchulte (Contributor, Author)

All of the test failures last time around seemed to be related to issues with the tests themselves, which I have mostly fixed. The only change I made was to add missing includes to some Quartus templates to fix compilation errors when uint_8 was used.

There are still some remaining test failures in the case where activations are used via their nn.functional implementations instead of as classes. I can't reproduce these failures in a standalone file; the exact same code that fails in the pytest works fine when run in standalone Python. I have not figured out how to debug it under those circumstances.
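
For context, these are the two styles of defining an activation that the failures involve (a generic PyTorch illustration, not code from this PR):

```python
import torch.nn as nn
import torch.nn.functional as F


class ClassActivation(nn.Module):
    """Activation as a module: appears as a named child of the model."""

    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(8, 8)
        self.act = nn.ReLU()

    def forward(self, x):
        return self.act(self.fc(x))


class FunctionalActivation(nn.Module):
    """Activation as a function call: only visible in the traced graph."""

    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(8, 8)

    def forward(self, x):
        return F.relu(self.fc(x))
```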

@jmitrevs jmitrevs added please test Trigger testing by creating local PR branch and removed please test Trigger testing by creating local PR branch labels May 9, 2024
@jmitrevs jmitrevs added please test Trigger testing by creating local PR branch and removed please test Trigger testing by creating local PR branch labels May 31, 2024
@JanFSchulte JanFSchulte added please test Trigger testing by creating local PR branch and removed please test Trigger testing by creating local PR branch labels Jul 16, 2024
@JanFSchulte (Contributor, Author)

The tests here finally pass again and I think this is basically ready to merge. One thing to note: this includes a change to the PyTorch config interface that gives more options for the channels_last conversion. It can be 'full' (transposing both inputs and internal layers), 'internal' (assuming the inputs are already transposed and transposing only internal layers), or 'off'. I developed this at some point, possibly based on a discussion with Vladimir, and included it here somewhat by accident. Let me know if that's desired or should be removed.
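
As a sketch of what that interface change enables (the keyword name and shapes here are my assumption about the new API, not necessarily its final form):

```python
import torch.nn as nn

import hls4ml

model = nn.Sequential(nn.Conv1d(8, 16, kernel_size=3), nn.ReLU())

# Assumed keyword: 'full' transposes inputs and internal layers, 'internal'
# assumes inputs are already channels-last and transposes only internal
# layers, 'off' disables the conversion entirely.
config = hls4ml.utils.config_from_pytorch_model(
    model,
    input_shape=(8, 32),  # (channels, width); assumed signature
    channels_last_conversion='internal',
)
```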

@@ -92,6 +93,7 @@ def format(self, node):
params['config_mult_h'] = f'config{node.index}_h_mult'
params['act_t'] = '{}_config{}'.format(node.get_attr('activation'), str(node.index) + '_act')
params['act_recurrent_t'] = '{}_config{}'.format(node.get_attr('recurrent_activation'), str(node.index) + '_rec_act')
params['pytorch'] = 'true' if "pytorch" in node.attributes.keys() else 'false'
Contributor

Very minor point: hls4ml seems to use single quotes more often (except for docstrings), so I would change "pytorch" to 'pytorch'. There are double quotes used in other places too. But truthfully I am not convinced we need to change it and run the tests again, since this is so minor.

@@ -301,5 +306,9 @@ def __init__(self):

def format(self, node):
params = self._default_function_params(node)
params['weights'] = 'w{0}, wr{0}, b{0}'.format(str(node.index))
if "pytorch" in node.attributes.keys():
Contributor

Is it better to check that pytorch is in the keys, or that it is defined and True? What if it is defined but set to False? I wonder if using node.get_attr('pytorch') (which returns None if not found) or node.get_attr('pytorch', False) would be better.
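
A standalone illustration of the difference, using a minimal stand-in for the node object rather than hls4ml's actual class:

```python
class FakeNode:
    """Minimal stand-in for an hls4ml graph node, for illustration only."""

    def __init__(self, attributes):
        self.attributes = attributes

    def get_attr(self, key, default=None):
        return self.attributes.get(key, default)


node = FakeNode({'pytorch': False})

# Membership check is truthy whenever the key exists, even when set to False:
print('pytorch' in node.attributes.keys())  # True

# Attribute lookup with a default respects the explicit False value and
# falls back to False when the key is absent:
print(node.get_attr('pytorch', False))  # False
```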

Contributor Author

Right now it will only be defined if a recurrent layer is parsed from PyTorch, and I can't really envision a situation where code outside the PyTorch parser would set this key. But I still think you are right that this should be implemented in a more future-proof way, and node.get_attr('pytorch', False) is probably the most robust solution. I will implement it.

@JanFSchulte JanFSchulte added please test Trigger testing by creating local PR branch and removed please test Trigger testing by creating local PR branch labels Jul 23, 2024
@jmitrevs jmitrevs merged commit 75b0b0d into fastmachinelearning:main Jul 23, 2024
7 checks passed
Labels: please test (Trigger testing by creating local PR branch)

3 participants