
Softmax LUT Optimization #570

Merged: 2 commits into fastmachinelearning:main, Aug 12, 2022

Conversation

@bo3z (Contributor) commented on Jun 16, 2022

This PR exploits the following properties of the current Softmax implementation:

  1. The maximum input is subtracted from all inputs before the exponential is calculated, so the argument of the exponential is always non-positive. The exponential lookup table therefore only needs entries for negative inputs.
  2. The sum of exponentials is always positive, so the invert table only needs entries for positive inputs, not negative ones (both properties are sketched in the code below).

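A minimal software sketch of how both properties shrink the tables' domains (an illustration only: the table sizes, the ranges, and the `float` arithmetic are assumptions made here; the actual hls4ml implementation operates on fixed-point types taken from the layer configuration):

```cpp
#include <algorithm>
#include <array>
#include <cmath>
#include <cstddef>

// Illustrative constants; the real code derives these from the layer config.
constexpr std::size_t N_TABLE = 1024;
constexpr float EXP_RANGE = 8.0f;  // exp LUT covers arguments in [-EXP_RANGE, 0)
constexpr float INV_RANGE = 64.0f; // invert LUT covers sums in (0, INV_RANGE]

// Property 1: after subtracting the maximum, every exp() argument is <= 0,
// so the table samples only the negative half-axis.
const std::array<float, N_TABLE> exp_table = [] {
    std::array<float, N_TABLE> t{};
    for (std::size_t i = 0; i < N_TABLE; ++i)
        t[i] = std::exp(-EXP_RANGE + EXP_RANGE * (i + 0.5f) / N_TABLE);
    return t;
}();

// Property 2: the sum of exponentials is strictly positive, so the
// reciprocal table samples only positive inputs.
const std::array<float, N_TABLE> inv_table = [] {
    std::array<float, N_TABLE> t{};
    for (std::size_t i = 0; i < N_TABLE; ++i)
        t[i] = 1.0f / (INV_RANGE * (i + 0.5f) / N_TABLE);
    return t;
}();

// Map a fraction in [0, 1] to the nearest table index, saturating at the ends.
std::size_t to_index(float frac) {
    long idx = std::lround(frac * N_TABLE - 0.5f);
    return static_cast<std::size_t>(
        std::clamp(idx, 0L, static_cast<long>(N_TABLE) - 1));
}

template <std::size_t N>
void softmax_lut(const std::array<float, N>& x, std::array<float, N>& y) {
    const float x_max = *std::max_element(x.begin(), x.end());
    float sum = 0.0f;
    std::array<float, N> e;
    for (std::size_t i = 0; i < N; ++i) {
        const float d = x[i] - x_max;                          // always <= 0
        e[i] = exp_table[to_index((d + EXP_RANGE) / EXP_RANGE)];
        sum += e[i];
    }
    const float inv = inv_table[to_index(sum / INV_RANGE)];    // sum > 0
    for (std::size_t i = 0; i < N; ++i)
        y[i] = e[i] * inv;
}
```

Because every table entry now corresponds to a reachable input, the same table size gives a finer effective resolution, which matches the MAPE improvement reported below.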
The following plot shows the accuracy of hls4ml (old vs. new) compared to Keras at identifying the argmax; the accuracy is identical for both implementations:
[Plot: acc — argmax accuracy, old vs. new hls4ml vs. Keras]

The following plot shows the mean absolute percentage error of hls4ml (old vs. new) relative to Keras's output; the new version performs slightly better than the old one because more of the LUT entries are "relevant", i.e. they cover only reachable inputs:
[Plot: mape — mean absolute percentage error vs. Keras, old vs. new]

Finally, the arrays holding the differences between the elements and the maximum were removed (they brought no significant accuracy benefit), saving resources, as seen below:
[Figure: softmax_results — resource usage, old vs. new implementation]
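For illustration, dropping the intermediate difference arrays amounts to fusing the subtraction into the lookup loop, roughly as below (a sketch with hypothetical names: `exp_index` and `exp_table_vals` stand in for the real quantization logic and table; this is not the actual hls4ml code):

```cpp
#include <cstddef>

// Hypothetical stand-ins for the real index mapping and LUT.
std::size_t exp_index(float d);           // maps a non-positive difference to a LUT index
extern const float exp_table_vals[1024];  // exp LUT as in the sketch above

// Fused form: each difference x[i] - x_max feeds its table lookup directly,
// so no intermediate array of differences has to be stored.
void exp_pass(const float* x, float x_max, float* e, std::size_t n) {
    for (std::size_t i = 0; i < n; ++i)
        e[i] = exp_table_vals[exp_index(x[i] - x_max)];
}
```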

@bo3z bo3z requested a review from vloncar June 16, 2022 14:42
@bo3z (Contributor, Author) commented on Aug 4, 2022

@thesps maybe you could review this PR, since you reviewed the first Quartus Softmax implementation?

@vloncar merged commit 9af6106 into fastmachinelearning:main on Aug 12, 2022