[TFLM] Added optimized softmax kernel (int32 and float) for CEVA-BX1 and CEVA SP500 #47783
Conversation
Thanks for contributing to TensorFlow Lite Micro. To keep this process moving along, we'd like to make sure that you have completed the items on this list:
We would like to have a discussion on the GitHub issue first to determine the best path forward, and then proceed to the PR review.
small comments only.
Happy to make a PR that removes uint8 support from the reference kernel by this Friday.
#if defined(CEVA_BX1) || defined(CEVA_SP500)
extern int32_t* CEVA_TFLM_KERNELS_SCRATCH;
extern int32_t CEVA_TFLM_KERNELS_SCRATCH_SIZE_VAL;
#endif
worth putting these in a ceva_common.h or something similar to avoid repeating the same include and extern in each of the kernels?
Done
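For reference, a minimal sketch of what such a shared ceva_common.h could contain, assuming only the declarations from the hunk above (the header-guard name, include, and comments are assumptions, not the actual file):

#ifndef TENSORFLOW_LITE_MICRO_KERNELS_CEVA_CEVA_COMMON_H_
#define TENSORFLOW_LITE_MICRO_KERNELS_CEVA_CEVA_COMMON_H_

#include <cstdint>

#if defined(CEVA_BX1) || defined(CEVA_SP500)
// Scratch buffer shared by the CEVA-optimized kernels; declared once here
// instead of repeating the extern block in every kernel source file.
extern int32_t* CEVA_TFLM_KERNELS_SCRATCH;
extern int32_t CEVA_TFLM_KERNELS_SCRATCH_SIZE_VAL;
#endif

#endif  // TENSORFLOW_LITE_MICRO_KERNELS_CEVA_CEVA_COMMON_H_

Each kernel would then include ceva_common.h rather than redeclaring the externs itself.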
const TfLiteEvalTensor* input,
TfLiteEvalTensor* output,
const SoftmaxParams& op_data) {
  if (input->type == kTfLiteUInt8) {
drop uint8 support?
Done
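As a rough sketch, the dispatch from the hunk above might reduce to something like the following once the uint8 branch is gone. The function name and branch bodies here are placeholders, not the actual kernel code; only the parameter list is taken from the diff:

#include "tensorflow/lite/c/common.h"
#include "tensorflow/lite/kernels/internal/types.h"

// Placeholder dispatch with the uint8 path removed; per-type bodies elided.
TfLiteStatus EvalSoftmax(const TfLiteEvalTensor* input,
                         TfLiteEvalTensor* output,
                         const SoftmaxParams& op_data) {
  (void)op_data;  // consumed by the elided per-type implementations
  switch (input->type) {
    case kTfLiteFloat32:
      // ... float path ...
      return kTfLiteOk;
    case kTfLiteInt8:
      // ... quantized int8 path ...
      return kTfLiteOk;
    default:
      // uint8 is intentionally no longer handled.
      return kTfLiteError;
  }
}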
#ifdef MCPS_MEASUREMENT
#include "tensorflow/lite/micro/kernels/ceva/mcps_macros.h"
#endif
you could add these to ceva_common.h as well?
No change needed in this PR. If you think it is worthwhile then you can make the change in all the kernels in a follow-up PR.
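If the include does move into a shared header, one common pattern is to provide no-op fallbacks so kernels can invoke the measurement macros unconditionally. A sketch with hypothetical macro names (the real macros live in mcps_macros.h and may differ):

#ifdef MCPS_MEASUREMENT
#include "tensorflow/lite/micro/kernels/ceva/mcps_macros.h"
#else
// Hypothetical placeholder names; these compile away when measurement is off.
#define MCPS_START_MEASUREMENT()
#define MCPS_STOP_MEASUREMENT(tag)
#endif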
I'd like to have our measurement macros in all kernels, but only some of them require scratch memory, so I'll leave it as is for the time being. Eventually I think we will do some work on the whole scratch-usage issue (exact size calculations per kernel, etc.) and hopefully come up with something maintainable.
Relevant GitHub issue: #45607