speed optimization on convert(Frame):Bitmap #379

pfn · 2016-04-07T18:22:07Z

eliminate individual get/put calls as much as possible

saudet · 2016-04-08T01:52:10Z

Cool, are you sure this is faster though? AFAIK, Android isn't very efficient even on bulk operations. In any case, let's also cache inplus, in a manner similar to buffer to avoid reallocating memory all the time! And I'll merge this is. Thanks

pfn · 2016-04-08T02:00:04Z

Oops, I forgot to add the inplus member when I was refactoring, but it is cached and created together with buffer.

I'll fix that Monday.

As for performance, I've been testing and profiling on N preview and it is dramatically faster. Calling convert during live camera preview frames drops cpu time from 60% to something like 20%.

Behavior prior to N preview might be a little different, I haven't tested.

saudet · 2016-04-08T02:02:34Z

Faster then, cool!

BTW, a better way of doing it without inplus is to copy directly from the original buffer inside the loop everything expect the very last pixel ;)

pfn · 2016-04-08T16:27:38Z

I forgot today is Friday and not Saturday. So, my changes are updated per feedback.

eliminate individual get/put calls as much as possible

saudet · 2016-04-09T00:40:32Z

src/main/java/org/bytedeco/javacv/AndroidFrameConverter.java

+                            int b = in.get(y * stride + 3 * x + 2) & 0xff;
+                            rgba = (r << 24) | (g << 16) | (b << 8);
+                        }
+                        buffer.putInt(y * rowBytes + 4 * x, (rgba << 8) | 0xff);


There's going to be a problem here I think for the last pixel. It's not being shifted.

saudet · 2016-04-09T00:42:32Z

Great, thanks! A couple of places to fix as noted above, and it's good to merge.

saudet · 2016-04-09T00:52:23Z

But looking at this more closely, the order doesn't seem to be right. We assume an input of BGR. To be less confusing, we should inverse the order of "r" and "b". And then to make sure it always works as expected, we should force in to be LITTLE_ENDIAN. With buffer as BIG_ENDIAN it should work properly.

pfn · 2016-04-25T01:26:27Z

I haven't had a chance to get back to this to improve it further. I will
submit another pr when I get a chance

saudet · 2016-04-25T02:51:36Z

No problem, I figured I should add a test and move the old code there, to compare the results between new candidate code, against known good results. And while I'm at it I fixed your code to make the test pass. :) Thanks for the contribution and yes feel free to optimize further!

pfn force-pushed the master branch 2 times, most recently from abdeb44 to 612ad1e Compare April 8, 2016 16:08

speed optimization on convert(Frame):Bitmap

ae13613

eliminate individual get/put calls as much as possible

pfn force-pushed the master branch from 612ad1e to ae13613 Compare April 8, 2016 16:37

saudet reviewed Apr 9, 2016
View reviewed changes

saudet merged commit bc2ddec into bytedeco:master Apr 25, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speed optimization on convert(Frame):Bitmap #379

speed optimization on convert(Frame):Bitmap #379

pfn commented Apr 7, 2016

saudet commented Apr 8, 2016

pfn commented Apr 8, 2016

saudet commented Apr 8, 2016

pfn commented Apr 8, 2016

saudet Apr 9, 2016

saudet commented Apr 9, 2016

saudet commented Apr 9, 2016

pfn commented Apr 25, 2016 •

edited by saudet

saudet commented Apr 25, 2016

speed optimization on convert(Frame):Bitmap #379

speed optimization on convert(Frame):Bitmap #379

Conversation

pfn commented Apr 7, 2016

saudet commented Apr 8, 2016

pfn commented Apr 8, 2016

saudet commented Apr 8, 2016

pfn commented Apr 8, 2016

saudet Apr 9, 2016

Choose a reason for hiding this comment

saudet commented Apr 9, 2016

saudet commented Apr 9, 2016

pfn commented Apr 25, 2016 • edited by saudet

saudet commented Apr 25, 2016

pfn commented Apr 25, 2016 •

edited by saudet