SIMD Encoder : an AVX2 implementation #143

MKCG · 2021-12-28T22:26:38Z

Hello,
this pull request implements an AVX2 based encoder.

It also introduces a new structure named qoi_encoder_t that could be use to implement a streaming encoder.

When encoding the images provided on the official website, I get those results :

branch	type	encode ms	encode mpps	branches	branches misses
master	any	14.4	157.28	28 137 276 765	1 492 397 899
this pull request	any	7.9	284.32	11 348 182 313	714 559 161
master	RGB	40.0	132.31	21 054 094 125	1 322 613 445
this pull request	RGB	23.0	229.90	9 418 298 665	633 968 387
master	RGBA	3.4	281.22	7 080 735 865	172 313 079
this pull request before 2672a47	RGBA	1.8	524.59	1 834 707 343	79 617 576
this pull request after 2672a47	RGBA	1.5	648.99	1 880 977 270	79 359 904

Note: 2672a47 avoids performing cpu intensive operations when the next 8 pixels belong to the same run. Only images with flat surfaces like screenshots will benefit from it.

Note: RGB images are a bit tricky to encode using AVX2 instructions, and this implementation can still be improved.

…is defined

…essed

…tion

…mentation

…alue

…t pixels are part of the same run

MKCG · 2022-01-02T01:57:45Z

My PHP library use a C library defining a chunk based encoder to lower the memory footprint : https://github.com/MKCG/php-qoi/blob/main/src/FFI/lib/qoi.c#L112

I will probably open a pull request here to implement a streaming encoder.

phoboslab · 2022-01-03T13:27:25Z

This is very cool, but I'm sorry to say that I will not merge it. I want this "reference" encoder here to stay as simple as possible. Also, I neither have the expertise nor the desire to maintain this SIMD implementation.

If you publish your encoder under a different name (fastqoi? rapidqoi? simdqoi?) I will happily mention it in the readme here!

MKCG · 2022-01-03T19:35:31Z

Actually I add little hope that it would be accepted since I already knew you weren't very familiar with AVX2 instructions from a previous comment on the final specification thread.

"simdqoi" would work for me, that would be a nice tribute to the amazing work of Daniel Lemire, Geoff Langdale and so many others. I guess that now I also have to make an AVX2 based decoder before publishing it.

Thanks for your time.

Encoder : almost branchless AVX2 implementation for RGBA images

a7dbf40

MKCG marked this pull request as draft December 28, 2021 22:26

MKCG changed the title ~~WIP: RGBA Encoder : an almost branchless AVX2 implementation~~ RGBA Encoder : an almost branchless AVX2 implementation Dec 28, 2021

RGBA Encoder : create a QOI_OP_RUN mask to quickly iterate over runs

a7094c5

MKCG changed the title ~~RGBA Encoder : an almost branchless AVX2 implementation~~ RGBA Encoder : an AVX2 implementation Dec 29, 2021

RGBA Encoder : branchlessly write QOI_OP_RUN

59e0452

MKCG marked this pull request as ready for review December 29, 2021 02:38

Kévin Masseix added 13 commits December 29, 2021 03:53

RGBA Encoder : rm useless assert.h include

5db6011

RGBA Encoder : compute the number of leading runs

5b71650

RGBA Encoder : change branch order and specify the likely one

27e9e31

RGBA Encoder : QOI_LIKELY is defined as builtin_expect when __GNUC__ …

7ede04d

…is defined

RGBA Encoder : fix a typo

7883b83

RGBA Encoder : encode the first pixel only if none as already be proc…

e21b860

…essed

RGBA Encoder : increment the encoder.px_pos only once per block itera…

bcfa307

…tion

RGBA Encoder : QOI_SIMD_AVX2 must be defined to enable the AVX2 imple…

01b8462

…mentation

RGBA Encoder : fastest four bytes copy

413306e

RGBA Encoder : rm useless avx2 variable to compute QOI_OP_RGB chunk v…

05692e8

…alue

RGBA Encoder : simply avx2 op_code length computing

c61b7d9

RGBA Encoder : do not perform resource intensive op when the 8 curren…

2672a47

…t pixels are part of the same run

RGB Encoder : supports AVX2 instructions

baa6a90

MKCG changed the title ~~RGBA Encoder : an AVX2 implementation~~ SIMD Encoder : an AVX2 implementation Dec 30, 2021

Kévin Masseix added 2 commits December 30, 2021 13:32

AVX2 Encoder : QOI_SIMD_AVX2 must be defined to include the header

32d0e31

AVX2 Encoder : rm var reinit

056d54f

phoboslab closed this Jan 3, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SIMD Encoder : an AVX2 implementation #143

SIMD Encoder : an AVX2 implementation #143

MKCG commented Dec 28, 2021 •

edited

MKCG commented Jan 2, 2022

phoboslab commented Jan 3, 2022

MKCG commented Jan 3, 2022

SIMD Encoder : an AVX2 implementation #143

SIMD Encoder : an AVX2 implementation #143

Conversation

MKCG commented Dec 28, 2021 • edited

MKCG commented Jan 2, 2022

phoboslab commented Jan 3, 2022

MKCG commented Jan 3, 2022

MKCG commented Dec 28, 2021 •

edited