Add basic binary operators (and, or, xor, not) to `IO::Buffer`. #5893

ioquatix · 2022-05-08T06:23:11Z

This introduces both the operators (&, |, ^ and ~) as well as in-place methods: and!, or!, xor!, not!.

string = 'Hello World'
buffer = IO::Buffer.for(string)
mask = IO::Buffer.for('abcd')

pp buffer.xor!(mask).xor!(mask)

eregon · 2022-05-10T08:45:04Z

io_buffer.c

+memory_xor(unsigned char * restrict output, unsigned char * restrict base, size_t size, unsigned char * restrict mask, size_t mask_size)
+{
+    for (size_t offset = 0; offset < size; offset += 1) {
+      output[offset] = base[offset] ^ mask[offset % mask_size];


% mask_size is likely quite slower than a bit mask (if the size is power of two), or condition with a separate mask_index resetting to 0 if >= mask_size.

I checked the assembly and it looks to generate fairly efficient SSE/AVX instructions, so I trust the compiler is doing the right things here. In any case, simple and correct is a good first implementation, and later on we can unroll it by hand if performance is an issue.

It's not about unrolling. It's about a modulo operation is as expensive as division (many cycles), and a lot more than + - * & ! ^.

I played around with this: https://godbolt.org/z/7MEfsrz9z - change -Os to -O3 and you see the compiler will unroll the loop and load the data in chunks. My understanding is that by unrolling the loop carefully, some division operations can be skipped.

If you know how we can count the number of divs being performed, that would be awesome, otherwise, I'm fine with slow but correct to start with, and figuring out a faster implementation later as an improvement. If you think adding a branch to the inner loop will ultimately be faster, let's do that? But at that point I think we need to benchmark it. It's entirely possible the compiler would do a worse job at unrolling a loop with an if statement.

I'm fairly confident it does not remove the division operations, it could only do that if it did the branch I suggest, and there are plenty of divl and divq left in that assembly.
Maybe it does a few less, but it could only do that without branches if it can make sure the mask_size is e.g. > 1, and it cannot know that.

I agree, we should benchmark it, then it should be clear. With the explicit condition there would be 0 division left, but the extra branch. It should still unroll fine.
The best would be if we could have the mask_size as a constant in the source, this should be possible by doing an early check and call with a constant for the mask_size we care about + force inline this function.

…#5893)

ioquatix mentioned this pull request May 8, 2022

Payload masking is ridiculously slow socketry/protocol-websocket#8

Closed

ioquatix force-pushed the io-buffer-xor branch 2 times, most recently from 9646ba3 to 6f63ec3 Compare May 8, 2022 06:33

Add basic binary operators (and, or, xor, not) to IO::Buffer.

64e71dc

ioquatix force-pushed the io-buffer-xor branch from 6f63ec3 to a0a20df Compare May 9, 2022 03:04

Minor fixes + documentation.

e739dab

ioquatix force-pushed the io-buffer-xor branch from a0a20df to e739dab Compare May 9, 2022 04:01

ioquatix merged commit cea34bd into ruby:master May 9, 2022

ioquatix deleted the io-buffer-xor branch May 9, 2022 05:19

eregon reviewed May 10, 2022

View reviewed changes

schneems pushed a commit to schneems/ruby that referenced this pull request May 23, 2022

Add basic binary operators (and, or, xor, not) to IO::Buffer. (ruby…

9e33dfe

…#5893)

schneems pushed a commit to schneems/ruby that referenced this pull request Jul 26, 2022

Add basic binary operators (and, or, xor, not) to IO::Buffer. (ruby…

cc22a7c

…#5893)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add basic binary operators (and, or, xor, not) to `IO::Buffer`. #5893

Add basic binary operators (and, or, xor, not) to `IO::Buffer`. #5893

ioquatix commented May 8, 2022 •

edited

eregon May 10, 2022

ioquatix May 10, 2022

eregon May 10, 2022

ioquatix May 10, 2022 •

edited

eregon May 11, 2022

Add basic binary operators (and, or, xor, not) to IO::Buffer. #5893

Add basic binary operators (and, or, xor, not) to IO::Buffer. #5893

Conversation

ioquatix commented May 8, 2022 • edited

eregon May 10, 2022

Choose a reason for hiding this comment

ioquatix May 10, 2022

Choose a reason for hiding this comment

eregon May 10, 2022

Choose a reason for hiding this comment

ioquatix May 10, 2022 • edited

Choose a reason for hiding this comment

eregon May 11, 2022

Choose a reason for hiding this comment

Add basic binary operators (and, or, xor, not) to `IO::Buffer`. #5893

Add basic binary operators (and, or, xor, not) to `IO::Buffer`. #5893

ioquatix commented May 8, 2022 •

edited

ioquatix May 10, 2022 •

edited