Improved inline multiplication in JIT for x86 #5237

potatosalad · 2021-09-23T20:31:39Z

This moves multiplication of small integers inline, with overflow checking, and adds specializations for when the left-hand or right-hand side is an immediate.
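To illustrate the general idea of an overflow-checked multiply on tagged small integers, here is a minimal C++ sketch. It is not the code from this PR: the tag layout (`kTagBits`, `kSmallTag`) and the helper names are assumptions made for the example, and `__builtin_mul_overflow` is a GCC/Clang builtin standing in for checking the overflow flag after `imul`.

```cpp
#include <cassert>
#include <cstdint>

// Assumed tag layout for this sketch only: the low 4 bits of a word hold a
// tag, and 0xF marks a small integer.
constexpr int kTagBits = 4;
constexpr std::int64_t kSmallTag = 0xF;

// Hypothetical helper: build a tagged small integer from a plain value.
constexpr std::int64_t make_small(std::int64_t value) {
    return static_cast<std::int64_t>(static_cast<std::uint64_t>(value) << kTagBits) | kSmallTag;
}

// Hypothetical helper: true when the word carries the small-integer tag.
constexpr bool is_small(std::int64_t term) {
    return (term & kSmallTag) == kSmallTag;
}

// Multiplies two tagged small integers. Returns true and stores the tagged
// product in *out on success; returns false when the product does not fit,
// in which case a slower (bignum) path would be needed.
inline bool mul_small(std::int64_t lhs, std::int64_t rhs, std::int64_t *out) {
    assert(is_small(lhs) && is_small(rhs));
    std::int64_t shifted = lhs & ~kSmallTag;  // value << 4, tag bits cleared
    std::int64_t factor = rhs >> kTagBits;    // plain value (arithmetic shift on GCC/Clang)
    std::int64_t product;
    // (a << 4) * b == (a * b) << 4, so a 64-bit signed overflow here means
    // the untagged product would not fit in the remaining 60 bits.
    if (__builtin_mul_overflow(shifted, factor, &product)) {
        return false;
    }
    *out = product | kSmallTag;               // low 4 bits are zero, so OR re-tags
    return true;
}
```

Only one operand is shifted down before the multiply, so the product already sits in the tagged position and re-tagging is a single OR.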

I think the original comment about clobbering RDX, quoted below, is incorrect, because we use imul in its 2-operand form rather than its 1-operand form (see the reference docs for imul):

We avoid using ARG2 and ARG3 because multiplication clobbers RDX, which is
ARG2 on Windows and ARG3 on SystemV.
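The difference between the two imul forms can be modelled in C++ as a rough sketch. The struct and function names here are hypothetical, and `__int128` and `__builtin_mul_overflow` are GCC/Clang extensions. The 1-operand form writes a 128-bit product to RDX:RAX, while the 2-operand form writes a truncated 64-bit product to its destination register only and reports lost bits through the overflow flag.

```cpp
#include <cstdint>

// Models 1-operand `imul r/m64`: RDX:RAX = RAX * src (full 128-bit product).
struct WideProduct {
    std::int64_t high;   // what RDX would hold afterwards
    std::uint64_t low;   // what RAX would hold afterwards
};

inline WideProduct imul_one_operand(std::int64_t rax, std::int64_t src) {
    __int128 full = static_cast<__int128>(rax) * src;
    return { static_cast<std::int64_t>(full >> 64),
             static_cast<std::uint64_t>(full) };
}

// Models 2-operand `imul dst, src`: dst = low 64 bits of dst * src, with the
// overflow flag set when the full product does not fit. No other register
// is written.
struct TruncatedProduct {
    std::int64_t value;  // truncated product left in dst
    bool overflow;       // state of the overflow flag
};

inline TruncatedProduct imul_two_operand(std::int64_t dst, std::int64_t src) {
    std::int64_t value;
    // The builtin stores the wrapped result even when it reports overflow.
    bool overflow = __builtin_mul_overflow(dst, src, &value);
    return { value, overflow };
}
```

In this model only `imul_one_operand` produces a high half, which is the RDX write the quoted comment is concerned with.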

As a very non-scientific benchmark: in some tests with a pure Erlang version of MurmurHash3_x86_32 on a 1MB input key, this change alone roughly doubled performance for me (from 30ms down to 15ms).

CLAassistant · 2021-09-23T20:31:43Z

All committers have signed the CLA.

garazdawi · 2021-09-30T09:44:15Z

Thanks!

Improved inline multiplication in JIT for x86

297d5d2

rickard-green added the team:VM (Assigned to OTP team VM) label Sep 23, 2021

garazdawi added the enhancement and testing (currently being tested, tag is used by OTP internal CI) labels Sep 27, 2021

rickard-green assigned garazdawi Sep 27, 2021

garazdawi merged commit 6851eed into erlang:master Sep 30, 2021
