Skip to content

perf(compiler): optimize x86 u256 mul with bmi2 adx#428

Merged
zoowii merged 17 commits intoDTVMStack:mainfrom
abmcar:perf/compiler-u256-mulx-adx
Apr 13, 2026
Merged

perf(compiler): optimize x86 u256 mul with bmi2 adx#428
zoowii merged 17 commits intoDTVMStack:mainfrom
abmcar:perf/compiler-u256-mulx-adx

Conversation

@abmcar
Copy link
Copy Markdown
Contributor

@abmcar abmcar commented Mar 23, 2026

Summary

  • detect adx and bmi2 in the x86 compiler target feature string
  • lower generic EVM U256 MUL to a BMI2+ADX row-wise MULX + ADCX/ADOX schedule on supported x86_64 hosts
  • trim reverted square / MULX-only experiment paths and drop the temporary benchmark helper from the final branch

Validation

  • tools/format.sh check
  • cmake --build /home/abmcar/DTVM-mulx-adx-opt/build-perf -j 20 --target libdtvmapi.so
  • local evmc run --bench checks against main; gas and output matched in all cases

Performance

Local performance was re-measured with two distinct runtime 256-bit operands so the benchmark exercises the generic non-square U256 MUL path.

Case Current Main Delta
generic-mul-once 182 ns 183 ns +0.55%
generic-mul-loop-64 502 ns 543 ns +7.55%
generic-mul-loop-255 1228 ns 1372 ns +10.50%

The single-shot case stays close to noise, but the hot generic MUL loop shows a stable improvement of about 8% to 10%.

Notes

  • the final branch intentionally keeps only the generic BMI2+ADX path as the optimization path over legacy lowering
  • square-specialized and MULX-only experimental paths are not part of the final PR state

Copilot AI review requested due to automatic review settings March 23, 2026 08:13
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR improves x86_64 EVM U256 MUL codegen by detecting adx/bmi2 support from the host feature string and selecting a dedicated ADX+BMI2 lowering path that uses MULX plus ADCX/ADOX carry chains.

Changes:

  • Add host feature detection for adx and bmi2 in the x86 feature string.
  • Split lowerEvmU256MulExpr() into legacy and ADX+BMI2 implementations.
  • Introduce helpers for emitting MULX, ADCX, ADOX, and carry-chain setup/collection.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File Description
src/compiler/target/x86/x86lowering.h Declares new helpers and the legacy vs ADX lowering split for U256 MUL.
src/compiler/target/x86/x86lowering.cpp Implements ADX+BMI2 lowering using MULX + ADCX/ADOX scheduling and gates it on subtarget features.
src/compiler/context.cpp Extends host x86 feature-string construction to include adx and bmi2 as optional features.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread src/compiler/target/x86/x86lowering.cpp
@github-actions
Copy link
Copy Markdown

github-actions Bot commented Mar 23, 2026

⚡ Performance Regression Check Results

✅ Performance Check Passed (interpreter)

Performance Benchmark Results (threshold: 25%)

Benchmark Baseline (us) Current (us) Change Status
total/main/blake2b_huff/8415nulls 1.49 1.50 +0.4% PASS
total/main/blake2b_huff/empty 0.02 0.02 -0.1% PASS
total/main/blake2b_shifts/8415nulls 11.76 11.65 -1.0% PASS
total/main/sha1_divs/5311 5.19 5.16 -0.4% PASS
total/main/sha1_divs/empty 0.06 0.06 -1.0% PASS
total/main/sha1_shifts/5311 2.85 2.82 -0.9% PASS
total/main/sha1_shifts/empty 0.03 0.03 -0.0% PASS
total/main/snailtracer/benchmark 52.54 52.88 +0.6% PASS
total/main/structarray_alloc/nfts_rank 0.96 0.95 -0.2% PASS
total/main/swap_math/insufficient_liquidity 0.00 0.00 +2.6% PASS
total/main/swap_math/received 0.00 0.00 +0.7% PASS
total/main/swap_math/spent 0.00 0.00 +1.5% PASS
total/main/weierstrudel/1 0.30 0.30 -0.5% PASS
total/main/weierstrudel/15 3.31 3.30 -0.4% PASS
total/micro/JUMPDEST_n0/empty 1.47 1.47 +0.0% PASS
total/micro/jump_around/empty 0.09 0.09 +0.2% PASS
total/micro/loop_with_many_jumpdests/empty 22.41 22.42 +0.0% PASS
total/micro/memory_grow_mload/by1 0.08 0.09 +4.0% PASS
total/micro/memory_grow_mload/by16 0.10 0.09 -5.0% PASS
total/micro/memory_grow_mload/by32 0.11 0.11 -0.2% PASS
total/micro/memory_grow_mload/nogrow 0.09 0.08 -12.8% PASS
total/micro/memory_grow_mstore/by1 0.09 0.09 +0.1% PASS
total/micro/memory_grow_mstore/by16 0.10 0.10 +0.2% PASS
total/micro/memory_grow_mstore/by32 0.11 0.11 +0.1% PASS
total/micro/memory_grow_mstore/nogrow 0.09 0.09 +0.9% PASS
total/micro/signextend/one 0.23 0.23 +0.1% PASS
total/micro/signextend/zero 0.23 0.23 +0.0% PASS
total/synth/ADD/b0 1.98 1.98 +0.0% PASS
total/synth/ADD/b1 1.97 1.97 +0.1% PASS
total/synth/ADDRESS/a0 4.98 4.98 +0.0% PASS
total/synth/ADDRESS/a1 5.36 5.35 -0.3% PASS
total/synth/AND/b0 1.71 1.71 +0.0% PASS
total/synth/AND/b1 1.71 1.71 +0.0% PASS
total/synth/BYTE/b0 6.08 6.07 -0.1% PASS
total/synth/BYTE/b1 4.76 4.74 -0.3% PASS
total/synth/CALLDATASIZE/a0 3.34 3.34 +0.0% PASS
total/synth/CALLDATASIZE/a1 4.09 4.11 +0.5% PASS
total/synth/CALLER/a0 4.96 4.96 -0.0% PASS
total/synth/CALLER/a1 5.38 5.36 -0.3% PASS
total/synth/CALLVALUE/a0 3.50 3.50 +0.0% PASS
total/synth/CALLVALUE/a1 3.53 3.53 -0.2% PASS
total/synth/CODESIZE/a0 3.68 3.91 +6.4% PASS
total/synth/CODESIZE/a1 3.94 3.94 -0.1% PASS
total/synth/DUP1/d0 1.14 1.06 -7.2% PASS
total/synth/DUP1/d1 1.31 1.31 -0.0% PASS
total/synth/DUP10/d0 0.98 1.15 +16.7% PASS
total/synth/DUP10/d1 1.00 1.01 +0.8% PASS
total/synth/DUP11/d0 0.91 0.91 -0.1% PASS
total/synth/DUP11/d1 1.23 1.00 -18.7% PASS
total/synth/DUP12/d0 0.90 1.14 +26.6% PASS
total/synth/DUP12/d1 1.00 1.23 +22.8% PASS
total/synth/DUP13/d0 1.14 1.14 +0.2% PASS
total/synth/DUP13/d1 1.00 1.00 -0.1% PASS
total/synth/DUP14/d0 1.14 0.98 -13.8% PASS
total/synth/DUP14/d1 1.00 1.23 +22.9% PASS
total/synth/DUP15/d0 0.91 1.14 +26.2% PASS
total/synth/DUP15/d1 1.01 1.23 +22.7% PASS
total/synth/DUP16/d0 1.14 1.15 +0.1% PASS
total/synth/DUP16/d1 1.00 1.00 -0.1% PASS
total/synth/DUP2/d0 1.14 1.22 +7.2% PASS
total/synth/DUP2/d1 1.02 1.23 +21.0% PASS
total/synth/DUP3/d0 1.14 0.98 -14.0% PASS
total/synth/DUP3/d1 1.00 1.23 +23.2% PASS
total/synth/DUP4/d0 1.14 1.14 +0.1% PASS
total/synth/DUP4/d1 1.00 1.23 +23.4% PASS
total/synth/DUP5/d0 0.90 1.15 +27.3% PASS
total/synth/DUP5/d1 1.23 1.23 +0.1% PASS
total/synth/DUP6/d0 1.14 1.15 +0.1% PASS
total/synth/DUP6/d1 1.00 1.01 +1.0% PASS
total/synth/DUP7/d0 1.15 1.15 -0.1% PASS
total/synth/DUP7/d1 1.23 1.00 -18.8% PASS
total/synth/DUP8/d0 1.14 0.98 -13.9% PASS
total/synth/DUP8/d1 1.00 1.23 +23.1% PASS
total/synth/DUP9/d0 0.91 1.14 +26.3% PASS
total/synth/DUP9/d1 1.02 1.23 +20.2% PASS
total/synth/EQ/b0 2.76 2.76 +0.0% PASS
total/synth/EQ/b1 1.40 1.41 +1.1% PASS
total/synth/GAS/a0 3.91 3.91 -0.0% PASS
total/synth/GAS/a1 3.94 3.95 +0.0% PASS
total/synth/GT/b0 2.62 2.62 +0.1% PASS
total/synth/GT/b1 1.47 1.48 +0.1% PASS
total/synth/ISZERO/u0 1.15 1.15 -0.0% PASS
total/synth/JUMPDEST/n0 1.47 1.47 +0.1% PASS
total/synth/LT/b0 2.61 2.61 +0.1% PASS
total/synth/LT/b1 1.48 1.48 +0.0% PASS
total/synth/MSIZE/a0 4.25 4.25 +0.1% PASS
total/synth/MSIZE/a1 4.77 4.75 -0.3% PASS
total/synth/MUL/b0 5.33 5.33 +0.0% PASS
total/synth/MUL/b1 5.40 5.40 +0.1% PASS
total/synth/NOT/u0 1.69 1.69 -0.3% PASS
total/synth/OR/b0 1.64 1.64 -0.0% PASS
total/synth/OR/b1 1.71 1.71 +0.1% PASS
total/synth/PC/a0 3.18 3.42 +7.6% PASS
total/synth/PC/a1 3.69 3.64 -1.4% PASS
total/synth/PUSH1/p0 1.23 0.93 -24.2% PASS
total/synth/PUSH1/p1 1.30 1.31 +0.6% PASS
total/synth/PUSH10/p0 0.99 1.15 +16.1% PASS
total/synth/PUSH10/p1 1.33 1.32 -0.8% PASS
total/synth/PUSH11/p0 1.15 0.99 -13.9% PASS
total/synth/PUSH11/p1 1.31 1.33 +1.6% PASS
total/synth/PUSH12/p0 1.23 1.07 -12.9% PASS
total/synth/PUSH12/p1 1.33 1.28 -3.7% PASS
total/synth/PUSH13/p0 0.99 1.07 +8.2% PASS
total/synth/PUSH13/p1 1.33 1.30 -2.1% PASS
total/synth/PUSH14/p0 1.16 1.03 -11.5% PASS
total/synth/PUSH14/p1 1.34 1.32 -1.1% PASS
total/synth/PUSH15/p0 1.10 1.15 +4.6% PASS
total/synth/PUSH15/p1 1.38 1.38 +0.4% PASS
total/synth/PUSH16/p0 0.99 1.11 +12.0% PASS
total/synth/PUSH16/p1 1.34 1.32 -1.3% PASS
total/synth/PUSH17/p0 1.22 0.91 -25.9% PASS
total/synth/PUSH17/p1 1.33 1.32 -0.8% PASS
total/synth/PUSH18/p0 0.98 1.15 +16.8% PASS
total/synth/PUSH18/p1 1.34 1.30 -3.0% PASS
total/synth/PUSH19/p0 1.15 1.07 -6.8% PASS
total/synth/PUSH19/p1 1.33 1.32 -0.8% PASS
total/synth/PUSH2/p0 0.99 1.07 +7.9% PASS
total/synth/PUSH2/p1 1.31 1.32 +0.2% PASS
total/synth/PUSH20/p0 1.01 1.15 +14.1% PASS
total/synth/PUSH20/p1 1.33 1.34 +0.6% PASS
total/synth/PUSH21/p0 1.15 1.11 -3.4% PASS
total/synth/PUSH21/p1 1.34 1.30 -2.9% PASS
total/synth/PUSH22/p0 0.99 1.11 +11.7% PASS
total/synth/PUSH22/p1 1.34 1.30 -3.1% PASS
total/synth/PUSH23/p0 0.99 1.15 +16.0% PASS
total/synth/PUSH23/p1 1.35 1.30 -3.3% PASS
total/synth/PUSH24/p0 1.06 1.07 +0.9% PASS
total/synth/PUSH24/p1 1.36 1.30 -4.5% PASS
total/synth/PUSH25/p0 1.13 1.07 -5.5% PASS
total/synth/PUSH25/p1 1.36 1.34 -1.4% PASS
total/synth/PUSH26/p0 1.15 1.15 -0.0% PASS
total/synth/PUSH26/p1 1.34 1.34 -0.5% PASS
total/synth/PUSH27/p0 1.23 1.12 -9.2% PASS
total/synth/PUSH27/p1 1.35 1.30 -3.2% PASS
total/synth/PUSH28/p0 1.21 1.23 +1.5% PASS
total/synth/PUSH28/p1 1.36 1.33 -2.4% PASS
total/synth/PUSH29/p0 1.18 1.07 -10.0% PASS
total/synth/PUSH29/p1 1.34 1.33 -1.1% PASS
total/synth/PUSH3/p0 0.99 1.07 +8.2% PASS
total/synth/PUSH3/p1 1.32 1.32 -0.1% PASS
total/synth/PUSH30/p0 1.16 1.20 +3.8% PASS
total/synth/PUSH30/p1 1.37 1.34 -2.5% PASS
total/synth/PUSH31/p0 1.21 1.15 -4.9% PASS
total/synth/PUSH31/p1 1.47 1.66 +13.0% PASS
total/synth/PUSH32/p0 1.15 1.07 -7.2% PASS
total/synth/PUSH32/p1 1.38 1.30 -5.8% PASS
total/synth/PUSH4/p0 1.23 1.15 -6.6% PASS
total/synth/PUSH4/p1 1.31 1.31 +0.0% PASS
total/synth/PUSH5/p0 1.21 0.91 -25.0% PASS
total/synth/PUSH5/p1 1.34 1.30 -2.8% PASS
total/synth/PUSH6/p0 1.15 1.15 +0.4% PASS
total/synth/PUSH6/p1 1.34 1.31 -1.7% PASS
total/synth/PUSH7/p0 1.15 0.99 -14.2% PASS
total/synth/PUSH7/p1 1.12 1.29 +15.0% PASS
total/synth/PUSH8/p0 1.15 1.07 -7.3% PASS
total/synth/PUSH8/p1 1.33 1.33 +0.1% PASS
total/synth/PUSH9/p0 1.15 0.92 -19.8% PASS
total/synth/PUSH9/p1 1.33 1.32 -0.9% PASS
total/synth/RETURNDATASIZE/a0 3.74 3.75 +0.1% PASS
total/synth/RETURNDATASIZE/a1 3.78 3.77 -0.3% PASS
total/synth/SAR/b0 3.77 3.77 +0.1% PASS
total/synth/SAR/b1 4.32 4.30 -0.4% PASS
total/synth/SGT/b0 2.60 2.60 +0.0% PASS
total/synth/SGT/b1 1.63 1.63 +0.0% PASS
total/synth/SHL/b0 3.03 3.03 -0.0% PASS
total/synth/SHL/b1 1.60 1.60 +0.0% PASS
total/synth/SHR/b0 2.94 2.93 -0.0% PASS
total/synth/SHR/b1 1.55 1.55 +0.0% PASS
total/synth/SIGNEXTEND/b0 3.29 3.29 +0.0% PASS
total/synth/SIGNEXTEND/b1 3.46 3.45 -0.3% PASS
total/synth/SLT/b0 2.59 2.60 +0.2% PASS
total/synth/SLT/b1 1.63 1.63 -0.0% PASS
total/synth/SUB/b0 1.98 1.98 +0.0% PASS
total/synth/SUB/b1 1.97 1.97 +0.0% PASS
total/synth/SWAP1/s0 1.49 1.49 -0.0% PASS
total/synth/SWAP10/s0 1.50 1.50 -0.0% PASS
total/synth/SWAP11/s0 1.50 1.50 -0.0% PASS
total/synth/SWAP12/s0 1.50 1.50 -0.1% PASS
total/synth/SWAP13/s0 1.50 1.50 -0.1% PASS
total/synth/SWAP14/s0 1.51 1.51 -0.1% PASS
total/synth/SWAP15/s0 1.51 1.51 -0.0% PASS
total/synth/SWAP16/s0 1.51 1.51 +0.0% PASS
total/synth/SWAP2/s0 1.49 1.49 +0.0% PASS
total/synth/SWAP3/s0 1.49 1.49 -0.1% PASS
total/synth/SWAP4/s0 1.49 1.49 +0.1% PASS
total/synth/SWAP5/s0 1.49 1.50 +0.1% PASS
total/synth/SWAP6/s0 1.50 1.50 -0.1% PASS
total/synth/SWAP7/s0 1.50 1.50 -0.0% PASS
total/synth/SWAP8/s0 1.50 1.50 -0.1% PASS
total/synth/SWAP9/s0 1.50 1.50 +0.1% PASS
total/synth/XOR/b0 1.55 1.55 +0.0% PASS
total/synth/XOR/b1 1.55 1.55 -0.0% PASS
total/synth/loop_v1 4.58 4.57 -0.1% PASS
total/synth/loop_v2 4.56 4.58 +0.5% PASS

Summary: 194 benchmarks, 0 regressions


✅ Performance Check Passed (multipass)

Performance Benchmark Results (threshold: 25%)

Benchmark Baseline (us) Current (us) Change Status
total/main/blake2b_huff/8415nulls 1.67 1.85 +10.5% PASS
total/main/blake2b_huff/empty 0.03 0.03 +9.9% PASS
total/main/blake2b_shifts/8415nulls 4.56 4.56 -0.1% PASS
total/main/sha1_divs/5311 0.62 0.59 -4.0% PASS
total/main/sha1_divs/empty 0.01 0.01 -3.6% PASS
total/main/sha1_shifts/5311 0.55 0.56 +0.4% PASS
total/main/sha1_shifts/empty 0.01 0.01 +1.0% PASS
total/main/snailtracer/benchmark 54.11 53.34 -1.4% PASS
total/main/structarray_alloc/nfts_rank 0.30 0.29 -3.1% PASS
total/main/swap_math/insufficient_liquidity 0.00 0.00 -0.2% PASS
total/main/swap_math/received 0.00 0.00 +0.4% PASS
total/main/swap_math/spent 0.00 0.00 -2.0% PASS
total/main/weierstrudel/1 0.29 0.29 -0.0% PASS
total/main/weierstrudel/15 3.21 3.19 -0.6% PASS
total/micro/JUMPDEST_n0/empty 0.00 0.00 +0.3% PASS
total/micro/jump_around/empty 0.08 0.09 +2.4% PASS
total/micro/loop_with_many_jumpdests/empty 0.00 0.00 +0.8% PASS
total/micro/memory_grow_mload/by1 0.09 0.08 -12.0% PASS
total/micro/memory_grow_mload/by16 0.10 0.09 -10.5% PASS
total/micro/memory_grow_mload/by32 0.12 0.11 -8.3% PASS
total/micro/memory_grow_mload/nogrow 0.09 0.08 -12.8% PASS
total/micro/memory_grow_mstore/by1 0.11 0.09 -14.6% PASS
total/micro/memory_grow_mstore/by16 0.12 0.10 -14.9% PASS
total/micro/memory_grow_mstore/by32 0.13 0.11 -13.3% PASS
total/micro/memory_grow_mstore/nogrow 0.11 0.09 -17.3% PASS
total/micro/signextend/one 0.23 0.24 +1.8% PASS
total/micro/signextend/zero 0.24 0.24 +1.6% PASS
total/synth/ADD/b0 0.00 0.00 +0.8% PASS
total/synth/ADD/b1 0.00 0.00 +0.8% PASS
total/synth/ADDRESS/a0 0.15 0.15 +0.2% PASS
total/synth/ADDRESS/a1 0.15 0.15 +0.3% PASS
total/synth/AND/b0 0.00 0.00 +0.7% PASS
total/synth/AND/b1 0.00 0.00 +0.8% PASS
total/synth/BYTE/b0 0.00 0.00 +0.7% PASS
total/synth/BYTE/b1 0.00 0.00 +0.7% PASS
total/synth/CALLDATASIZE/a0 0.07 0.07 +0.0% PASS
total/synth/CALLDATASIZE/a1 0.07 0.07 +0.0% PASS
total/synth/CALLER/a0 0.18 0.18 -0.2% PASS
total/synth/CALLER/a1 0.18 0.18 -0.0% PASS
total/synth/CALLVALUE/a0 0.26 0.26 -0.2% PASS
total/synth/CALLVALUE/a1 0.27 0.27 -0.2% PASS
total/synth/CODESIZE/a0 0.07 0.07 +0.0% PASS
total/synth/CODESIZE/a1 0.07 0.07 +0.0% PASS
total/synth/DUP1/d0 0.00 0.00 +0.8% PASS
total/synth/DUP1/d1 0.00 0.00 +1.0% PASS
total/synth/DUP10/d0 0.00 0.00 +1.2% PASS
total/synth/DUP10/d1 0.00 0.00 +0.9% PASS
total/synth/DUP11/d0 0.00 0.00 +0.8% PASS
total/synth/DUP11/d1 0.00 0.00 +0.7% PASS
total/synth/DUP12/d0 0.00 0.00 +0.8% PASS
total/synth/DUP12/d1 0.00 0.00 +1.1% PASS
total/synth/DUP13/d0 0.00 0.00 +1.0% PASS
total/synth/DUP13/d1 0.00 0.00 +0.9% PASS
total/synth/DUP14/d0 0.00 0.00 +1.1% PASS
total/synth/DUP14/d1 0.00 0.00 +1.0% PASS
total/synth/DUP15/d0 0.00 0.00 +1.0% PASS
total/synth/DUP15/d1 0.00 0.00 +0.7% PASS
total/synth/DUP16/d0 0.00 0.00 +0.8% PASS
total/synth/DUP16/d1 0.00 0.00 +1.3% PASS
total/synth/DUP2/d0 0.00 0.00 +0.9% PASS
total/synth/DUP2/d1 0.00 0.00 +1.0% PASS
total/synth/DUP3/d0 0.00 0.00 +0.9% PASS
total/synth/DUP3/d1 0.00 0.00 +0.9% PASS
total/synth/DUP4/d0 0.00 0.00 +1.0% PASS
total/synth/DUP4/d1 0.00 0.00 +0.8% PASS
total/synth/DUP5/d0 0.00 0.00 +1.2% PASS
total/synth/DUP5/d1 0.00 0.00 +1.1% PASS
total/synth/DUP6/d0 0.00 0.00 +1.2% PASS
total/synth/DUP6/d1 0.00 0.00 +0.6% PASS
total/synth/DUP7/d0 0.00 0.00 +0.5% PASS
total/synth/DUP7/d1 0.00 0.00 +1.1% PASS
total/synth/DUP8/d0 0.00 0.00 +0.9% PASS
total/synth/DUP8/d1 0.00 0.00 +0.7% PASS
total/synth/DUP9/d0 0.00 0.00 +0.7% PASS
total/synth/DUP9/d1 0.00 0.00 +1.0% PASS
total/synth/EQ/b0 0.00 0.00 +1.1% PASS
total/synth/EQ/b1 0.00 0.00 +1.1% PASS
total/synth/GAS/a0 0.78 0.78 +0.0% PASS
total/synth/GAS/a1 0.75 0.75 +0.0% PASS
total/synth/GT/b0 0.00 0.00 +1.2% PASS
total/synth/GT/b1 0.00 0.00 +1.2% PASS
total/synth/ISZERO/u0 0.00 0.00 +1.1% PASS
total/synth/JUMPDEST/n0 0.00 0.00 +0.2% PASS
total/synth/LT/b0 0.00 0.00 +0.8% PASS
total/synth/LT/b1 0.00 0.00 +1.2% PASS
total/synth/MSIZE/a0 0.00 0.00 +0.7% PASS
total/synth/MSIZE/a1 0.00 0.00 +1.0% PASS
total/synth/MUL/b0 0.00 0.00 +0.7% PASS
total/synth/MUL/b1 0.00 0.00 +0.6% PASS
total/synth/NOT/u0 0.00 0.00 +1.0% PASS
total/synth/OR/b0 0.00 0.00 +0.9% PASS
total/synth/OR/b1 0.00 0.00 +0.8% PASS
total/synth/PC/a0 0.00 0.00 +1.2% PASS
total/synth/PC/a1 0.00 0.00 +0.8% PASS
total/synth/PUSH1/p0 0.00 0.00 +1.3% PASS
total/synth/PUSH1/p1 0.00 0.00 +1.1% PASS
total/synth/PUSH10/p0 0.00 0.00 +0.8% PASS
total/synth/PUSH10/p1 0.00 0.00 +0.7% PASS
total/synth/PUSH11/p0 0.00 0.00 +0.8% PASS
total/synth/PUSH11/p1 0.00 0.00 +0.8% PASS
total/synth/PUSH12/p0 0.00 0.00 +0.9% PASS
total/synth/PUSH12/p1 0.00 0.00 +1.0% PASS
total/synth/PUSH13/p0 0.00 0.00 +1.1% PASS
total/synth/PUSH13/p1 0.00 0.00 +0.8% PASS
total/synth/PUSH14/p0 0.00 0.00 +0.9% PASS
total/synth/PUSH14/p1 0.00 0.00 +0.8% PASS
total/synth/PUSH15/p0 0.00 0.00 +1.0% PASS
total/synth/PUSH15/p1 0.00 0.00 +0.8% PASS
total/synth/PUSH16/p0 0.00 0.00 +0.6% PASS
total/synth/PUSH16/p1 0.00 0.00 +0.6% PASS
total/synth/PUSH17/p0 0.00 0.00 +0.9% PASS
total/synth/PUSH17/p1 0.00 0.00 +0.8% PASS
total/synth/PUSH18/p0 0.00 0.00 +0.8% PASS
total/synth/PUSH18/p1 0.00 0.00 +0.5% PASS
total/synth/PUSH19/p0 0.00 0.00 +1.0% PASS
total/synth/PUSH19/p1 0.00 0.00 +1.0% PASS
total/synth/PUSH2/p0 0.00 0.00 +0.9% PASS
total/synth/PUSH2/p1 0.00 0.00 +1.0% PASS
total/synth/PUSH20/p0 0.00 0.00 +0.8% PASS
total/synth/PUSH20/p1 0.00 0.00 +0.8% PASS
total/synth/PUSH21/p0 0.00 0.00 +1.2% PASS
total/synth/PUSH21/p1 0.00 0.00 +1.0% PASS
total/synth/PUSH22/p0 1.15 1.15 -0.1% PASS
total/synth/PUSH22/p1 1.33 1.30 -2.9% PASS
total/synth/PUSH23/p0 1.15 1.15 -0.3% PASS
total/synth/PUSH23/p1 1.37 1.29 -6.2% PASS
total/synth/PUSH24/p0 1.15 1.15 +0.0% PASS
total/synth/PUSH24/p1 1.36 1.29 -5.3% PASS
total/synth/PUSH25/p0 1.15 1.15 -0.0% PASS
total/synth/PUSH25/p1 1.35 1.30 -4.1% PASS
total/synth/PUSH26/p0 1.07 0.91 -15.1% PASS
total/synth/PUSH26/p1 1.36 1.29 -4.7% PASS
total/synth/PUSH27/p0 1.15 1.15 -0.1% PASS
total/synth/PUSH27/p1 1.36 1.29 -5.1% PASS
total/synth/PUSH28/p0 1.15 1.15 -0.0% PASS
total/synth/PUSH28/p1 1.36 1.29 -4.9% PASS
total/synth/PUSH29/p0 1.15 1.22 +6.4% PASS
total/synth/PUSH29/p1 1.34 1.29 -4.1% PASS
total/synth/PUSH3/p0 0.00 0.00 +0.9% PASS
total/synth/PUSH3/p1 0.00 0.00 +0.8% PASS
total/synth/PUSH30/p0 1.15 1.15 +0.0% PASS
total/synth/PUSH30/p1 1.36 1.31 -3.9% PASS
total/synth/PUSH31/p0 1.15 1.15 -0.1% PASS
total/synth/PUSH31/p1 1.45 1.40 -3.6% PASS
total/synth/PUSH32/p0 1.15 1.15 -0.3% PASS
total/synth/PUSH32/p1 1.34 1.32 -1.5% PASS
total/synth/PUSH4/p0 0.00 0.00 +1.1% PASS
total/synth/PUSH4/p1 0.00 0.00 +1.2% PASS
total/synth/PUSH5/p0 0.00 0.00 +1.3% PASS
total/synth/PUSH5/p1 0.00 0.00 +0.8% PASS
total/synth/PUSH6/p0 0.00 0.00 +1.2% PASS
total/synth/PUSH6/p1 0.00 0.00 +1.0% PASS
total/synth/PUSH7/p0 0.00 0.00 +0.8% PASS
total/synth/PUSH7/p1 0.00 0.00 +0.9% PASS
total/synth/PUSH8/p0 0.00 0.00 +0.9% PASS
total/synth/PUSH8/p1 0.00 0.00 +0.9% PASS
total/synth/PUSH9/p0 0.00 0.00 +1.1% PASS
total/synth/PUSH9/p1 0.00 0.00 +1.1% PASS
total/synth/RETURNDATASIZE/a0 0.03 0.03 +0.0% PASS
total/synth/RETURNDATASIZE/a1 0.03 0.03 -0.0% PASS
total/synth/SAR/b0 3.77 3.78 +0.2% PASS
total/synth/SAR/b1 4.31 4.32 +0.2% PASS
total/synth/SGT/b0 0.00 0.00 +0.9% PASS
total/synth/SGT/b1 0.00 0.00 +0.9% PASS
total/synth/SHL/b0 3.04 3.03 -0.3% PASS
total/synth/SHL/b1 1.60 1.60 +0.0% PASS
total/synth/SHR/b0 2.95 2.93 -0.4% PASS
total/synth/SHR/b1 1.55 1.55 +0.0% PASS
total/synth/SIGNEXTEND/b0 3.12 3.03 -2.7% PASS
total/synth/SIGNEXTEND/b1 3.21 3.22 +0.4% PASS
total/synth/SLT/b0 0.00 0.00 +0.8% PASS
total/synth/SLT/b1 0.00 0.00 +0.7% PASS
total/synth/SUB/b0 0.00 0.00 +0.9% PASS
total/synth/SUB/b1 0.00 0.00 +0.9% PASS
total/synth/SWAP1/s0 0.00 0.00 +0.9% PASS
total/synth/SWAP10/s0 0.00 0.00 +1.0% PASS
total/synth/SWAP11/s0 0.00 0.00 +1.0% PASS
total/synth/SWAP12/s0 0.00 0.00 +0.9% PASS
total/synth/SWAP13/s0 0.00 0.00 +1.1% PASS
total/synth/SWAP14/s0 0.00 0.00 +0.7% PASS
total/synth/SWAP15/s0 0.00 0.00 +0.9% PASS
total/synth/SWAP16/s0 0.00 0.00 +0.8% PASS
total/synth/SWAP2/s0 0.00 0.00 +1.0% PASS
total/synth/SWAP3/s0 0.00 0.00 +1.0% PASS
total/synth/SWAP4/s0 0.00 0.00 +0.9% PASS
total/synth/SWAP5/s0 0.00 0.00 +1.0% PASS
total/synth/SWAP6/s0 0.00 0.00 +1.0% PASS
total/synth/SWAP7/s0 0.00 0.00 +0.9% PASS
total/synth/SWAP8/s0 0.00 0.00 +0.9% PASS
total/synth/SWAP9/s0 0.00 0.00 +0.8% PASS
total/synth/XOR/b0 0.00 0.00 +0.8% PASS
total/synth/XOR/b1 0.00 0.00 +0.7% PASS
total/synth/loop_v1 1.17 1.19 +1.3% PASS
total/synth/loop_v2 1.10 1.11 +1.3% PASS

Summary: 194 benchmarks, 0 regressions


Comment thread src/compiler/target/x86/x86lowering.cpp
@abmcar abmcar requested a review from zoowii April 2, 2026 05:14
@abmcar abmcar force-pushed the perf/compiler-u256-mulx-adx branch from dab8a1e to 8e65990 Compare April 8, 2026 11:36
@abmcar abmcar requested a review from Copilot April 9, 2026 04:36
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.


💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread src/compiler/target/x86/x86lowering.cpp
Comment thread docs/changes/2026-03-23-u256-mul-bmi2-adx/README.md
abmcar added a commit to abmcar/DTVM that referenced this pull request Apr 10, 2026
Align with PR DTVMStack#428 to eliminate cross-PR merge conflicts on the
shared table row.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
abmcar added a commit to abmcar/DTVM that referenced this pull request Apr 10, 2026
Align with PR DTVMStack#428 to eliminate cross-PR merge conflicts on the
shared table row.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
abmcar added a commit to abmcar/DTVM that referenced this pull request Apr 10, 2026
Align with PR DTVMStack#428 to eliminate cross-PR merge conflicts on the
shared table row.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
abmcar added a commit to abmcar/DTVM that referenced this pull request Apr 10, 2026
Align with PR DTVMStack#428 to eliminate cross-PR merge conflicts on the
shared table row.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
abmcar added a commit to abmcar/DTVM that referenced this pull request Apr 10, 2026
Align with PR DTVMStack#428 to eliminate cross-PR merge conflicts on the
shared table row.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
abmcar and others added 2 commits April 10, 2026 21:52
- Update change doc file count to match actual PR diff
- Add comment clarifying EVMJIT-only scope per reviewer feedback

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The shared table caused merge conflicts across concurrent PRs since
every PR modified the same line. Each change doc is self-contained in
its subdirectory; the table was redundant.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@abmcar abmcar force-pushed the perf/compiler-u256-mulx-adx branch 2 times, most recently from 8c68029 to 804a33a Compare April 10, 2026 13:58
abmcar added a commit to abmcar/DTVM that referenced this pull request Apr 10, 2026
Align with PR DTVMStack#428 to eliminate cross-PR merge conflicts on the
shared table row.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@zoowii zoowii merged commit 417c739 into DTVMStack:main Apr 13, 2026
16 checks passed
@abmcar abmcar deleted the perf/compiler-u256-mulx-adx branch April 14, 2026 06:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants