perf(lexer): use portable-SIMD to speed up multiline comment scanning #23

Boshen · 2023-02-17T06:32:11Z

No description provided.

github-actions · 2023-02-17T06:43:47Z

Parser Benchmark Results

group                    main                                   pr
-----                    ----                                   --
parser/babylon.max.js    1.06    150.8±2.27ms    68.5 MB/sec    1.00    142.4±2.67ms    72.5 MB/sec
parser/d3.js             1.00     18.1±0.36ms    30.2 MB/sec    1.01     18.3±0.32ms    29.8 MB/sec
parser/lodash.js         1.24      6.4±0.13ms    80.5 MB/sec    1.00      5.1±0.12ms   100.2 MB/sec
parser/pdf.js            1.00     10.3±0.17ms    39.1 MB/sec    1.01     10.4±0.20ms    38.7 MB/sec
parser/typescript.js     1.00    143.9±2.59ms    66.9 MB/sec    1.00    143.8±2.32ms    66.9 MB/sec

strager · 2023-02-23T23:00:38Z

crates/oxc_parser/src/lexer/simd.rs

+        let star_mask = any_star.to_bitmask();
+        let slash_mask = any_slash.to_bitmask();


I have found that to_bitmask on ARM NEON can cause parsing to be slower compared to a scalar version. My block comment parser only uses SIMD on x86, not ARM. Do you run your benchmarks on ARM NEON or just Intel SSE?

On x86_64, to_bitmask is one instruction. On AArch64, there is no instruction and to_bitmask needs to be emulated. See: https://godbolt.org/z/531oEo5d1 It looks like the Rust compiler currently does a terrible job on AArch64. You can definitely do better by hand, but it still might not be faster than the scalar version.
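A minimal sketch of the "SIMD on x86 only, scalar elsewhere" idea described above. This is not the code from this PR or from any existing parser: the function names `find_comment_end` and `find_comment_end_scalar` are hypothetical, and it uses the stable `std::arch` SSE2 intrinsics instead of portable-SIMD so that it builds on stable Rust. It assumes the input slice starts just after the opening `/*`.

```rust
/// Scalar fallback: returns the index just past the first `*/`, if any.
/// (Hypothetical helper, not part of oxc.)
fn find_comment_end_scalar(src: &[u8]) -> Option<usize> {
    src.windows(2).position(|w| w == b"*/").map(|i| i + 2)
}

/// x86_64 path: compare 16 bytes at a time and turn each comparison into
/// a bitmask with `_mm_movemask_epi8`, which is a single instruction here.
#[cfg(target_arch = "x86_64")]
fn find_comment_end(src: &[u8]) -> Option<usize> {
    use std::arch::x86_64::{
        __m128i, _mm_cmpeq_epi8, _mm_loadu_si128, _mm_movemask_epi8, _mm_set1_epi8,
    };

    let mut offset = 0;
    // Each step needs 17 bytes: 16 lanes for `*` plus one extra byte so the
    // `/` that follows lane 15 can be checked too.
    while offset + 17 <= src.len() {
        // SAFETY: SSE2 is part of the x86_64 baseline, and both unaligned
        // loads read 16 bytes that lie inside `src` (offset + 17 <= len).
        let mask = unsafe {
            let stars = _mm_loadu_si128(src.as_ptr().add(offset) as *const __m128i);
            let slashes = _mm_loadu_si128(src.as_ptr().add(offset + 1) as *const __m128i);
            let star_mask =
                _mm_movemask_epi8(_mm_cmpeq_epi8(stars, _mm_set1_epi8(b'*' as i8)));
            let slash_mask =
                _mm_movemask_epi8(_mm_cmpeq_epi8(slashes, _mm_set1_epi8(b'/' as i8)));
            // Bit i is set when src[offset + i] == '*' and src[offset + i + 1] == '/'.
            (star_mask & slash_mask) as u32
        };
        if mask != 0 {
            return Some(offset + mask.trailing_zeros() as usize + 2);
        }
        offset += 16;
    }
    // Fewer than 17 bytes left: finish with the scalar scan.
    find_comment_end_scalar(&src[offset..]).map(|i| offset + i)
}

/// Every other architecture (including AArch64) takes the scalar path,
/// avoiding an emulated bitmask extraction.
#[cfg(not(target_arch = "x86_64"))]
fn find_comment_end(src: &[u8]) -> Option<usize> {
    find_comment_end_scalar(src)
}
```

Whether the scalar path actually wins on AArch64 would still need to be measured on real hardware; the `cfg` split only makes it easy to benchmark the two variants per target.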

Boshen force-pushed the simd-multi branch from 9a18900 to ed460b1 Compare February 17, 2023 06:52

Boshen changed the title ~~fix(lexer): fix simd multiline comment scanner with '*' on the boundary~~ perf(lexer): use portable-SIMD to speed up multiline comment scanning Feb 20, 2023

Boshen marked this pull request as draft February 20, 2023 09:27

feat(lexer): use portable-SIMD to speed up multiline comment scanning

6ce3cea

Boshen force-pushed the simd-multi branch from ed460b1 to 6ce3cea Compare February 20, 2023 13:31

Boshen marked this pull request as ready for review February 20, 2023 13:57

Boshen merged commit 83c3f34 into main Feb 20, 2023

Boshen deleted the simd-multi branch February 20, 2023 13:58

Boshen added this to the AST / Lexer / Parser milestone Feb 21, 2023

strager reviewed Feb 23, 2023

Boshen mentioned this pull request Feb 24, 2023

Benchmark on different cpu architectures #42

Closed
