Migrate to RaptorJIT #1264

lukego · 2018-01-15T15:00:02Z

This pull request migrates Snabb from LuaJIT to RaptorJIT. It supersedes #1172, which is based on the same branch but was opened early as a preview of the development.

Consequences:

JIT tracing and dumping will be always enabled. The shm file audit.log contains an efficient binary representation of the information from jit.v and jit.dump. This must be decoded with separate tools (Studio.)
Profiling will be always enabled. The shm directory vmprofile/ contains profiler datasets for the engine and for each individual app. The profiler data records which JIT traces were hot and separately tracks machine code, FFI code, VM code, GC, etc. This data must also be decoded with separate tools (Studio and also tools by @eugeneia.)
The jit.dump, jit.v, and jit.p tools are no longer available. These are superseded by the features above which are superior because they are "always on" and available in production. Sorry! We will need to adjust to the new tools and identify the use cases that they currently miss.
RaptorJIT has some different optimizations and may perform better or worse on certain things. We will need to identify and fix any regressions.
We will be our own masters who can hack on the JIT compiler for fun and profit! :-)

TODO:

I need to provide instructions for examining trace and profiler data with Studio.

I am hoping to land this branch in the next release. Thoughts?

Contributed by Djordje Kovacevic and Stefan Pejic from RT-RK.com. Sponsored by Cisco Systems, Inc.

Extend the "Lua C API" for vmprofile to support allocating multiple profiles and switching between them. This makes it easier to use. Previously the expectation was that this functionality would be implemented in Lua code using ljsyscall to allocate file-backed shared memory for profiles and then the FFI to switch between them. (This is still possible, too, and it works the same as it did before.) Added test coverage to the test suite.
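As a rough illustration of what "allocating multiple profiles and switching between them" means, here is a toy model in Python. It is only a sketch: the class and method names (`ProfileSet`, `open`, `select`, `sample`) and the app name are hypothetical and are not the names used by the vmprofile C API or its Lua binding.

```python
from collections import Counter

class ProfileSet:
    """Toy model: several named sample counters, one of which is active."""

    def __init__(self):
        self.profiles = {}   # name -> Counter of samples per VM state
        self.active = None   # name of the profile currently receiving samples

    def open(self, name):
        # Allocate the profile if it does not exist yet and return it.
        return self.profiles.setdefault(name, Counter())

    def select(self, name):
        # Switch the active profile; later samples are charged to it.
        self.open(name)
        self.active = name

    def sample(self, vmstate):
        # Stands in for one profiler timer tick.
        if self.active is not None:
            self.profiles[self.active][vmstate] += 1

profiles = ProfileSet()
profiles.select("engine")
profiles.sample("interp")
profiles.select("app.nic")   # hypothetical app name
profiles.sample("mcode")
profiles.sample("gc")
```

Each profile keeps its own counts, so samples taken while "app.nic" is selected never show up under "engine".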

vmprofile: Extend Lua API and add test cases

raptorjit.nix: Fix executable name / broken build

For debugging purposes it is very useful to be able to refer to the origin of a trace (parent/exit), so this change stores that information persistently in GCtrace instead of only ephemerally in jit_State. These fields are now duplicated in jit_State (valid while recording) and GCtrace (valid after recording). This duplication could be avoided by putting them only in GCtrace and accessing them via J->cur, but that would be a noisier change since the existing fields are accessed from many places, including DynASM macros.

Just to be clear about what is actually being checked.

Oops. The magic number and version fields of the VMProfile structure were only initialized via the FFI interface and not the C API one.

Make JIT trace numbers unique for completed traces. This way a trace number is an unambiguous way to refer to a GCtrace object even across flushes of the JIT. The immediate motivation is to make the vmprofile data valid across JIT flushes. The profiler counts samples by trace number, so when trace numbers are reused multiple traces can "collide" on the same profiler counter.
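The collision described above can be reproduced with a small simulation. This is a sketch in Python, not RaptorJIT code: the event names and the `count_samples` helper are made up for illustration, and the only assumption about the old behavior is that numbering restarts after a flush.

```python
from collections import Counter

def count_samples(events, unique_numbers):
    """Replay 'trace'/'sample'/'flush' events and count samples per trace number."""
    counters = Counter()
    next_number = 1
    current = None
    for event in events:
        if event == "trace":
            # A new trace is completed and becomes the one being executed.
            current = next_number
            next_number += 1
        elif event == "flush":
            current = None
            if not unique_numbers:
                next_number = 1   # numbering restarts, so numbers get reused
        elif event == "sample" and current is not None:
            counters[current] += 1
    return counters

events = ["trace", "sample", "sample", "flush", "trace", "sample"]
reused = count_samples(events, unique_numbers=False)
unique = count_samples(events, unique_numbers=True)
```

With reused numbers, both traces are charged to counter 1 (three samples in total). With unique numbers, the first trace keeps its two samples under 1 and the trace recorded after the flush gets its own counter 2.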

Support JIT for taking the difference between two pointers whose element sizes are not a power of 2. This was previously an NYI but now generates code using a division to convert a pointer to an element number. The case where the element size is a power of 2 is handled the same way as before, i.e. using a bit-shift instead of a division.
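The arithmetic behind the two cases can be sketched as follows. This is an illustration in Python of the shift-versus-division choice, not the code the JIT emits; `pointer_diff` is a hypothetical helper and the addresses are plain integers.

```python
def pointer_diff(addr_a, addr_b, elem_size):
    """Return the element distance between two byte addresses in the same array."""
    byte_diff = addr_a - addr_b
    if elem_size & (elem_size - 1) == 0:
        # Power of two: an arithmetic shift by log2(elem_size) is enough.
        return byte_diff >> (elem_size.bit_length() - 1)
    # Any other size needs a division. For pointers into the same array the
    # byte difference is an exact multiple of elem_size, so there is no remainder.
    return byte_diff // elem_size

base = 0x1000
diff_12 = pointer_diff(base + 5 * 12, base, 12)   # 12-byte elements: division
diff_8 = pointer_diff(base, base + 2 * 8, 8)      # 8-byte elements: shift
```

Here `diff_12` is 5 and `diff_8` is -2. A 12-byte element is a typical example of a struct size that is not a power of 2.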

Suggested by François Perrad.

Contributed by Peter Cawley.

Suggested by François Perrad.

Contributed by François Perrad.

Fix lj_auditlog double-inclusion and accidental globals

wingo · 2018-03-20T14:18:35Z

I am working on raptorjit+snabb currently. The branch is https://github.com/Igalia/snabb/tree/raptorjit. Current additions relative to this branch are Igalia#1032, a merge from master (Igalia#1033), and a fix for DynASM 64-bit immediates (#1302).

Merge v2018.01.2 to raptorjit

Load potentially 64-bit values using mov64

RaptorJIT prefers to always write telemetry to an audit log, and allow rich analysis of this log with external tools.

See raptorjit/raptorjit#160.

Remove profiler

As in raptorjit/raptorjit#160, remove references to -jv and such.

Remove references to -jv, -jdump, and the like

This is another instance of the bug from commit 93ef6bd. We didn't see any issue on upstream Snabb's test suites, but with RaptorJIT's new LJ_GC64 usage it manifested itself as intermittent heap corruption. Fixes snabbco#1307.

Fix out-of-bounds write in ctable test suite

If a 64-bit value is provided as a 32-bit immediate operand for a DynASM instruction, raise an error. The previous behavior was to automatically truncate to 32 bits. This is particularly significant for GC64 mode, where pointers to objects created by `ffi.new()` cannot be safely truncated to 32 bits. The solution for referencing 64-bit immediates is to use 'mov64', and this is suggested in the error message.
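To see why silent truncation is dangerous, consider this sketch in Python. It is not DynASM code: `imm32` is a hypothetical helper, the signed 32-bit range is an assumption about which values count as fitting, and the error text is invented rather than copied from the real message.

```python
def imm32(value):
    """Return value if it fits a signed 32-bit immediate, otherwise raise."""
    if -2**31 <= value < 2**31:
        return value
    raise ValueError(
        f"{value:#x} does not fit in a 32-bit immediate; use mov64 instead")

# A made-up address above 4 GB, like those GC64 mode can hand out.
pointer = 0x7f0000001000

# What the old behavior amounted to: the high bits are silently dropped.
truncated = pointer & 0xffffffff
```

The truncated value is `0x1000`, a different address from the one the caller meant, which is why raising an error is the safer default.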

dasm: Error when a 64-bit value is used as a 32-bit immediate

…rce" This reverts commit afbb06e, reversing changes made to 2ff378a.

…it-upstream

Git had misplaced this file in src/ instead of lib/luajit/src/

This should include a fix for snabbco#1303.

Fix auditlog thrashing

Limit the auditlog to 100MB. This should be ample. Future changes should make this configurable.

Timestamps are based on CLOCK_MONOTONIC i.e. suitable for calculating the time delta from one event to the next.

Extends the auditlog feature with event timestamps and a size limit.
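The two additions, a size cap and monotonic timestamps, can be modelled like this. It is a sketch in Python under stated assumptions: `BoundedLog` is a hypothetical class, dropping events once the cap is reached is an assumption about the policy, and `time.monotonic_ns()` stands in for reading CLOCK_MONOTONIC.

```python
import time

class BoundedLog:
    """Toy event log with a byte limit and a monotonic timestamp per event."""

    def __init__(self, limit_bytes):
        self.limit_bytes = limit_bytes
        self.size = 0
        self.events = []    # list of (nanotime, payload) tuples
        self.dropped = 0    # events rejected because the log was full

    def append(self, payload):
        # Refuse the event if it would push the log past its limit.
        if self.size + len(payload) > self.limit_bytes:
            self.dropped += 1
            return False
        self.events.append((time.monotonic_ns(), payload))
        self.size += len(payload)
        return True

    def deltas(self):
        # Time elapsed between consecutive events, in nanoseconds.
        stamps = [stamp for stamp, _ in self.events]
        return [later - earlier for earlier, later in zip(stamps, stamps[1:])]

log = BoundedLog(limit_bytes=10)
log.append(b"12345")
log.append(b"12345")
accepted = log.append(b"1")   # the log is already at its 10-byte limit
```

A monotonic clock never goes backwards, so the deltas are never negative. The absolute values are not wall-clock times and are only meaningful relative to each other.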

wingo · 2018-04-03T09:50:33Z

From what I can see of the log there are two blockers here:

(1) Not so good perf on basic1-100e6; that could be statistical though. Better to use hydra here.
(2) A failure in the tap app's selftest. Seems to be legit. I didn't see it locally because I don't run with SNABB_TAPTEST.

eugeneia · 2019-01-07T14:46:34Z

Closing this one in favour of #1316.

Mike Pall and others added 30 commits July 26, 2017 09:52

PPC: Add soft-float support to interpreter.

fd37da0

Contributed by Djordje Kovacevic and Stefan Pejic from RT-RK.com. Sponsored by Cisco Systems, Inc.

testsuite/bench: Remove PARAM_* for unsupported platforms

d7fd44e

Merge pull request snabbco#77 from lukego/vmprofile

eaf418c

vmprofile: Extend Lua API and add test cases

Merge branch 'raptorjit/master' into auditlog

f8d1c9e

raptorjit.nix: Fix executable name / broken build

d095eae

.travis.yml: Update Travis-CI config (was not testing)

8ec77f9

check-generated-code.nix: Fixed

076d161

check-generated-code.nix: Tweak order of diff args

d27d4d4

reusevm: Updated generated code

300d14e

Merge pull request snabbco#80 from lukego/fix-build

5708b06

raptorjit.nix: Fix executable name / broken build

check-generated-code.nix: Add more verbosity

fcf7053

Just to be clear about what is actually being checked.

Merge branch 'gctrace-origin' into auditlog

8fc672c

lj_vmprofile.c: Fixed to set file magic number with C API calls

4a06d4a

Oops. The magic number and version fields of the VMProfile structure were only initialized via the FFI interface and not the C API one.

Merge branch 'vmprofile' into auditlog

a292144

Merge branch 'unique-trace-numbers' into auditlog

48be218

Merge raptorjit master into lib/raptorjit

505831d

Makefile: Add 'make reusevm' for raptorjit

2b8d1a3

src/Makefile: Switch "luajit" command to "raptorjit"

1d81da7

src/Makefile: Fix path to raptorjit.a

bbbd4b6

snabbnfv.traffic: Remove jit.p and jit.dump calls (not in raptorjit)

55ac7f5

Remove Lua 5.0 compatibility defines.

07f976a

Suggested by François Perrad.

LJ_GC64: Fix BC_CALLM snapshot handling.

5126a06

x64/LJ_GC64: Fix emit_loadk64().

de9a886

Contributed by Peter Cawley.

Merge branch 'master' into v2.1

e834e9c

Remove old Lua 5.0 compatibility defines.

dbe5619

Suggested by François Perrad.

Add some more changes and extensions from Lua 5.2.

d6db005

Contributed by François Perrad.

wingo and others added 2 commits March 20, 2018 14:22

Merge pull request snabbco#1032 from Igalia/raptorjit-upstream

003050e

Fix lj_auditlog double-inclusion and accidental globals

Merge v2018.01.2 to raptorjit

7b50bda

wingo and others added 7 commits March 20, 2018 15:35

Merge pull request snabbco#1033 from Igalia/2018-1-2-to-raptorjit

8b199b7

Merge v2018.01.2 to raptorjit

Merge pull request snabbco#1034 from Igalia/fix-gc64-dynasm-immediates

401d0dd

Load potentially 64-bit values using mov64

Remove references to the profiler, -jv, -jdump, and the like

8119e5d

RaptorJIT prefers to always write telemetry to an audit log, and allow rich analysis of this log with external tools.

Merge github.com/Igalia/raptorjit 'remove-profiler' branch

2eebf6b

See raptorjit/raptorjit#160.

Merge pull request snabbco#1035 from Igalia/merge-raptorjit

6637b38

Remove profiler

Remove references to -jv, -jdump, and the like

bbd7635

As in raptorjit/raptorjit#160, remove references to -jv and such.

Merge pull request snabbco#1036 from Igalia/remove-traceprof

b336b2b

Remove references to -jv, -jdump, and the like

wingo mentioned this pull request Mar 20, 2018

[WIP] Another RaptorJIT integration branch #1306

Closed

wingo and others added 15 commits March 21, 2018 17:20

Fix out-of-bounds write in ctable test suite

0736ffd

This is another instance of the bug from commit 93ef6bd. We didn't see any issue on upstream Snabb's test suites, but with RaptorJIT's new LJ_GC64 usage it manifested itself as intermittent heap corruption. Fixes snabbco#1307.

Merge pull request snabbco#1038 from Igalia/fix-oob-write-in-ctable-test

2ff378a

Fix out-of-bounds write in ctable test suite

Merge pull request snabbco#1040 from lukego/dynasm-gc64-nocoerce

afbb06e

dasm: Error when a 64-bit value is used as a 32-bit immediate

Revert "Merge pull request snabbco#1040 from lukego/dynasm-gc64-nocoe…

6943135

…rce" This reverts commit afbb06e, reversing changes made to 2ff378a.

Merge remote-tracking branch 'lukego-raptorjit/auditlog' into raptorj…

5fbba43

…it-upstream

Move lj_auditlog.c out of Snabb and into RaptorJIT

7bc8597

Git had misplaced this file in src/ instead of lib/luajit/src/

Merge 'lukego/raptorjit-upstream'

648b061

This should include a fix for snabbco#1303.

Merge pull request snabbco#1042 from Igalia/fix-auditlog-thrashing

2e7b9c1

Fix auditlog thrashing

Merge remote-tracking branch 'igalia/raptorjit' into raptorjit-upstream

1ac2f85

luajit.c: Make -a and -p argument handling more consistent

0158240

lj_audit.log.c: Add file size limit (default 100MB)

d0dcc75

Limit the auditlog to 100MB. This should be ample. Future changes should make this configurable.

lj_auditlog.c: Add "nanotime" timestamp to events

b0cbca7

Timestamps are based on CLOCK_MONOTONIC i.e. suitable for calculating the time delta from one event to the next.

Merge lukego/raptorjit#auditlog into raptorjit-upstream

9b30025

Extends the auditlog feature with event timestamps and a size limit.

lj_auditlog.c: Increase in-memory log limit to 10MB (from 1MB)

f4ff59e

lukego mentioned this pull request Apr 4, 2018

RaptorJIT branch for upstreaming #1316

Merged

eugeneia closed this Jan 7, 2019
