GH-116017: Put JIT code and data on the same page #116845

brandtbucher · 2024-03-14T22:56:47Z

Instead of marking code as read-exec and data as read-only, put both on the same read-exec pages. This halves the amount of memory used for short traces, and also results in half as many expensive mprotect calls per trace.

3% faster (10% faster startup), 9% less memory (12% less at startup).

The next step after this will be sharing pages between traces, to further reduce the amount of wasted memory. At the same time, we can expose a way to compile multiple traces at once with a single mprotect call (useful for the cold exit array).

Issue: Compiling tiny traces wastes lots of memory #116017

brandtbucher · 2024-03-14T23:04:29Z

This also fixes an issue where we weren't using the correct alignment during parts of the build process and things just accidentally worked (due to the data starting on a page boundary).

Python/jit.c

mdboom · 2024-03-15T21:10:44Z

I realise it probably makes sense to benchmark this one on macOS (since that's where we saw the greatest memory increase in #116017). I'll fire off a run right now.

brandtbucher · 2024-03-15T21:26:03Z

Oh, I already did. :) Results were less good: 1% faster, no memory impact (which is a bit odd, but I haven't really dug into it yet).

hartwork · 2024-03-16T01:13:28Z

Instead of marking code as read-exec and data as read-only, put both on the same read-exec pages.

@brandtbucher I may be missing something here: is this a reduction in security? Has this been discussed with the security team?

brandtbucher · 2024-03-18T03:22:07Z

Instead of marking code as read-exec and data as read-only, put both on the same read-exec pages.

@brandtbucher I may be missing something here: is this a reduction in security? Has this been discussed with the security team?

I should clarify: this change doesn't mean that we're now executing arbitrary user data. In this case, "data" is stuff like C string literals for error messages, pointers to cached objects, static C function addresses, version numbers, opargs, etc.

Since this data is now executable (as it lives on the same pages as the machine code), in theory an attacker capable of forcing the compilation of an instruction operand that happens to have the same encoding as a machine instruction could take control of the program. However, that would require that we jump into the data at some point, something which never happens during normal operation. As such, this would require a separate, highly specific bug in machine code generation in order to actually exploit.

The most likely way this would happen is not by jumping from the machine code into the data, but rather accidentally running off the end of the machine code into the data portion of the buffer (again, not something that happens with the sort of well-formed traces that tier two creates). This situation currently crashes on main, but would indeed begin executing the "read-only" data with the proposed change. However, this patch mitigates this risk by always adding a _FATAL_ERROR block to the end of the code portion of the trace, assuring that even if we did have a malformed trace, the program would abort before it actually overran the buffer.

I appreciate your comment, though. Please let me know if I've missed anything, or if you still have any concerns.

)

mdboom · 2024-03-20T16:34:12Z

@brandtbucher: Just FYI: a 20% reduction in memory usage on macOS on the benchmarking suite: https://github.com/faster-cpython/benchmarking-public/blob/main/results/bm-20240315-3.13.0a5%2B-e6d8e6d-JIT/bm-20240315-darwin-arm64-brandtbucher-justin_mprotect-3.13.0a5%2B-e6d8e6d-vs-base-mem.png

brandtbucher · 2024-03-20T16:35:36Z

Awesome! Wonder why it didn't come through in my first run...

mdboom · 2024-03-20T16:37:11Z

The compiler on that machine was still broken when you kicked it off yesterday -- I didn't truly fix it until this morning.

mdboom · 2024-03-20T16:37:32Z

It's also a solid 1-3% faster on macOS.

mdboom · 2024-03-20T16:42:07Z

The compiler on that machine was still broken when you kicked it off yesterday -- I didn't truly fix it until this morning.

Oh, I see you mean the run from 5 days ago. Yeah, it's not clear why it shows "no change". Something to keep an eye on for the future in case there is a bug somewhere in the benchmarking infra (but not clear what it would be).

)

brandtbucher added 6 commits February 29, 2024 15:31

Use one mprotect call instead of two

84bbac4

Don't use separate pages for code and data

4f21fe2

Fix alignment issues

9217066

Add missing alignment check

41455df

Catch up with main

6efff81

Clean up alignement calculation

36c6c78

brandtbucher added performance Performance or resource usage skip news interpreter-core (Objects, Python, Grammar, and Parser dirs) labels Mar 14, 2024

brandtbucher requested a review from markshannon March 14, 2024 22:56

brandtbucher self-assigned this Mar 14, 2024

bedevere-app bot added the awaiting core review label Mar 14, 2024

bedevere-app bot mentioned this pull request Mar 14, 2024

Compiling tiny traces wastes lots of memory #116017

Open

Be extra paranoid

26c31d9

mdboom reviewed Mar 15, 2024

View reviewed changes

Python/jit.c Outdated Show resolved Hide resolved

total_size

e6d8e6d

brandtbucher mentioned this pull request Mar 15, 2024

GH-116422: Tier2 hot/cold splitting #116813

Merged

brandtbucher merged commit 2c82592 into python:main Mar 19, 2024
56 of 57 checks passed

bedevere-app bot removed the awaiting core review label Mar 19, 2024

vstinner pushed a commit to vstinner/cpython that referenced this pull request Mar 20, 2024

pythonGH-116017: Put JIT code and data on the same page (pythonGH-116845

318e6f1

)

adorilson pushed a commit to adorilson/cpython that referenced this pull request Mar 25, 2024

pythonGH-116017: Put JIT code and data on the same page (pythonGH-116845

62ef665

)

diegorusso pushed a commit to diegorusso/cpython that referenced this pull request Apr 17, 2024

pythonGH-116017: Put JIT code and data on the same page (pythonGH-116845

6b82138

)

brandtbucher added the topic-JIT label May 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GH-116017: Put JIT code and data on the same page #116845

GH-116017: Put JIT code and data on the same page #116845

brandtbucher commented Mar 14, 2024 •

edited by bedevere-app bot

brandtbucher commented Mar 14, 2024

mdboom commented Mar 15, 2024 •

edited

brandtbucher commented Mar 15, 2024

hartwork commented Mar 16, 2024

brandtbucher commented Mar 18, 2024 •

edited

mdboom commented Mar 20, 2024

brandtbucher commented Mar 20, 2024

mdboom commented Mar 20, 2024

mdboom commented Mar 20, 2024

mdboom commented Mar 20, 2024

GH-116017: Put JIT code and data on the same page #116845

GH-116017: Put JIT code and data on the same page #116845

Conversation

brandtbucher commented Mar 14, 2024 • edited by bedevere-app bot

brandtbucher commented Mar 14, 2024

mdboom commented Mar 15, 2024 • edited

brandtbucher commented Mar 15, 2024

hartwork commented Mar 16, 2024

brandtbucher commented Mar 18, 2024 • edited

mdboom commented Mar 20, 2024

brandtbucher commented Mar 20, 2024

mdboom commented Mar 20, 2024

mdboom commented Mar 20, 2024

mdboom commented Mar 20, 2024

brandtbucher commented Mar 14, 2024 •

edited by bedevere-app bot

mdboom commented Mar 15, 2024 •

edited

brandtbucher commented Mar 18, 2024 •

edited