Add support for `sys.monitoring` events. #9482

stuartarchibald · 2024-03-07T18:12:04Z

Python 3.12 introduced a new event monitoring system sys.monitoring. This patch augments Numba's dispatcher so as to emit events associated with a Python function starting and returning from execution. Tools monitoring for such events will therefore be able to identify execution of Numba's pure machine code regions as though they were actual Python functions. This allows tools such a cProfile to record Numba compiled function execution and report them as part of their output.

Fixes #9289

Python 3.12 introduced a new event monitoring system `sys.monitoring`. This patch augments Numba's dispatcher so as to emit events associated with a Python function starting and returning from execution. Tools monitoring for such events will therefore be able to identify execution of Numba's pure machine code regions as though they were actual Python functions. This allows tools such a ``cProfile`` to record Numba compiled function execution and report them as part of their output. Fixes numba#9289

sklam

I have

read through the code to understand the implementation
manual run cProfile to compare py3.9 vs py3.12 results

Still have to review the code and cross check Python impl.

sklam · 2024-04-03T17:30:50Z

nevermind, it's just typical "leak" in numba.tests.test_profiler.TestProfiler.test_profiler.

original message

Valgrind showing these when running python runtests.py numba/tests/test_profiler.py:


==3321719== 8 bytes in 1 blocks are definitely lost in loss record 47 of 4,298
==3321719==    at 0x483DF0F: operator new(unsigned long) (vg_replace_malloc.c:483)
==3321719==    by 0x576333F4: allocate (new_allocator.h:114)
==3321719==    by 0x576333F4: allocate (alloc_traits.h:444)
==3321719==    by 0x576333F4: _M_allocate (stl_vector.h:343)
==3321719==    by 0x576333F4: reserve (vector.tcc:78)
==3321719==    by 0x576333F4: addDefinition (_dispatcher.cpp:435)
==3321719==    by 0x576333F4: Dispatcher_Insert(Dispatcher*, _object*, _object*) (_dispatcher.cpp:586)
==3321719==    by 0x556BF9: method_vectorcall_VARARGS_KEYWORDS (descrobject.c:365)
==3321719==    by 0x546CF0: UnknownInlinedFun (pycore_call.h:92)
==3321719==    by 0x546CF0: PyObject_Vectorcall (call.c:325)
==3321719==    by 0x52D15B: _PyEval_EvalFrameDefault (bytecodes.c:2706)
==3321719==    by 0x581CF9: UnknownInlinedFun (pycore_ceval.h:89)
==3321719==    by 0x581CF9: UnknownInlinedFun (ceval.c:1683)
==3321719==    by 0x581CF9: UnknownInlinedFun (call.c:419)
==3321719==    by 0x581CF9: UnknownInlinedFun (pycore_call.h:92)
==3321719==    by 0x581CF9: method_vectorcall (classobject.c:91)
==3321719==    by 0x53179E: UnknownInlinedFun (call.c:387)
==3321719==    by 0x53179E: _PyEval_EvalFrameDefault (bytecodes.c:3254)
==3321719==    by 0x524E56: UnknownInlinedFun (pycore_ceval.h:89)
==3321719==    by 0x524E56: UnknownInlinedFun (ceval.c:1683)
==3321719==    by 0x524E56: UnknownInlinedFun (call.c:419)
==3321719==    by 0x524E56: _PyObject_FastCallDictTstate (call.c:133)
==3321719==    by 0x562BD5: _PyObject_Call_Prepend (call.c:508)
==3321719==    by 0x63F815: slot_tp_call (typeobject.c:8769)
==3321719==    by 0x520D4A: _PyObject_MakeTpCall (call.c:240)
==3321719==    by 0x52D15B: _PyEval_EvalFrameDefault (bytecodes.c:2706)
==3321719== 
==3321719== 8 bytes in 1 blocks are definitely lost in loss record 48 of 4,298
==3321719==    at 0x483DF0F: operator new(unsigned long) (vg_replace_malloc.c:483)
==3321719==    by 0x57633695: allocate (new_allocator.h:114)
==3321719==    by 0x57633695: allocate (alloc_traits.h:444)
==3321719==    by 0x57633695: _M_allocate (stl_vector.h:343)
==3321719==    by 0x57633695: void std::vector<_object*, std::allocator<_object*> >::_M_realloc_insert<_object* const&>(__gnu_cxx::__normal_iterator<_object**, std::vector<_object*, std::allocator<_object*> > >, _object* const&) (vector.tcc:440)
==3321719==    by 0x576334A5: push_back (stl_vector.h:1195)
==3321719==    by 0x576334A5: addDefinition (_dispatcher.cpp:439)
==3321719==    by 0x576334A5: Dispatcher_Insert(Dispatcher*, _object*, _object*) (_dispatcher.cpp:586)
==3321719==    by 0x556BF9: method_vectorcall_VARARGS_KEYWORDS (descrobject.c:365)
==3321719==    by 0x546CF0: UnknownInlinedFun (pycore_call.h:92)
==3321719==    by 0x546CF0: PyObject_Vectorcall (call.c:325)
==3321719==    by 0x52D15B: _PyEval_EvalFrameDefault (bytecodes.c:2706)
==3321719==    by 0x581CF9: UnknownInlinedFun (pycore_ceval.h:89)
==3321719==    by 0x581CF9: UnknownInlinedFun (ceval.c:1683)
==3321719==    by 0x581CF9: UnknownInlinedFun (call.c:419)
==3321719==    by 0x581CF9: UnknownInlinedFun (pycore_call.h:92)
==3321719==    by 0x581CF9: method_vectorcall (classobject.c:91)
==3321719==    by 0x53179E: UnknownInlinedFun (call.c:387)
==3321719==    by 0x53179E: _PyEval_EvalFrameDefault (bytecodes.c:3254)
==3321719==    by 0x524E56: UnknownInlinedFun (pycore_ceval.h:89)
==3321719==    by 0x524E56: UnknownInlinedFun (ceval.c:1683)
==3321719==    by 0x524E56: UnknownInlinedFun (call.c:419)
==3321719==    by 0x524E56: _PyObject_FastCallDictTstate (call.c:133)
==3321719==    by 0x562BD5: _PyObject_Call_Prepend (call.c:508)
==3321719==    by 0x63F815: slot_tp_call (typeobject.c:8769)
==3321719==    by 0x520D4A: _PyObject_MakeTpCall (call.c:240)

numba/tests/test_sys_monitoring.py

numba/_dispatcher.cpp

Adds support to Numba's dispatcher for the ``sys.monitoring.events`` of type ``RAISE`` and `PY_UNWIND``. Associated unit tests are added and cProfile testing is updated. Docs are updated to match.

Resolved conflicts: numba/tests/support.py

As title.

If a Numba function is being run under cProfile and the function raises an exception, the dispatcher must handle this correctly. It is not valid to call `PyFrame_FastToLocals` without saving and restoring the exception state across the call as it can clear the exception state itself. This patch fixes this problem and also stops calling `PyFrame_LocalsToFast` in Python 3.11 where the "whats-new" docs claim that the frames are now looked after by the virutal machine.

Fixes some RST syntax and a couple of grammatical errors.

sklam · 2024-04-24T18:13:10Z

BFID numba_smoketest_cpu_yaml_193

sklam · 2024-04-24T18:44:29Z

There's an elusive bug that is only revealed by minimal test sequence:

python runtests.py numba.tests.test_sys_monitoring.TestMonitoring.test_disable_from_callback numba.tests.test_sys_monitoring.TestMonitoring.test_start_event

output:

test_start_event (numba.tests.test_sys_monitoring.TestMonitoring.test_start_event) ... FAIL

Stdout:
[call(<code object foo at 0x111c61f10, file "/Users/siu/dev/numba/numba/tests/test_sys_monitoring.py", line 16>, 0)]

======================================================================
FAIL: test_start_event (numba.tests.test_sys_monitoring.TestMonitoring.test_start_event)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/Users/siu/dev/numba/numba/tests/test_sys_monitoring.py", line 136, in test_start_event
    self.check_py_start_calls(cb)
  File "/Users/siu/dev/numba/numba/tests/test_sys_monitoring.py", line 77, in check_py_start_calls
    self.assertEqual(mockcalls.call_count, 2)
AssertionError: 1 != 2

Stdout:
[call(<code object foo at 0x111c61f10, file "/Users/siu/dev/numba/numba/tests/test_sys_monitoring.py", line 16>, 0)]

----------------------------------------------------------------------
Ran 2 tests in 0.165s

sklam

#9482 (comment) is the only pending problem and I have no idea why it is happening.

Code and documentation looks great

gmarkall · 2024-04-24T23:36:05Z

There's an elusive bug that is only revealed by minimal test sequence

I can reproduce this locally too.

This works around a potential bug in CPython where the state associated with a monitoring tool can "leak" in an unexpected manner. This patch also removes the use of the functools cache on the generator for test functions, this to eliminate any potential state being stored on the code objects associated with the test functions.

stuartarchibald · 2024-04-26T14:57:49Z

Thanks for finding a MWR and helping debug this @sklam, commit 5d652e7 works around the problem.

stuartarchibald · 2024-04-26T15:35:33Z

Note for the future. Were this feature request implemented python/cpython#111997 the code in this patch could be adapted to use it opposed to having to rely on CPython internal details.

Edit: This PR python/cpython#116413 implements the above. Looks like it will be in Python 3.13.

sklam

Workaround confirmed. One minor issue with the code comments

sklam · 2024-04-26T15:37:03Z

numba/tests/test_sys_monitoring.py

+            # It is necessary to restart events that have been disabled. The
+            # "disabled" state of the `PY_START` event for the tool
+            # `self.tool_id` "leaks" into subsequent tests. These tests then end
+            # up failing as events that should trigger do not! It's not really


# up failing as events that should trigger do not! It's not really ^^^^^^^

you mean that should not trigger?

The comment is trying to convey that "subsequent tests fail as events in them that should be triggered are not being triggered" (this was why the Mock.call_count value was 1 too small, an event that should have triggered did not).

My english parser segfaulted when reading it.

I've tried to make this comment more clear in 25535b3. Please take a look and see what you think. Thanks!

sklam · 2024-04-26T16:05:54Z

Reported issue to python/cpython#118327
5d652e7 is the workaround.

As title.

stuartarchibald · 2024-04-26T16:11:33Z

Reported issue to python/cpython#118327 5d652e7 is the workaround.

Thanks for reporting this @sklam, much appreciated.

stuartarchibald added the 2 - In Progress label Mar 7, 2024

sklam reviewed Mar 8, 2024

View reviewed changes

sklam added this to the 0.60.0-rc1 milestone Mar 12, 2024

sklam reviewed Apr 3, 2024

View reviewed changes

numba/tests/test_sys_monitoring.py Show resolved Hide resolved

numba/_dispatcher.cpp Show resolved Hide resolved

numba/_dispatcher.cpp Show resolved Hide resolved

numba/_dispatcher.cpp Outdated Show resolved Hide resolved

stuartarchibald added 4 commits April 22, 2024 16:43

Add support for RAISE and PY_UNWIND events.

4f870eb

Adds support to Numba's dispatcher for the ``sys.monitoring.events`` of type ``RAISE`` and `PY_UNWIND``. Associated unit tests are added and cProfile testing is updated. Docs are updated to match.

Merge 'main' into wip/sys_monitoring

88c224a

Resolved conflicts: numba/tests/support.py

Fix RST preference for underlining of headers.

0899844

As title.

stuartarchibald marked this pull request as ready for review April 24, 2024 14:29

stuartarchibald added 3 - Ready for Review Effort - long Long size effort needed and removed 2 - In Progress labels Apr 24, 2024

Fix RST syntax in sys.monitoring docs.

3a17c60

Fixes some RST syntax and a couple of grammatical errors.

stuartarchibald mentioned this pull request Apr 24, 2024

Numba 0.60.0rc1 Checklist #9544

Open

41 tasks

sklam added the Pending BuildFarm For PRs that have been reviewed but pending a push through our buildfarm label Apr 24, 2024

sklam reviewed Apr 24, 2024

View reviewed changes

stuartarchibald added 4 - Waiting on reviewer Waiting for reviewer to respond to author and removed 3 - Ready for Review labels Apr 26, 2024

sklam reviewed Apr 26, 2024

View reviewed changes

Fix comment to read more clearly.

25535b3

As title.

sklam approved these changes Apr 26, 2024

View reviewed changes

sklam added 5 - Ready to merge Review and testing done, is ready to merge and removed 4 - Waiting on reviewer Waiting for reviewer to respond to author labels Apr 26, 2024

sklam merged commit 64e7cdb into numba:main Apr 26, 2024
21 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for `sys.monitoring` events. #9482

Add support for `sys.monitoring` events. #9482

stuartarchibald commented Mar 7, 2024

sklam left a comment

sklam commented Apr 3, 2024 •

edited

sklam commented Apr 24, 2024

sklam commented Apr 24, 2024 •

edited

sklam left a comment •

edited

gmarkall commented Apr 24, 2024

stuartarchibald commented Apr 26, 2024

stuartarchibald commented Apr 26, 2024 •

edited

sklam left a comment

sklam Apr 26, 2024

stuartarchibald Apr 26, 2024

sklam Apr 26, 2024

stuartarchibald Apr 26, 2024

sklam commented Apr 26, 2024

stuartarchibald commented Apr 26, 2024

Add support for sys.monitoring events. #9482

Add support for sys.monitoring events. #9482

Conversation

stuartarchibald commented Mar 7, 2024

sklam left a comment

Choose a reason for hiding this comment

sklam commented Apr 3, 2024 • edited

sklam commented Apr 24, 2024

sklam commented Apr 24, 2024 • edited

sklam left a comment • edited

Choose a reason for hiding this comment

gmarkall commented Apr 24, 2024

stuartarchibald commented Apr 26, 2024

stuartarchibald commented Apr 26, 2024 • edited

sklam left a comment

Choose a reason for hiding this comment

sklam Apr 26, 2024

Choose a reason for hiding this comment

stuartarchibald Apr 26, 2024

Choose a reason for hiding this comment

sklam Apr 26, 2024

Choose a reason for hiding this comment

stuartarchibald Apr 26, 2024

Choose a reason for hiding this comment

sklam commented Apr 26, 2024

stuartarchibald commented Apr 26, 2024

Add support for `sys.monitoring` events. #9482

Add support for `sys.monitoring` events. #9482

sklam commented Apr 3, 2024 •

edited

sklam commented Apr 24, 2024 •

edited

sklam left a comment •

edited

stuartarchibald commented Apr 26, 2024 •

edited