py: Fix handling of large number of *args, and add more tests #8472

dpgeorge · 2022-03-31T13:25:41Z

There were two issues with the existing code:

"1 << i" is computed as a 32-bit number so would overflow when executed on 64-bit machines (when mp_uint_t is 64-bit). This meant that *args beyond 32 positions would not be handled correctly.
star_args must fit as a positive small int so that it is encoded correctly in the emitted code. MP_SMALL_INT_BITS is too big because it overflows a small int by 1 bit. MP_SMALL_INT_BITS - 1 does not work because it produces a signed small int which is then sign extended when extracted (even by mp_obj_get_int_truncated), and this sign extension means that any position arg after *args is also treated as a star-arg. So the maximum bit position is MP_SMALL_INT_BITS - 2. This means that MP_OBJ_SMALL_INT_VALUE() can be used instead of mp_obj_get_int_truncated() to get the value of star_args.

These are fixed by this PR.

Also, removed an unnecessary check for kw_value == MP_OBJ_NULL.

And added a few tests.

Signed-off-by: Damien George <damien@micropython.org>

dpgeorge · 2022-03-31T13:26:08Z

@dlech this is a follow up to your work. I would appreciate a review, thanks!

dlech

Nice finds on all of the int issues. 👍

dlech · 2022-03-31T15:05:49Z

tests/basics/fun_callstardblstar.py

@@ -34,3 +34,6 @@ def f2(*args, **kwargs):


 f2(*iter(range(4)), **{'a': 1})
+
+# case where *args is not a tuple/list and takes up most of the memory allocated for **kwargs
+f2(*range(100), **{str(i): i for i in range(100)})


range is a sequence, so by itself, it is not any different than tuple/list in this usage. This is why the test above has iter(range(...)) - it hides the __len__.

This particular test crashed MicroPython prior to your changes. The point is that range(100) doesn't match the code path that tests if the type is &mp_type_list/&mp_type_tuple.

But maybe it is even better to wrap it in iter(...).

You were right, it wasn't testing the correct code path on the new version. Now updated to use iter.

dlech · 2022-03-31T15:19:26Z

tests/stress/fun_call_limit.py

+
+
+def test(n):
+    s = "f(" + ",".join(str(i) for i in range(n)) + ", *('a', 'b'), 'c', 'd')"


This might be a bit more legible using a format string (all of the quotes close to each other make me a bit cross-eyed).

Suggested change

s = "f(" + ",".join(str(i) for i in range(n)) + ", *('a', 'b'), 'c', 'd')"

s = "f({}, *('a', 'b'), 'c', 'd')".format(",".join(str(i) for i in range(n)))

updated, and changed the 1-char strings to numbers to make it easier to read

There were two issues with the existing code: 1. "1 << i" is computed as a 32-bit number so would overflow when executed on 64-bit machines (when mp_uint_t is 64-bit). This meant that *args beyond 32 positions would not be handled correctly. 2. star_args must fit as a positive small int so that it is encoded correctly in the emitted code. MP_SMALL_INT_BITS is too big because it overflows a small int by 1 bit. MP_SMALL_INT_BITS - 1 does not work because it produces a signed small int which is then sign extended when extracted (even by mp_obj_get_int_truncated), and this sign extension means that any position arg after *args is also treated as a star-arg. So the maximum bit position is MP_SMALL_INT_BITS - 2. This means that MP_OBJ_SMALL_INT_VALUE() can be used instead of mp_obj_get_int_truncated() to get the value of star_args. These issues are fixed by this commit, and a test added. Signed-off-by: Damien George <damien@micropython.org>

The values are always real objects, only the key can be MP_OBJ_NULL to indicate a **kwargs entry. Signed-off-by: Damien George <damien@micropython.org>

Signed-off-by: Damien George <damien@micropython.org>

py/emitbc: Assert that a small int fits its encoding when emitting one.

e3de723

Signed-off-by: Damien George <damien@micropython.org>

dpgeorge added the py-core label Mar 31, 2022

dpgeorge force-pushed the py-fun-call-star-args-test branch from 3cfdeb9 to 0d37cab Compare March 31, 2022 13:31

dlech reviewed Mar 31, 2022

View reviewed changes

dlech mentioned this pull request Mar 31, 2022

kwargs error in recent py/runtime commits #8473

Closed

dpgeorge added 3 commits April 1, 2022 09:20

py/runtime: Remove unnecessary check for kw_value == MP_OBJ_NULL.

40f5c74

The values are always real objects, only the key can be MP_OBJ_NULL to indicate a **kwargs entry. Signed-off-by: Damien George <damien@micropython.org>

tests/basics/fun_callstardblstar: Add test for large arg allocation.

1dbf393

Signed-off-by: Damien George <damien@micropython.org>

dpgeorge force-pushed the py-fun-call-star-args-test branch from 0d37cab to 1dbf393 Compare March 31, 2022 22:21

dlech approved these changes Apr 1, 2022

View reviewed changes

dpgeorge merged commit 1dbf393 into micropython:master Apr 1, 2022

dpgeorge deleted the py-fun-call-star-args-test branch April 1, 2022 02:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

py: Fix handling of large number of *args, and add more tests #8472

py: Fix handling of large number of *args, and add more tests #8472

dpgeorge commented Mar 31, 2022

dpgeorge commented Mar 31, 2022

dlech left a comment

dlech Mar 31, 2022 •

edited

dpgeorge Mar 31, 2022

dpgeorge Mar 31, 2022

dlech Mar 31, 2022

dpgeorge Mar 31, 2022



		def test(n):
		s = "f(" + ",".join(str(i) for i in range(n)) + ", *('a', 'b'), 'c', 'd')"

	s = "f(" + ",".join(str(i) for i in range(n)) + ", *('a', 'b'), 'c', 'd')"
	s = "f({}, *('a', 'b'), 'c', 'd')".format(",".join(str(i) for i in range(n)))

py: Fix handling of large number of *args, and add more tests #8472

py: Fix handling of large number of *args, and add more tests #8472

Conversation

dpgeorge commented Mar 31, 2022

dpgeorge commented Mar 31, 2022

dlech left a comment

Choose a reason for hiding this comment

dlech Mar 31, 2022 • edited

Choose a reason for hiding this comment

dpgeorge Mar 31, 2022

Choose a reason for hiding this comment

dpgeorge Mar 31, 2022

Choose a reason for hiding this comment

dlech Mar 31, 2022

Choose a reason for hiding this comment

dpgeorge Mar 31, 2022

Choose a reason for hiding this comment

dlech Mar 31, 2022 •

edited