Build Parsing.py in the limited api #5845

da-woods · 2023-11-19T13:03:35Z

It isn't actually usable because it cimports other modules but it does build without warning. Changes are mainly to string handling (because that's the nature of parsing).

Depends on #5841 and #5798 to actually build Parsing.py, but the PR itself can be applied independently.

It isn't actually usable because it cimports other modules but it does build without warning. Changes are mainly to string handling (because that's the nature of parsing). Depends on cython#5841 and cython#5798 to actually build, but the PR itself doesn't depend on those

This PR (on top of cython#5845 and its dependencies) is sufficient to compile all the built modules in the Plex folder of Cython. It's currently unknown if they actually work (since it really needs all of Cython to be built to test).

scoder · 2023-11-25T19:53:46Z

Cython/Compiler/ExprNodes.py

@@ -14193,6 +14194,9 @@ def generate_result_code(self, code):
            checks = ["(%s != Py_None)" % self.arg.py_result()] if self.arg.may_be_none() else []
            checks.append("(%s(%s) != 0)" % (test_func, self.arg.py_result()))
            code.putln("%s = %s;" % (self.result(), '&&'.join(checks)))
+            code.putln("#if !CYTHON_ASSUME_SAFE_MACROS")
+            code.putln(code.error_goto_if_neg(self.result(), self.pos))


This is unfortunate, because it may lead to an unused label. In rare cases, admittedly…

I can rewrite it as

if ((!CYTHON_ASSUME_SAFE_MACROS) && %result% < 0) goto error

so the label is never unused. That might be slightly better (provided we manage to avoid "condition always true" warnings)

Cython/Utility/Builtins.c

Cython/Utility/ObjectHandling.c

scoder · 2023-11-25T20:05:08Z

Cython/Utility/ObjectHandling.c

+#if CYTHON_ASSUME_SAFE_MACROS
 #define __Pyx_unpack_tuple2(tuple, value1, value2, is_tuple, has_known_size, decref_tuple) \
    (likely(is_tuple || PyTuple_Check(tuple)) ? \
        (likely(has_known_size || PyTuple_GET_SIZE(tuple) == 2) ? \
            __Pyx_unpack_tuple2_exact(tuple, value1, value2, decref_tuple) : \
            (__Pyx_UnpackTupleError(tuple, 2), -1)) : \
        __Pyx_unpack_tuple2_generic(tuple, value1, value2, has_known_size, decref_tuple))
+#else
+static CYTHON_INLINE int __Pyx_unpack_tuple2(
+    PyObject* tuple, PyObject** value1, PyObject** value2, int is_tuple, int has_known_size, int decref_tuple);
+#endif


It seems worth generally unfolding this lengthy macro into the inline function you wrote, and just special casing the size check.

scoder · 2023-11-25T20:15:38Z

Cython/Utility/StringTools.c

+#if CYTHON_COMPILING_IN_LIMITED_API
+    // Note that from Python 3.7, the indices of FindChar account for wraparound so no
+    // need to check the length
+    Py_ssize_t idx = PyUnicode_FindChar(unicode, character, 0, -1, 1);


You could also pass PY_SSIZE_T_MAX as end to avoid the risk of an off-by-one error for -1 (because, is that inclusive or exclusive?).

Given that there's a C-API function for this, though, I wonder if we're really still faster than that. The implementation seems quite efficient. From my expecience, PyUnicode_READ() isn't the fastest thing to use, compared to a straight memory loop.

I've taken this to mean "just use PyUnicode_FindChar in all cases". Happy to revert if this wasn't what you mean though!

Cython/Utility/StringTools.c

Cython/Utility/TypeConversion.c

Co-authored-by: scoder <stefan_ml@behnel.de>

Cython/Utility/Builtins.c

Cython/Utility/ObjectHandling.c

da-woods · 2023-12-01T10:29:24Z

Cython/Utility/StringTools.c

 static CYTHON_INLINE PyObject* __Pyx_PyUnicode_Substring(
            PyObject* text, Py_ssize_t start, Py_ssize_t stop);
+#else
+// In the limited API since 3.7
+#define __Pyx_PyUnicode_Substring(text, start, stop) PyUnicode_Substring(text, start, stop)


This isn't quite right - the __Pyx_PyUnicode_Substring function handles negative indices and PyUnicode_Substring doesn't. Will fix

…d-api-parsing

This PR (on top of #5845 and its dependencies) is sufficient to compile all the built modules in the Plex folder of Cython.

scoder · 2023-12-02T11:09:01Z

Cython/Utility/Optimize.c

            PyTuple_SET_ITEM(tuple, 0, key);
            PyTuple_SET_ITEM(tuple, 1, value);
+            #else
+            if (unlikely(PyTuple_SetItem(tuple, 0, key) < 0)) {
+                Py_DECREF(value); // we haven't set this yet


We haven't actually set key in this case either. We need to decref both (and value in the failure case below).

PyTuple_SetItem decrefs key on failure (https://github.com/python/cpython/blob/939fc6d6eab9b7ea8c244d513610dbdd556503a7/Objects/tupleobject.c#L122C10-L122C10) so that it effectively steals a reference whether or not it succeeds.

I'll add a comment because it isn't obvious though

Cython/Utility/StringTools.c

Cython/Utility/TypeConversion.c

Cython/Utility/ObjectHandling.c

Co-authored-by: scoder <stefan_ml@behnel.de>

scoder · 2023-12-02T19:44:47Z

I've cleaned up the PyUnicode_GET_LENGTH() usages in #5890

Cython/Utility/StringTools.c

Cython/Utility/Builtins.c

scoder · 2023-12-04T20:16:32Z

Nice!

da-woods added the limited api label Nov 19, 2023

da-woods force-pushed the limited-api-parsing branch from 771c50b to c7fe812 Compare November 19, 2023 13:05

da-woods mentioned this pull request Nov 19, 2023

Build all Plex/*.so modules in the Limited API #5846

Merged

Fix typo

40503c7

scoder reviewed Nov 25, 2023

View reviewed changes

scoder added enhancement Code Generation labels Nov 25, 2023

da-woods and others added 5 commits November 25, 2023 20:54

Apply suggestions from code review

4523bd9

Co-authored-by: scoder <stefan_ml@behnel.de>

Further suggestsions from review

30d7448

Remove extra "if"`

b3446fc

Assume-safe-macros -> compiling-in-limited-api

27949af

Merge branch 'master' into limited-api-parsing

b188d8a

da-woods commented Nov 29, 2023

View reviewed changes

Cython/Utility/Builtins.c Outdated Show resolved Hide resolved

Update Cython/Utility/Builtins.c

ef6da04

da-woods commented Nov 29, 2023

View reviewed changes

Cython/Utility/Builtins.c Outdated Show resolved Hide resolved

Update Cython/Utility/Builtins.c

cc74491

da-woods commented Nov 29, 2023

View reviewed changes

Cython/Utility/Builtins.c Outdated Show resolved Hide resolved

Update Cython/Utility/Builtins.c

0599937

da-woods commented Nov 29, 2023

View reviewed changes

Cython/Utility/ObjectHandling.c Outdated Show resolved Hide resolved

Update Cython/Utility/ObjectHandling.c

b3421cd

da-woods commented Dec 1, 2023

View reviewed changes

da-woods added 2 commits December 1, 2023 10:46

Fix negative indices on unicode substring

7c12b4e

Merge remote-tracking branch 'origin/limited-api-parsing' into limite…

560169c

…d-api-parsing

scoder pushed a commit that referenced this pull request Dec 2, 2023

Get all Plex/*.so modules to build in the Limited API (GH-5846)

7d0d519

This PR (on top of #5845 and its dependencies) is sufficient to compile all the built modules in the Plex folder of Cython.

scoder reviewed Dec 2, 2023

View reviewed changes

da-woods and others added 2 commits December 2, 2023 12:50

Apply suggestions from code review

c16311a

Co-authored-by: scoder <stefan_ml@behnel.de>

Comment about ref-counting

cab9762

scoder added 2 commits December 3, 2023 20:06

Merge branch 'master' into limited-api-parsing

5561a15

Merge branch 'master' into limited-api-parsing

d1979e9

scoder reviewed Dec 3, 2023

View reviewed changes

Cython/Utility/StringTools.c Outdated Show resolved Hide resolved

Cython/Utility/Builtins.c Outdated Show resolved Hide resolved

Fix error handling when creating a fronzenset.

709beee

scoder merged commit 02ddde3 into cython:master Dec 4, 2023
63 checks passed

da-woods deleted the limited-api-parsing branch December 4, 2023 20:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Build Parsing.py in the limited api #5845

Build Parsing.py in the limited api #5845

da-woods commented Nov 19, 2023

scoder Nov 25, 2023

da-woods Nov 25, 2023

scoder Nov 25, 2023

scoder Nov 25, 2023

da-woods Nov 25, 2023

da-woods Dec 1, 2023

scoder Dec 2, 2023

da-woods Dec 2, 2023

scoder commented Dec 2, 2023

scoder commented Dec 4, 2023

Build Parsing.py in the limited api #5845

Build Parsing.py in the limited api #5845

Conversation

da-woods commented Nov 19, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

scoder commented Dec 2, 2023

scoder commented Dec 4, 2023