[mypyc] Add primitive for bytes.decode #10951

97littleleaf11 · 2021-08-09T09:08:16Z

Description

JukkaL · 2021-08-09T14:43:40Z

mypyc/lib-rt/str_ops.c

+}
+
+#define PyUnicode_UTF8(op)                              \


It's better not to use a macro here, as it buys us little and makes the code harder to understand and maintain. If you need to share code, use an inline function.

JukkaL · 2021-08-09T14:44:35Z

mypyc/lib-rt/str_ops.c

+#define PyUnicode_UTF8(op)                              \
+    (assert(PyUnicode_Check(op)),                       \
+     assert(PyUnicode_IS_READY(op)),                    \


This seems to crash the process in case the string object is not ready, which is not the right thing to do? Instead, you should probably use PyUnicode_READY and return NULL if it fails.

JukkaL · 2021-08-09T14:45:53Z

mypyc/primitives/str_ops.py

+decode_types: List[RType] = [bytes_rprimitive, str_rprimitive, str_rprimitive]
+decode_constants: List[Tuple[int, RType]] = [(0, pointer_rprimitive),
+                                             (0, pointer_rprimitive)]
+for i in range(len(decode_types)):


The for loop doesn't save a lot of code and makes this harder to understand. I'd prefer just having three normal primitive definitions.

97littleleaf11 · 2021-08-10T15:01:48Z

mypyc/primitives/str_ops.py


 str_ssize_t_size_op = custom_op(
    arg_types=[str_rprimitive],
    return_type=c_pyssize_t_rprimitive,
    c_function_name='CPyStr_Size_size_t',
    error_kind=ERR_NEG_INT)
+


We prefer three separate helper functions.

JukkaL · 2021-08-11T09:36:39Z

mypyc/lib-rt/str_ops.c

+}
+
+PyObject* CPy_DecodeWithErrors(PyObject *obj, PyObject *encoding, PyObject *errors) {


As discussed offline, I'd prefer to have three primitive definitions that all call a single C function, i.e. we'd only duplicate the method_op declarations, not the C implementations. Some of the primitives can pass fixed extra arguments, but written out explicitly rather than generated in a for loop, for clarity. Sorry for the extra back and forth!
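A minimal sketch of that layout, in plain Python rather than the real mypyc registry. The names `decode_helper`, `declare`, `PRIMITIVES` and `call_decode` are hypothetical stand-ins invented for illustration; only the overall shape (three explicit declarations, one shared implementation, fixed trailing arguments filled in per declaration) follows the comment above. `None` plays the role of the null pointer constants in the diff.

```python
from typing import Callable, Dict, Optional, Tuple


def decode_helper(data: bytes, encoding: Optional[str], errors: Optional[str]) -> str:
    """Stand-in for the single shared C helper; None means 'not supplied'."""
    return data.decode(encoding or 'utf-8', errors or 'strict')


# Hypothetical registry keyed by how many arguments the call site wrote.
PRIMITIVES: Dict[int, Tuple[Callable[..., str], Tuple[Optional[str], ...]]] = {}


def declare(n_args: int, impl: Callable[..., str],
            fixed_extra: Tuple[Optional[str], ...] = ()) -> None:
    """Record one declaration: the implementation plus its fixed trailing arguments."""
    PRIMITIVES[n_args] = (impl, fixed_extra)


# Three declarations written out one by one, with no loop.
# bytes.decode()
declare(0, decode_helper, fixed_extra=(None, None))
# bytes.decode(encoding)
declare(1, decode_helper, fixed_extra=(None,))
# bytes.decode(encoding, errors)
declare(2, decode_helper)


def call_decode(data: bytes, *args: str) -> str:
    """Dispatch on argument count, appending the fixed extras for that declaration."""
    impl, extra = PRIMITIVES[len(args)]
    return impl(data, *args, *extra)
```

Each declaration is a few repeated lines, but a reader can see at a glance which call shape maps to which fixed arguments, and the decoding logic still lives in one place.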

JukkaL

Thanks for the updates!

JukkaL · 2021-08-11T10:28:22Z

mypyc/test-data/run-strings.test

@@ -598,6 +600,7 @@ def test_decode() -> None:
    assert b.decode('gbk') == '浣犲ソ'
    assert b.decode('latin1') == 'ä½\xa0å¥½'

+[case testEncode]


Here it would be reasonable to keep them in a single test case, since they are related (all string operations) and having fewer run tests speeds up the test suite. But it's not a big deal. The original name (testChrOrdEncodeDecode) wasn't the clearest, though. Also, we already have testStringOps, which would cover these as well. I'm not sure what the best way to organize our tests is.
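For illustration, a combined check covering both directions could look roughly like the sketch below. The function name `test_encode_decode` is hypothetical, and the value of `b` is an assumption: the diff excerpt does not show how `b` is defined, so the sketch uses the UTF-8 bytes of '你好', which is consistent with the latin1 assertion in the hunk.

```python
def test_encode_decode() -> None:
    # Assumed input: the excerpt does not show the original definition of b.
    b = '你好'.encode('utf8')
    assert b == b'\xe4\xbd\xa0\xe5\xa5\xbd'

    # decode() with zero, one and two arguments.
    assert b.decode() == '你好'
    assert b.decode('utf8') == '你好'
    assert b.decode('latin1') == 'ä½\xa0å¥½'
    assert b'\xff'.decode('utf8', 'ignore') == ''
    assert b'\xff'.decode('utf8', 'replace') == '\ufffd'

    # encode() round trip in the same case, since the operations are related.
    assert 'ä½\xa0å¥½'.encode('latin1') == b


test_encode_decode()
```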

97littleleaf11 added 3 commits August 9, 2021 16:51

Support bytes.decode

17a22bd

Add tests

8089348

Support optional parameters

cd4c5c5

97littleleaf11 marked this pull request as ready for review August 9, 2021 13:45

JukkaL reviewed Aug 9, 2021


Flat

5aea0e9

97littleleaf11 commented Aug 10, 2021


JukkaL reviewed Aug 11, 2021


97littleleaf11 added 2 commits August 11, 2021 17:48

Merge from master

31325fd

Merge C helper funcs

75015bf

JukkaL approved these changes Aug 11, 2021


JukkaL merged commit 5adb0a0 into python:master Aug 11, 2021

97littleleaf11 deleted the decode branch February 22, 2022 08:04
