Convert hash API to use MP_UNARY_OP_HASH instead of ad-hoc function #1250

dpgeorge · 2015-05-11T12:40:04Z

Hashing is now done using mp_unary_op function with MP_UNARY_OP_HASH as
the operator argument. Hashing for int, str and bytes still go via
fast-path in mp_unary_op since they are the most common objects which
need to be hashed.

This lead to quite a bit of code cleanup, and should be more efficient
if anything. It saves 176 bytes code space on Thumb2, and 360 bytes on
x86.

The only loss is that the error message "unhashable type" is now the
more generic "unsupported type for hash".

To address issue #1246.

@pfalcon it turned out that I could save a significant number of bytes and at the same time (hopefully) cover all your points. Please take a critical look!

Hashing is now done using mp_unary_op function with MP_UNARY_OP_HASH as the operator argument. Hashing for int, str and bytes still go via fast-path in mp_unary_op since they are the most common objects which need to be hashed. This lead to quite a bit of code cleanup, and should be more efficient if anything. It saves 176 bytes code space on Thumb2, and 360 bytes on x86. The only loss is that the error message "unhashable type" is now the more generic "unsupported type for __hash__".

dpgeorge · 2015-05-11T12:51:27Z

py/objtype.c

+        mp_obj_t val = mp_call_function_1(member[0], self_in);
+        // __hash__ must return a small int
+        if (op == MP_UNARY_OP_HASH) {
+            val = MP_OBJ_NEW_SMALL_INT(mp_obj_get_int(val));


Actually, this should be mp_obj_get_int_truncated.

coveralls · 2015-05-11T13:21:52Z

Coverage decreased (-0.01%) to 93.47% when pulling c9ae888 on hash-unary-op into a7c02c4 on master.

pfalcon · 2015-05-12T18:33:19Z

Well, I imagined more localized refactor, where mp_obj_hash() stays the same for simple types and dispatches to ->unary_op for more complex ones, but if that's what allowed to get more code size savings, looks good, thanks for going for that!

dpgeorge · 2015-05-12T21:41:03Z

It was natural to put small-int hash in unary op with all the other small-int unary ops there already. And then only str hashing was left as the most common one needing a fast path (and actually needed a better fast path than existing in mp_obj_hash, as per old comment in objstr.c). So mp_obj_hash became reduntant and everything worked out for the best.

dpgeorge · 2015-05-12T21:47:31Z

Merged in c2a4e4e.

@stinos you can now hopefully fix your original problem with hashing of native types.

dpgeorge reviewed May 11, 2015
View reviewed changes

dpgeorge closed this May 12, 2015

dpgeorge deleted the hash-unary-op branch May 12, 2015 21:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Convert hash API to use MP_UNARY_OP_HASH instead of ad-hoc function #1250

Convert hash API to use MP_UNARY_OP_HASH instead of ad-hoc function #1250

dpgeorge commented May 11, 2015

dpgeorge May 11, 2015

pfalcon May 12, 2015

coveralls commented May 11, 2015

pfalcon commented May 12, 2015

dpgeorge commented May 12, 2015

dpgeorge commented May 12, 2015

Convert hash API to use MP_UNARY_OP_HASH instead of ad-hoc function #1250

Convert hash API to use MP_UNARY_OP_HASH instead of ad-hoc function #1250

Conversation

dpgeorge commented May 11, 2015

dpgeorge May 11, 2015

Choose a reason for hiding this comment

pfalcon May 12, 2015

Choose a reason for hiding this comment

coveralls commented May 11, 2015

pfalcon commented May 12, 2015

dpgeorge commented May 12, 2015

dpgeorge commented May 12, 2015