gh-141370: Fix undefined behavior when using Py_ABS() #141548

serhiy-storchaka · 2025-11-14T09:25:37Z

Issue: Undefined behavior in Py_ABS() macro with LLONG_MIN #141370

skirpichev · 2025-11-14T12:44:47Z

Python/marshal.c

+        uint64_t abs_value = long_export.value < -INT64_MAX
+            ? (uint64_t)INT64_MAX + (uint64_t)-(long_export.value + INT64_MAX)
+            : (uint64_t)Py_ABS(long_export.value);


Can't we assume two's complement representation of integers and use simpler workaround? As this fix span several places, I would prefer if it could be refactored to a separate macro.

I.e. something like:

#define My_ABS(x, MAX) \ ((x) < 0 ? ((x) >= -MAX ? -(x) : (U##MAX >> 1) + 1) : (x))

Nowadays two's complement is only case permitted by the C23 and is a de-facto standard. Do you have some system in mind on which we should care? I'm pretty sure all Tier 1-3 platforms fit to this picture.

All Tier 1-3 platforms perhaps fine with the current code. We change the code because we cannot be sure that it work on all exotic platforms and that future versions of the C compiler will not interpret an undefined behavior in interesting way.

The current gcc does not generate additions and substractions for this PR. It generates something more smart, although not so smart as for Py_ABS(). Although for your proposition it generates more complex code.

Details

#include <limits.h> #define Py_ABS(x) ((x) < 0 ? -(x) : (x)) #define My_ABS(x, MAX) \ ((x) < 0 ? ((x) >= -MAX ? -(x) : (U##MAX >> 1) + 1) : (x)) unsigned int intabs0(int x) { return (unsigned int)Py_ABS(x); } unsigned int intabs(int x) { return x < -INT_MAX ? (unsigned int)INT_MAX + (unsigned int)-(x + INT_MAX) : (unsigned int)Py_ABS(x); } unsigned int intabs2(int x) { return My_ABS(x, INT_MAX); } unsigned long longabs0(long x) { return (unsigned long)Py_ABS(x); } unsigned long longabs(long x) { return x < -LONG_MAX ? (unsigned long)LONG_MAX + (unsigned long)-(x + LONG_MAX) : (unsigned long)Py_ABS(x); } unsigned long longabs2(long x) { return My_ABS(x, LONG_MAX); }

Anyway, the performance of this code is not critical (if there is any difference).

All Tier 1-3 platforms perhaps fine with the current code.

(Yes, and I suspect tests might be redundant in fact.)

Although for your proposition it generates more complex code.

A different version:

#define __GMP_CAST(type, expr) ((type) (expr)) #define NEG_CAST(T,x) (- (__GMP_CAST (T, (x) + 1) - 1)) #define ABS_CAST(T,x) ((x) >= 0 ? __GMP_CAST (T, x) : NEG_CAST (T, x))

I found same approach in the GNU GMP, so just copied NIH code here.

Anyway, the performance of this code is not critical (if there is any difference).

Your solution looks ok for me. But in any case we should factor it to some macro.

For this version gcc generates exactly the same code as for the current code. But it is now free from undefined behavior.

Co-authored-by: Sergey B Kirpichev <skirpichev@gmail.com>

…ABC-UB

skirpichev

LGTM

This version triggers a warning on M$ compiler, but this is probably ok.

picnixz · 2025-11-18T09:02:11Z

Include/pymacro.h

 #define Py_MAX(x, y) (((x) > (y)) ? (x) : (y))

 /* Absolute value of the number x */
+#define _Py_ABS_CAST(T,x) ((x) >= 0 ? ((T) (x)) : (- (((T) ((x) + 1)) - 1)))


Suggested change

#define _Py_ABS_CAST(T,x) ((x) >= 0 ? ((T) (x)) : (- (((T) ((x) + 1)) - 1)))

#define _Py_ABS_CAST(T, x) ((x) >= 0 ? ((T) (x)) : (- (((T) ((x) + 1)) - 1)))

For posterity:

_Py_ABS_CAST(uint8_t, (int8_t)-128) == 128

The (T)((x) + 1) - 1 is:

( ( (uint8_t) ( (-128) + 1 // -127 (still int8_t) ) // 129 (2's complement on 8 bits) ) - 1 // 128 (as an uint8_t) )

Since $-128\bmod{256} = 128$, we are good. For another number say -5 we have:

( ( (uint8_t) ( (-5) + 1 // -4 (still int8_t) ) // 252 (2's complement on 8 bits) ) - 1 // 251 (as an uint8_t) )

And now $-251 \bmod{256} = 5$ and we're good.

Python/marshal.c

pythongh-141370: Fix undefined behavior when using Py_ABS()

1acabab

serhiy-storchaka added skip news needs backport to 3.13 bugs and security fixes needs backport to 3.14 bugs and security fixes labels Nov 14, 2025

bedevere-app bot added the awaiting core review label Nov 14, 2025

bedevere-app bot mentioned this pull request Nov 14, 2025

Undefined behavior in Py_ABS() macro with LLONG_MIN #141370

Open

serhiy-storchaka requested a review from skirpichev November 14, 2025 10:07

skirpichev reviewed Nov 14, 2025

View reviewed changes

Merge branch 'main' into Py_ABC-UB

445bcc8

skirpichev self-requested a review November 15, 2025 02:00

serhiy-storchaka and others added 3 commits November 17, 2025 20:08

Merge branch 'main' into Py_ABC-UB

887853d

Use more efficient implementation.

df24c4f

Co-authored-by: Sergey B Kirpichev <skirpichev@gmail.com>

Merge remote-tracking branch 'refs/remotes/origin/Py_ABC-UB' into Py_…

a584478

…ABC-UB

skirpichev approved these changes Nov 17, 2025

View reviewed changes

skirpichev requested a review from picnixz November 17, 2025 23:01

picnixz approved these changes Nov 18, 2025

View reviewed changes

bedevere-app bot added awaiting merge and removed awaiting core review labels Nov 18, 2025

Try other code. Add a comment.

b292561

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

gh-141370: Fix undefined behavior when using Py_ABS() #141548

gh-141370: Fix undefined behavior when using Py_ABS() #141548

Uh oh!

serhiy-storchaka commented Nov 14, 2025 •

edited by bedevere-app bot

Loading

Uh oh!

skirpichev Nov 14, 2025

Uh oh!

serhiy-storchaka Nov 14, 2025

Uh oh!

skirpichev Nov 15, 2025

Uh oh!

serhiy-storchaka Nov 17, 2025

Uh oh!

skirpichev left a comment

Uh oh!

picnixz Nov 18, 2025

Uh oh!

picnixz Nov 18, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	#define _Py_ABS_CAST(T,x) ((x) >= 0 ? ((T) (x)) : (- (((T) ((x) + 1)) - 1)))
	#define _Py_ABS_CAST(T, x) ((x) >= 0 ? ((T) (x)) : (- (((T) ((x) + 1)) - 1)))

Uh oh!

gh-141370: Fix undefined behavior when using Py_ABS() #141548

Are you sure you want to change the base?

gh-141370: Fix undefined behavior when using Py_ABS() #141548

Uh oh!

Conversation

serhiy-storchaka commented Nov 14, 2025 • edited by bedevere-app bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

skirpichev Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

serhiy-storchaka Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

skirpichev Nov 15, 2025

Choose a reason for hiding this comment

Uh oh!

serhiy-storchaka Nov 17, 2025

Choose a reason for hiding this comment

Uh oh!

skirpichev left a comment

Choose a reason for hiding this comment

Uh oh!

picnixz Nov 18, 2025

Choose a reason for hiding this comment

Uh oh!

picnixz Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

serhiy-storchaka commented Nov 14, 2025 •

edited by bedevere-app bot

Loading

picnixz Nov 18, 2025 •

edited

Loading