py/nlr: In MP_NLR_JUMP_HEAD make top and top_ptr volatile. #5008

dpgeorge · 2019-08-15T09:06:05Z

The unix coverage port currently crashes with the toolchain and set up that I'm using (gcc 9.1.0 on Arch Linux x86-64). This is due to gcc doing further optimisations which leads to the following code being omitted (ie no corresponding machine code generated) in MP_NLR_JUMP_HEAD:

top->ret_val = val; \
MP_NLR_RESTORE_PYSTACK(top); \
*_top_ptr = top->prev; \

This is because the nlr_jump function is marked as no-return, so gcc deduces that the above code has no effect (rightfully so, I guess).

I'm not very happy with the solution presented here, rather it's more for discussion how to resolve the problem properly (and as an workaround if necessary).

There could be other variables that need to be declared volatile (eg to do with PYSTACK), and other platforms (non x86-64) may have different behaviour and require other fixes.

This really points to the importance of #4131, removal of NLR completely.

jimmo · 2019-08-15T11:29:07Z

I'm not very happy with the solution presented here...

Can you elaborate? I know better than to make any confident sounding assertion about the behavior of volatile but... preventing optimisations is what volatile is for?

Anyway, the real issue here is that GCC doesn't understand that the following asm block needs to be treated specially, so I have two (possibly non-portable) alternatives that seen to work on gcc 9.2.

Either add:

__asm__ goto (""::::a);
goto b;
a:
b:

before the asm block (I guess at the end of the MP_NLR_JUMP_HEAD macro). See https://gcc.gnu.org/onlinedocs/gcc/Extended-Asm.html 6.47.2.7 Goto Labels

Or, as that page suggests, instead add __builtin_unreachable() after the asm block (before the for(;;) ).

dpgeorge · 2019-08-15T11:49:11Z

Can you elaborate? I know better than to make any confident sounding assertion about the behavior of volatile but... preventing optimisations is what volatile is for?

The question is where to put the volatile's, what else needs to be volatile? It may be that a future compiler finds another subtle optimisation that the code didn't guard against and it breaks again. Because the NLR mechanism is not strict C (except with the setjmp/longjmp implementation) It's very compiler dependent.

See 34c04d2 for another case of compiler dependence with NLR. Also grep for volatile in py/vm.c.

Completely removing NLR is a full, robust solution to all these issues, although perhaps overkill. It could be that the right use of volatile is enough here.

And use it to replace the same pattern at the end of nlrthumb.c:nlr_jump.

Recent versions of gcc perform optimisations which can lead to the following code from the MP_NLR_JUMP_HEAD macro being omitted: top->ret_val = val; \ MP_NLR_RESTORE_PYSTACK(top); \ *_top_ptr = top->prev; \ This is noticeable (at least) in the unix coverage on x86-64 built with gcc 9.1.0. This is because the nlr_jump function is marked as no-return, so gcc deduces that the above code has no effect. Adding MP_UNREACHABLE tells the compiler that the asm code may branch elsewhere, and so it cannot optimise away the code.

dpgeorge · 2019-08-19T06:24:57Z

Ok, the builtins_unreachable is a nice solution that works well. So I made that change, see 11ecdf2

dpgeorge force-pushed the py-nlr-volatile-top-ptr branch from e07f118 to f432bbf Compare August 19, 2019 05:55

dpgeorge added 2 commits August 19, 2019 16:14

py: Introduce MP_UNREACHABLE macro to annotate unreachable code.

0c80cb3

And use it to replace the same pattern at the end of nlrthumb.c:nlr_jump.

dpgeorge force-pushed the py-nlr-volatile-top-ptr branch from f432bbf to 11ecdf2 Compare August 19, 2019 06:15

dpgeorge merged commit 11ecdf2 into micropython:master Aug 19, 2019

dhalbert mentioned this pull request Dec 12, 2019

beta.1 crashes after ctrl-d after exception in REPL adafruit/circuitpython#2378

Closed

dpgeorge deleted the py-nlr-volatile-top-ptr branch July 12, 2022 11:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

py/nlr: In MP_NLR_JUMP_HEAD make top and top_ptr volatile. #5008

py/nlr: In MP_NLR_JUMP_HEAD make top and top_ptr volatile. #5008

dpgeorge commented Aug 15, 2019

jimmo commented Aug 15, 2019

dpgeorge commented Aug 15, 2019

dpgeorge commented Aug 19, 2019

py/nlr: In MP_NLR_JUMP_HEAD make top and top_ptr volatile. #5008

py/nlr: In MP_NLR_JUMP_HEAD make top and top_ptr volatile. #5008

Conversation

dpgeorge commented Aug 15, 2019

jimmo commented Aug 15, 2019

dpgeorge commented Aug 15, 2019

dpgeorge commented Aug 19, 2019