
safe recursion #1006

Open · andrewrk opened this issue May 11, 2018 · 24 comments
Labels: accepted, proposal
Projects: Safety (To do)

@andrewrk (Member) commented May 11, 2018

Accepted Proposal


If we solve #157 then we'll know at compile-time the maximum stack usage of the entire call graph.

Except for recursion. Recursion - whether direct or indirect - represents a loop in the graph.

[figure: directed_graph_cyclic.svg, a call graph whose cycle is B -> C -> E -> D -> B]

Here you can see indirect recursion B->C->E->D->B...
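
For illustration, here is a hypothetical two-function cycle in plain Zig (not the exact graph above), where neither function recurses directly but together they form a loop in the call graph:

fn isEven(n: u64) bool {
    if (n == 0) return true;
    return isOdd(n - 1); // isEven -> isOdd ...
}

fn isOdd(n: u64) bool {
    if (n == 0) return false;
    return isEven(n - 1); // ... -> isEven again: a cycle in the call graph
}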

Recursion is problematic because it can happen a number of times that is only known at runtime, whereas the stack size is determined at compile-time. If too much recursion happens, we get a stack overflow. Apart from being an annoying bug, stack overflow is a real concern for embedded software, whose developers typically must resort to heuristics to figure out just how small a stack they can get away with.

But recursion is very useful. It's an intuitive way to reason about control flow. @Hejsil wrote Zig's self-hosted parser without recursion, and it is not as straightforward as a recursive implementation would be.

So what do we do?

We have our cake and eat it, too. That's what we do.

Looking at the graph above again, we can have every function call in the graph be normal, except for D calling B. That's the only entrance into the cycle. So we can make a normal function call from D to B a compile error. To make a function call which starts a cycle, you have to use a builtin:

const new_stack = try allocator.alloc(u8, @stackSize(function));
defer allocator.free(new_stack);
_ = @recursiveCall(new_stack, function, args);

It is a compile error to use @recursiveCall when you could have used a normal function call, and it is a compile error to use a normal function call when you must use @recursiveCall. So the compiler always tells you what to do.

I've prototyped how this would be emitted to LLVM using inline assembly, and I am talking to the LLVM mailing list to find out what's the best way to accomplish this. http://lists.llvm.org/pipermail/llvm-dev/2018-May/123217.html

There's another, simpler way we can have safe recursion: limiting the amount of recursion at compile-time. This would be another builtin that would look like this:

var cycles_left: usize = undefined;
_ = @limitedRecursiveCall(999, &cycles_left, function, args);
if (cycles_left == 0) @panic("recursion limit exceeded");

This would work the same way, except that instead of allocating memory on the heap, we specify the maximum number of cycles, and the function call fails if the cycle limit is exceeded. Then, when calculating the stack upper bound, we multiply the cycle's stack size by the recursion limit.
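
For example, with purely illustrative numbers: if the functions in the cycle use at most 512 bytes of stack per pass, a limit of 999 contributes at most 999 * 512 = 511,488 bytes (just under 500 KiB) to that upper bound.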

So you then would decide at compile time how much stack space you're going to ask for from the OS.

The benefit of this idea is that it actually does not depend on any changes to LLVM and we could implement it even before #157 is finished.

Until #105 is done we would have to specify the Recursive or LimitedRecursive calling convention on the first node getting called in the cycle (B in the picture).

@andrewrk added the "proposal" label on May 11, 2018
@andrewrk added this to the 0.4.0 milestone on May 11, 2018
@bnoordhuis (Contributor) commented:

Interesting proposal. Do I understand right that both builtins work (and can only work) under a closed world assumption?

How does @recursiveCall deal with function pointers? I saw mention of computing the set of all possible values but that seems (P-)hard.

About @limitedRecursiveCall, how is the counter propagated and what decrements it?


FWIW, I've been thinking along similar lines as @limitedRecursiveCall:

fn f(@rec n: usize) void { if (n > 0) g(n - 1); }
fn g(@rec n: usize) void { if (n > 0) f(n - 1); }

The idea being that the compiler knows n is special and "trivially" reduces on every call. Doesn't give a stack upper bound (at least not without further analysis) but it does guarantee termination with direct and indirect function calls and mutual recursion.

I'm using "trivially" kind of hand-wavy because I'm not sure what the compiler can trivially infer. There is probably something clever that can be done with comptime expressions.

@bheads commented May 11, 2018

This could be really cool if you can solve all the corner cases. How would the compiler handle allocating on the stack?

http://man7.org/linux/man-pages/man3/alloca.3.html

@andrewrk (Member, Author) commented:

How would the compiler handle allocating on the stack?

@alloca in zig was removed and is not a supported use case: #225 (comment)

We will have a builtin function to annotate extra stack usage for the places we need it. For example if you use inline assembly to modify the stack pointer directly then you may need to use @annotateStackUsage(1234) with the number of bytes possibly used.

External functions could change the stack pointer in this way as well; external functions will support annotations to specify the stack usage upper bound per function.

I'm thinking about writing a tool which analyses binaries and disassembles functions to automatically compute these annotations.

@andrewrk (Member, Author) commented:

About @limitedRecursiveCall, how is the counter propagated and what decrements it?

I didn't think this through very well. I'll have to re-think it. It would work for direct recursion where we can add secret parameters, but we wouldn't want to have to change the calling convention of every function in the cycle.

@kyle-github commented:

This is a hard problem to solve generally due to NP completeness.

That said, why not start small and work at this for a while?

For instance, rather than trying various constraints like recursion limits, just have checked stack allocation only for call graphs that do not have cycles.

Over time you either chip away at the various other cases or people start writing code for embedded systems that does not use recursion.

There is very little Zig code out there today and it seems like you may be trying to solve the problem without any solid data.

Put in the checks you need to double check actual usage in test blocks.

test " ensure stack size" { assert(@maxStack() < 1024); .... }

I am typing on a phone so I have no idea how this will get formatted!

The idea is that the built-in above would return a usize, and if there were any recursion at all it would either panic or return the maximum usize value.

As more code is written and you see the patterns that are safe, then you can relax the constraints with more complicated and refined rules.

@andrewrk (Member, Author) commented:

I did a proof of concept of this and it's now available for use: @newStackCall
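
A rough usage sketch, assuming the 0.3-era shape of the builtin (the new stack buffer comes first, then the function and its arguments); the exact signature may differ from what ships:

const std = @import("std");

// Hypothetical buffer size and alignment; @newStackCall runs the callee with
// its stack frames placed inside this caller-provided slice.
var scratch_stack: [4096]u8 align(16) = undefined;

fn double(x: i32) i32 {
    return x * 2;
}

test "call a function on a caller-provided stack" {
    const result = @newStackCall(scratch_stack[0..], double, 21);
    std.debug.assert(result == 42);
}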

This proof of concept means that, even though we don't have all the features done yet for safe recursion, the ideas will work. So recursion is OK to use, it's not evil, and the compiler will help you remove the possibility of stack overflow in a future version of zig.

What's left to do for this issue:

andrewrk added a commit that referenced this issue May 13, 2018
@ghost commented Sep 6, 2018

The other day I was reading On Recursion, Continuations and Trampolines

... and looked up CPS, where it says:

As CPS and TCO eliminate the concept of an implicit function return, their combined use can eliminate the need for a run-time stack. Several compilers and interpreters for functional programming languages use this ability in novel ways.

My gut feeling is that this can't be true, otherwise everyone would just do it, or there is some other major drawback hiding somewhere.

@andrewrk (Member, Author) commented Sep 6, 2018

The drawback of TCO is that you can't always do it. See #694

We have CPS in the form of coroutines and indeed it solves the recursion issue. I plan to revisit this issue with that in mind.

@thejoshwolfe (Contributor) commented:

If you eliminate the runtime stack, you have to replace it with some other allocator. The proponents of functional languages would probably advocate storing bound variables on the heap, which doesn't seem like an improvement.

@ghost commented Sep 6, 2018

The drawback of TCO is that you can't always do it.

from https://eli.thegreenplace.net/2017/on-recursion-continuations-and-trampolines/

It turns out we can convert any function to use tail calls instead of recursion (direct or indirect) by applying the following recipe:

  1. Pass each function an extra parameter - cont.
  2. Whenever the function returns an expression that doesn't contain function calls, send that expression to the continuation cont instead.
  3. Whenever a function call occurs in a tail position, call the function with the same continuation - cont.
  4. Whenever a function call occurs in an operand (non-tail) position, instead perform this call in a new continuation that gives a name to the result and continues with the expression.

I don't claim to completely understand the issue, because there should definitely still be a catch somewhere; but just from reading it, it seems as if it would always work and solve the issue.
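
As a tiny sketch of the idea (mine, not from the article): for factorial the continuation degenerates into a plain accumulator, which puts the recursive call in tail position; the general recipe needs a first-class continuation that captures bound variables, which is presumably where the catch lives.

// Direct style: the multiply happens after the recursive call returns,
// so every level of recursion needs its own stack frame.
fn fact(n: u64) u64 {
    if (n == 0) return 1;
    return n * fact(n - 1);
}

// Continuation reduced to an accumulator: the pending "multiply by n" is
// threaded through `acc`, so the recursive call is in tail position and
// could be compiled without stack growth, given guaranteed TCO.
fn factAcc(n: u64, acc: u64) u64 {
    if (n == 0) return acc;
    return factAcc(n - 1, acc * n);
}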

@thejoshwolfe (Contributor) commented:

That all assumes that passing a continuation as a first-class function encapsulates all the bound variables of the closure. The concepts in that discussion are definitely not possible without bound variables. Bound variables have to be allocated in memory somewhere.

Functional programming literature is hard for me to understand because it all seems to ignore the reality of computers. Where is the memory allocated? That's a really important question that you can't just gloss over.

@monouser7dig I think that's the catch you're looking for.

@ghost commented Sep 6, 2018

Ok maybe that’s true, thanks for the pointers.

@hryx (Contributor) commented Jul 7, 2019

Just some thoughts over coffee ☕️

  • Should safe recursion be allowed at comptime? I suppose this would depend on:
    1. a comptime allocator (use case: comptime allocator #1291), but not out of necessity. For example, a fixed-size buffer could be used for the stacks.
    2. the ability to detect cycles in a comptime call graph. Maybe we get that for free? I'm not yet familiar with the implementation of comptime analysis/evaluation.
  • Let's take the parser as an exemplary use case of recursion. Some challenges I can foresee:
    1. Unlike AST nodes, which generally stick around once allocated, the stacks will come and go rapidly. Because of this, we might not want to use arena allocation for them.
    2. If the grammar changes, the call graph will change, and so might the point of recursion (D->B in the original example). I wonder how painful it would be to have to juggle @newStackCall around the source file in this case?

andrewrk added a commit that referenced this issue Aug 31, 2019
which heap allocate their own frames

related: #1006
andrewrk added a commit that referenced this issue Aug 31, 2019
 * `await @asyncCall` generates better code. See #3065
 * `@asyncCall` works with a real `@Frame(func)` in addition to
   a byte slice. Closes #3072
 * `@asyncCall` allows passing `{}` (a void value) as the result
   pointer, which uses the result location inside the frame.
   Closes #3068
 * support `await @asyncCall` on a non-async function. This is in
   preparation for safe recursion (#1006).
@andrewrk modified the milestones: 0.5.0, 0.6.0 on Sep 20, 2019
@andrewrk (Member, Author) commented Sep 24, 2019

The plan to solve this is to use async functions and @Frame to heap-allocate recursive function calls. Zig will force recursive calls to be async, and this will cause a compile error due to async frame dependency loop, which will show the call graph cycle. The cycle can then be broken by this pattern:

const frame = try allocator.create(@TypeOf(async func(a, b, c)));
defer allocator.destroy(frame);
await @asyncCall(frame, {}, func, a, b, c);

@metaleap (Contributor) commented Feb 3, 2020

It is a compile error to use @recursiveCall when you could have used a normal function call, and it is a compile error to use a normal function call when you must use @recursiveCall.

If that capability can indeed be reached, a more "implicit magic"-inclined language/compiler would be tempted to hide the difference again under the covers at that point and allow the same syntax for both calls. But I'm quite in favour of the explicitness; especially in larger code-bases it can nicely highlight otherwise-hidden circularities/cycles.

@andrewrk (Member, Author) commented Feb 3, 2020

@recursiveCall is not part of the accepted proposal. Accepted proposal is #1006 (comment)

@anosovic commented Apr 18, 2020

  • If recursion depth is known at compile time, or if the call graph cycle is tail-call optimized, will you still need to allocate the function call like that?
  • With this change, will it really be impossible to stack overflow at runtime? Or could you, e.g., call a bunch of different functions in a row to try to get Zig to stack overflow?

@marcthe12 commented:

The Ackermann function should be a test case, as it is very good at creating stack overflows and it is impossible to avoid recursion.
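
For reference, a minimal Zig version (no memoization; even small inputs like ackermann(3, 11) recurse very deeply, and ackermann(4, 2) is hopeless):

// The Ackermann function is not primitive recursive, so it cannot be written
// with only bounded loops, and its recursion depth explodes for tiny inputs;
// a good stress test for any stack-safety scheme.
fn ackermann(m: u64, n: u64) u64 {
    if (m == 0) return n + 1;
    if (n == 0) return ackermann(m - 1, 1);
    return ackermann(m - 1, ackermann(m, n - 1));
}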

@ghost commented Jul 30, 2020

One subtlety: when detecting cycles, we need to count cycles that involve tail calls, but those cycles are only a problem if they involve regular or stack async calls as well. Since converting any one of those calls to an allocated async call will break the cycle, an instance of recursion should be considered a property of a call graph rather than any individual call. Error reporting should list all functions in the cycle and whether they're regular calls, tail calls, or stack async calls -- due to the lazy nature of the compiler, the "entry point" of the cycle may not always be defined.

Anyway, that might be obvious, sorry if so.

@andrewrk modified the milestones: 0.7.0, 0.8.0 on Oct 9, 2020
@Jarred-Sumner (Contributor) commented Apr 25, 2021

The plan to solve this is to use async functions and @Frame to heap-allocate recursive function calls. Zig will force recursive calls to be async, and this will cause a compile error due to async frame dependency loop, which will show the call graph cycle.

Does this mean that, after this change, recursive function calls will be slower in Zig than right now due to the heap allocation? Or would there be a way to allow recursion of up to N depth like in the earlier example with @limitedRecursiveCall and then @Frame could be heap allocated in subsequent calls?

If the goal is to prevent/handle stack overflows, I wonder if a stack overflow could be a required error to handle when recursively calling a function, like error.OutOfMemory but e.g. error.StackOverflow. I honestly don't know much about compilers, though.

@andrewrk modified the milestones: 0.8.0, 0.9.0 on May 19, 2021
@matu3ba (Contributor) commented Oct 14, 2021

Since this may potentially happen along the complete program graph, Johnson's algorithm sounds like the best solution. It is described in "Finding all the elementary circuits of a directed graph" by Donald B. Johnson. Other solutions have quadratic runtime, which scales badly for bigger programs.

Somebody tested Johnson, Tarjan, etc. and found another solution was more flexible: link to paper.

K.A. Hawick and H.A. James, "Enumerating Circuits and Loops in Graphs with Self-Arcs and Multiple-Arcs", in Proceedings of FCS 2008, pp. 14-20.

This stuff should be properly benchmarked and not only used in theoretical limbo, because no paper so far has published its benchmark data and test suite.

@dumblob commented Oct 14, 2021

I know it's a bit off-topic, but the Nim folks have implemented a full-featured & seamless CPS for Nim (i.e. including support for arbitrary recursion). Of course it's based around captured variables (i.e. "closures"), but it works really well.

The only downside is it's currently somewhat slower - but there is a huge potential for optimization (as their implementation focuses on correctness and readability and not at all on optimization).

@andrewrk modified the milestones: 0.9.0, 0.10.0 on Nov 20, 2021
@andrewrk modified the milestones: 0.10.0, 0.11.0 on Apr 16, 2022
@matu3ba (Contributor) commented Nov 3, 2022

The folks from mold are discussing/planning to utilize Tarjan's algorithm or related methods/parallelization (https://ppl.stanford.edu/papers/sc13-hong.pdf) to speed up mold (rui314/mold#842), so we might be able to get inspired by their ideas (and experimental results on code).

Their relevant research question is whether the same method(s) are applicable to object code, which I suspect translates very well to source-code execution semantics (aside from comptime dead-code elimination).

@dvmason commented May 21, 2023

CPS conversion is always possible - if you have a heap and garbage collection. So that isn't really an (automatic) option for Zig. However, I am using CPS conversion in the runtime I'm building, and using explicit tail-calls between the CPS functions. I use an explicit stack and have automatic promotion to the heap where necessary.
