Generic tail recursion #3020

Merged: 11 commits, Sep 6, 2019

Conversation

jschobben

This re-implements tail recursion of functions: any self-call in tail position now benefits from tail-call elimination, rather than only calls in functions that follow a very specific structure. In particular, let/assert/echo and nested ternary operators can now be used freely.

The decision to use tail recursion is now made at run time rather than at parse time as before, and per function call rather than per function declaration.

I tried to separate it into a few commits. Here's a summary of the changes:

  • Remove FunctionTailRecursion and its factory method
  • For a few Expression subclasses, factor out a method evaluateStep from evaluate; it returns the next Expression instead of evaluating it
  • Use a nested while loop for function evaluation; the inner loop iteratively follows a single execution path where possible, and falls back to a regular recursive call otherwise (for binary operators, etc.)
  • Add a few tests
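
A minimal sketch of the evaluateStep/trampoline idea described above (hypothetical, simplified names; the real Expression hierarchy and Context handling in the OpenSCAD codebase are much richer):

```cpp
// Stripped-down sketch of the nested-loop evaluation: each evaluateStep
// either produces a final value (and returns nullptr) or hands back the
// *next* expression to follow, so the evaluator loops instead of growing
// the C++ stack on every self-call.
struct Expression {
    virtual ~Expression() = default;
    // Returns the next expression to evaluate, or nullptr when value is final.
    virtual const Expression* evaluateStep(long &value) const = 0;
};

struct Literal final : Expression {
    long v;
    explicit Literal(long v) : v(v) {}
    const Expression* evaluateStep(long &value) const override {
        value = v;
        return nullptr;  // terminal step
    }
};

// Stands in for a self-call in tail position: it updates its "context"
// (here just a counter) and returns itself, or the base-case expression.
struct TailSelfCall final : Expression {
    mutable long n;
    const Expression *baseCase;
    TailSelfCall(long n, const Expression *baseCase) : n(n), baseCase(baseCase) {}
    const Expression* evaluateStep(long &value) const override {
        if (n == 0) return baseCase;  // tail position: follow that branch
        --n;                          // the "call" just rebinds the argument
        return this;                  // and the loop continues with us
    }
};

// The evaluator's inner loop: follow a single execution path iteratively.
long evaluate(const Expression *e) {
    long value = 0;
    while (e) e = e->evaluateStep(value);
    return value;
}
```

With a `Literal base{42}` and a `TailSelfCall call{1000000, &base}`, `evaluate(&call)` runs in constant stack space, no matter how deep the "recursion" goes.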

A word on performance: I benchmarked using this silly example:
function speed(n=1000000) = n == 0 ? 42 : speed(n-1);
Initial results weren't too good; it was almost twice as slow as master, so I had to hunt down a few optimizations:

  • A custom function to prepare the new tail-call context, caching some data: FunctionCall::prepareTailCallContext
  • A typeid check + static_cast instead of a dynamic_cast
  • Reduced Context creation overhead (not in this PR, though; see PR Use shared_ptr for Context::document_path #3019)

Now the speed is more or less the same as on master.
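
For intuition (this is not code from the PR): once the self-call in tail position is executed iteratively, the benchmark above behaves like this constant-stack C++ loop:

```cpp
// Rough C++ equivalent of what tail-call elimination makes of the
// speed() benchmark: the tail self-call becomes a loop-variable update.
long speed(long n = 1000000) {
    while (n != 0) n -= 1;  // speed(n-1) in tail position -> loop step
    return 42;              // the n == 0 branch
}
```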

@t-paul (Member) left a comment:
This is great. Thanks for splitting up the patch, that helped a lot. I did some additional testing and that looks good too. @kintel, can you have an additional look at the context handling? I don't see any issues, but I'm still not 100% sure I know about all the details 😄.

@t-paul requested a review from kintel, August 25, 2019 18:44
@kintel (Member) commented Sep 6, 2019

LGTM!
It's a bit surprising that the typeid check + static_cast is actually faster than dynamic_pointer_cast, considering the latter even messes with the reference count.
(At some point we should consider adding performance regression tests, as I fear code like this may get refactored to look cleaner in the future.)

@kintel merged commit 48c794f into openscad:master, Sep 6, 2019
@nophead (Member) commented Sep 6, 2019 via email

@kintel (Member) commented Sep 6, 2019

Yeah, I guess dynamic_pointer_cast would need to mess with the ref count as well : /

@jschobben (Author) commented:
Thanks for merging this!

The typeid + static_cast being faster is a combination of two things: the (cheap) typeid check prevents doing more than one cast per loop, and dynamic_cast is slightly slower than static_cast. I'd say it's mostly the typeid check that matters here.
The downside is that typeid is not polymorphic: it only detects an exact match of the type.
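
A sketch of the pattern being discussed (hypothetical types; the real code dispatches over OpenSCAD's Expression subclasses): one cheap exact-type comparison gates a static_cast, so at most one cast runs per loop iteration, but a subclass would not be matched:

```cpp
#include <typeinfo>

struct Expression { virtual ~Expression() = default; };
struct TernaryOp : Expression {
    const Expression *chosenBranch = nullptr;
};
struct FancyTernaryOp : TernaryOp {};  // would NOT pass the typeid test

// typeid(*e) compares the exact dynamic type, so unlike dynamic_cast this
// never matches subclasses -- the non-polymorphic trade-off noted above.
const Expression* stepIfTernary(const Expression *e) {
    if (typeid(*e) == typeid(TernaryOp)) {
        return static_cast<const TernaryOp*>(e)->chosenBranch;
    }
    return nullptr;  // caller falls back to a regular recursive evaluate
}
```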

I was also thinking about how to add a performance test, since results depend on the hardware used. It will probably just be a matter of comparing the PR's performance against master, or something similar.

@t-paul (Member) commented Sep 7, 2019

I recently watched a talk discussing that topic (in the context of adding pattern matching to C++): https://www.youtube.com/watch?v=nOwUzFYt0NQ&feature=youtu.be&t=188
