Tail Recursion Optimization #218

CAG2Mark · 2024-03-29T08:19:28Z

Adds a pass to the IR which optimizes mutually tail-recursive and mutually tail-recursive modulo cons functions.

The full pass first rewrites tail-recursive-modulo-cons functions to be tail recursive, then optimizes the resulting tail-recursive functions.

This PR consists of the following features:

IR Field Assignment

Adds an assignment primitive to the IR. Given x: CtorApp and val: TrivialExpr, we now have the primitive

x.field = val in ...

Also modified the interpreter to no longer assume references to constructors are referentially transparent.
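A minimal sketch of why in-place field assignment means constructor references can no longer be treated as referentially transparent. The toy model below is hypothetical: `Obj`, `ctorApp` and `assignField` are illustrative names and are not taken from the PR's IR or interpreter.

```scala
import scala.collection.mutable

object FieldAssignSketch {
  // Hypothetical runtime value for a constructor application: a class name
  // plus mutable fields, so that `x.field = val in ...` can update it in place.
  final class Obj(val cls: String, val fields: mutable.Map[String, Any])

  def ctorApp(cls: String, fields: Map[String, Any]): Obj =
    new Obj(cls, mutable.Map.from(fields))

  // Models `x.field = value in body`: mutate first, then evaluate the body.
  def assignField[A](x: Obj, field: String, value: Any)(body: => A): A = {
    x.fields(field) = value
    body
  }

  // Reads the same field through an alias before and after the assignment.
  def demo(): (Any, Any) = {
    val nil = ctorApp("Nil", Map.empty)
    val cell = ctorApp("Cons", Map("h" -> 1, "t" -> nil))
    val alias = cell // a second reference to the same object
    val before = alias.fields("h")
    val after = assignField(cell, "h", 2) { alias.fields("h") }
    (before, after)
  }
}
```

Because `alias` and `cell` point at the same object, the two reads differ, so an interpreter cannot substitute a constructor reference with a fresh copy of its value.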

Tail Recursion Modulo Cons

Extends the idea in this paper to optimize strongly connected components of tail recursive functions, of which some calls may be within a constructor (guarded recursion).

Supports both modulo-cons calls and regular tail calls within the same strongly connected component.
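As an illustration of the general technique (not the PR's actual IR transformation), here is a hedged sketch of tail recursion modulo cons written by hand in Scala. The list type `Lst`, the function names, and the use of a dummy head cell are assumptions made for this example.

```scala
import scala.annotation.tailrec

object ModConsSketch {
  sealed trait Lst
  case object NilL extends Lst
  // `tail` is mutable so that a later step can fill it in.
  final class ConsL(val head: Int, var tail: Lst) extends Lst

  // Not tail recursive: the recursive call sits inside the constructor.
  def incAll(xs: Lst): Lst = xs match {
    case NilL     => NilL
    case c: ConsL => new ConsL(c.head + 1, incAll(c.tail))
  }

  // Destination-passing version: allocate the cell with a placeholder tail,
  // write it into the destination, and make the recursive call a true tail call.
  @tailrec
  private def incAllInto(xs: Lst, dest: ConsL): Unit = xs match {
    case NilL => dest.tail = NilL
    case c: ConsL =>
      val cell = new ConsL(c.head + 1, NilL) // tail is a placeholder for now
      dest.tail = cell
      incAllInto(c.tail, cell)
  }

  def incAllOpt(xs: Lst): Lst = {
    val dummy = new ConsL(0, NilL) // only its tail is used
    incAllInto(xs, dummy)
    dummy.tail
  }

  // Helpers for building and inspecting lists in the example.
  def fromList(xs: List[Int]): Lst =
    xs.foldRight(NilL: Lst)((h, t) => new ConsL(h, t))

  def toList(xs: Lst): List[Int] = {
    val buf = List.newBuilder[Int]
    var cur = xs
    var done = false
    while (!done) {
      cur match {
        case NilL     => done = true
        case c: ConsL => buf += c.head; cur = c.tail
      }
    }
    buf.result()
  }
}
```

The placeholder write followed by a later overwrite is the reason a field assignment primitive is needed for this optimization.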

Rewrite mutually tail recursive functions

Suppose we have functions:

def f(n) = e1
def g(m) = e2

where e1 and e2 tail-call f or g. These functions will be rewritten into a single function:

def f_g(branch, n, m) =
  let join j(branch, m, n) = 
    if branch == 0 then e1'
    else e2'
  j(branch, m, n)

def f(n) = f_g(0, n, dummy)
def g(m) = f_g(1, dummy, m)

where e1' and e2' are e1 and e2 with their tail calls to f and g replaced with jumps to j.

In general, strongly connected components of mutually tail recursive functions will be optimized in this way. All join points used in each strongly connected component will be rewritten and merged into the same join point.
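A hand-written Scala analogue of this rewrite may make the shape clearer. This is only a sketch under assumptions: `isEven`/`isOdd` stand in for f and g, the literal 0 plays the role of `dummy`, and Scala's `@tailrec` self-call stands in for the jump to the join point j.

```scala
import scala.annotation.tailrec

object MergedSketch {
  // Original pair: each function tail-calls the other.
  def isEven(n: Int): Boolean = if (n == 0) true else isOdd(n - 1)
  def isOdd(m: Int): Boolean = if (m == 0) false else isEven(m - 1)

  // Merged function: `branch` selects which original body runs.
  // The parameter not used by the selected branch receives a dummy value (0).
  @tailrec
  private def isEvenIsOdd(branch: Int, n: Int, m: Int): Boolean =
    if (branch == 0) {
      if (n == 0) true else isEvenIsOdd(1, 0, n - 1) // was: isOdd(n - 1)
    } else {
      if (m == 0) false else isEvenIsOdd(0, m - 1, 0) // was: isEven(m - 1)
    }

  // The original entry points become thin wrappers around the merged function.
  def isEvenOpt(n: Int): Boolean = isEvenIsOdd(0, n, 0)
  def isOddOpt(m: Int): Boolean = isEvenIsOdd(1, 0, m)
}
```

Since every call in the merged function is a self tail call, it can run as a loop, which the original mutually recursive pair cannot do without this merging step.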

CAG2Mark · 2024-03-29T08:32:23Z

TODO:

Make sure variable names in the stack frame do not clash
Make sure the merged function's name does not clash with other functions
Substitute dummy variables (may need extension to the IR). See here
Propagate @tailrec annotations to the IR.

LPTK · 2024-05-13T02:25:13Z

compiler/shared/main/scala/mlscript/compiler/optimizer/TailRecOpt.scala

@@ -66,37 +77,57 @@ class TailRecOpt(fnUid: FreshInt, classUid: FreshInt, tag: FreshInt):
      if names.contains(nme) then Some(nme)
      else None

-  // would prefer to have this inside discoverOptimizableCalls, but this makes scala think it's not tail recursive
+  // would prefer to have this inside discoverOptCalls, but scala does not support partially tail recursive functions directly


I think you could just add private val discoverOptCallsIndirect = discoverOptCalls and call that instead.
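A small sketch of how such an indirection can work in Scala, using a hypothetical tree-summing function rather than the actual `discoverOptCalls`. The assumption here is that the function has one self call in tail position and one that is not; `@tailrec` rejects the latter when it is written as a direct self call, but accepts it when routed through a function value.

```scala
import scala.annotation.tailrec

object IndirectSketch {
  sealed trait Tree
  case object Leaf extends Tree
  final case class Node(left: Tree, value: Int, right: Tree) extends Tree

  // Alias for the non-tail call: it goes through a function value,
  // so the @tailrec check does not see it as a self call.
  private val sumIndirect: (Tree, Int) => Int = sum

  @tailrec
  def sum(t: Tree, acc: Int): Int = t match {
    case Leaf => acc
    // Outer call: tail position, compiled to a loop.
    // Inner call: not in tail position, made via the alias.
    case Node(l, v, r) => sum(r, sumIndirect(l, acc + v))
  }

  // Example input used below.
  def example: Tree = Node(Node(Leaf, 1, Leaf), 2, Node(Leaf, 3, Leaf))
}
```

The inner call still consumes stack proportional to the depth of left subtrees; only the call in tail position is turned into a loop.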

LPTK · 2024-05-13T02:37:54Z

Instead of writing class True and class False in every single test case, why not make it a built-in class info? It should be a trivial change and would remove much clutter. Could also do the same with Cons/Nil and Some/None.

LPTK · 2024-05-13T07:06:12Z

compiler/shared/main/scala/mlscript/compiler/optimizer/TailRecOpt.scala

+      // Tail calls to another function in the component will be replaced with a tail call
+      // to the merged function
+      def transformDefn(defn: Defn): Defn =
+        // TODO: Figure out how to substitute variables with dummy variables.


This should be addressed or explained better.

LPTK · 2024-05-13T07:10:47Z

Steps before this can be an official PR:

Address the remaining TODOs or phrase them better so other people can address them later
Make a proper error reporting system that points to the problem
Make sure to document the nontrivial aspects/invariants of the approach
Make sure no reordering of computations can occur; you may use a purity check to determine if something can be reordered, whose current implementation can be very primitive (it will be improved later)
Document the semantics of @tailrec when ascribing calls and definitions
We should in fact have both:
- @tailcall to annotate calls, with the current semantics assigned to @tailrec
- ~~@tailrec-annotated calls that ensure the recursion back to the current function is tail-recursive~~ from the meeting discussion: just drop these for now and only accept @tailrec on definitions
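To make the purity-check item concrete, here is a very primitive sketch of the kind of check that could guard reordering. Everything in it is an assumption for illustration: the toy `Expr` type and its cases are not the PR's IR, and the rule that all calls are impure is deliberately conservative.

```scala
object PuritySketch {
  // Hypothetical miniature expression language.
  sealed trait Expr
  final case class Lit(value: Int) extends Expr
  final case class Ref(name: String) extends Expr
  final case class CtorApp(cls: String, args: List[Expr]) extends Expr
  final case class Call(fn: String, args: List[Expr]) extends Expr
  final case class AssignField(target: String, field: String, value: Expr) extends Expr

  // Conservative rule: literals, variable references, and constructor
  // applications with pure arguments are pure; every call and every
  // field assignment is treated as impure.
  def isPure(e: Expr): Boolean = e match {
    case Lit(_) | Ref(_)      => true
    case CtorApp(_, args)     => args.forall(isPure)
    case Call(_, _)           => false
    case AssignField(_, _, _) => false
  }

  // Two computations may be swapped only when both are known to be pure.
  def canReorder(a: Expr, b: Expr): Boolean = isPure(a) && isPure(b)
}
```

A check like this rejects many safe reorderings, which matches the note that the first implementation can be very primitive and refined later.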

CAG2Mark · 2024-06-01T15:05:54Z

Completed:

Improved error reporting and propagation of locations.
- Added compilation errors to the diff tests. Note that this changes the output of some old tests.
- Also, postProcess now takes a raise: Diagnostic -> Unit parameter, so extensions using postProcess can now raise errors in the same way as tests in the main project.

TODO:

Purity check
Remaining TODOs and unresolved comments
Document semantics and non-trivial aspects

After this it should be good to merge.

CAG2Mark · 2024-06-05T13:34:06Z

> Instead of writing class True and class False in every single test case, why not make it a built-in class info? It should be a trivial change and would remove much clutter. Could also do the same with Cons/Nil and Some/None.

resolved

CAG2Mark added 7 commits March 13, 2024 17:50

move changes from tailrec-opt to new branch

8b73a24

Optimize strongly connected components

93bf78f

update map braces

9b65640

small refactor

915891d

refactor

6f2ca3d

Update test infrastructure, fix code, add basic test for tailrec.

f987e02

Update test

184a5de

CAG2Mark self-assigned this Mar 29, 2024

CAG2Mark added the enhancement New feature or request label Mar 29, 2024

remove todos

3066c7f

CAG2Mark added 15 commits April 8, 2024 17:10

Add field assignment to IR

d7b4bb3

Prevent unnecessary inlining for mutually tail recursive funcs, handle variable name clash

058a929

Update test

0e3a875

Improved tests

dd50c7a

Fix bugs and cases

fc282e7

Interpret field assignment

edf7c89

add class info to field assignment

c08f31b

propagate tailrec, fix tailrec parsing issue

518ffb4

progress, fix bug

d00d072

Detect mod cons tail calls

e45412b

Refactor tail call discovery

fc473a4

change test

22dd518

add test, verify mod cons call discovery works

c1d90d9

add tests, improve formatting

16134c2

remove newlines

e9d6c54

LPTK reviewed Apr 29, 2024

compiler/shared/main/scala/mlscript/compiler/optimizer/TailRecOpt.scala (outdated)

CAG2Mark added 3 commits April 30, 2024 18:05

add tostring, improve formatting

dcd616b

remove println

8b91f91

actually handle single tail recursive

8d6e14f

CAG2Mark added 2 commits May 12, 2024 17:32

fix some tests

d8d05be

Propagate tailrec, fix join point infinite recursion bug

5887014

LPTK reviewed May 12, 2024

compiler/shared/main/scala/mlscript/compiler/ir/Builder.scala (outdated)

CAG2Mark added 2 commits May 12, 2024 22:54

refactor and check @tailrec for function definitions

78da910

update

42b264f

LPTK reviewed May 13, 2024

shared/src/test/diff/nu/NuScratch.mls (outdated)

LPTK reviewed May 13, 2024

compiler/shared/test/diff-ir/IRRec.mls

LPTK requested a review from waterlens June 1, 2024 04:03

LPTK reviewed Jun 1, 2024

compiler/shared/main/scala/mlscript/compiler/ir/IR.scala (outdated)

CAG2Mark added 7 commits June 1, 2024 16:20

make AssignField an Expr instead of a Node

2199e37

Fix unsafe partial destruction

eacf538

rename @tailrec to @tailcall for call-level annotations

214f9d3

Propagate positions of calls and @tailrec annotations

51af35f

Add error reporting to IR diff tests, report tailrec IR errors, fix bug

8ce1b35

fix tests

ee15969

fix tests

5af3621

Fix grammar

487279e

LPTK reviewed Jun 2, 2024

compiler/shared/main/scala/mlscript/compiler/optimizer/TailRecOpt.scala (outdated)

CAG2Mark added 2 commits June 5, 2024 21:22

Purity check, fix join point bug

a4efc7a

Build in true/false class in IR

0c8dec1

CAG2Mark added 4 commits June 5, 2024 21:36

restore nuscratch changes

1ea98f1

Ensure no function name clashes

abcadfe

Document, add undefined literal and address remaining todos

ea8d442

use unitlit instead of a new literal type for undefined

0125b1e
