Automatically apply `from` conversion when runtime argument check fails #7009

JaroslavTulach · 2023-06-10T06:18:38Z

Pull Request Description

Enhancing ReadArgumentNode to apply from conversions when the argument type check fails. For example there is no need to type 42.to_text when requiring a Text argument. The conversion will happen automatically.

Details

#6790 has introduced ability to opt-in and check type of arguments where needed (from a standard library perspective). These checks are slowly appearing in the Enso code base. They help us catch typing errors as soon as possible and report errors to users in a meaningful way. User can then change their code and apply necessary conversions themselves. However there is no need for such manual conversions!

We can automatically apply from conversion when the argument type doesn't fit the ascribed type and such a conversion is available in the module scope. We can do such conversions effectively - almost at no performance cost as benchmarks added by this PR demonstrate.

Non-goal

Changing the way conversions are imported into scope of a module isn't a goal for this PR. The way conversions are found remains unchanged by this PR. If a from conversion was found "manually" before, it is going to be found "automatically" now. Designing new ways for conversions to be located is out of scope of this PR. However as #7271 demonstrates, the current behavior may actually be good enough.

Future Work

These automatic conversions work great with runtime type classes. Implementing such runtime type classes with automatic conversions from any value is going to be possible when this PR is integrated.

It is (almost certainly) possible to add support for & operator without any performance penalty - e.g. one could write (a: Ord & Eq) and the type system would verify that the argument a supports all the required conversions - to Ord as well as to Eq. Implementation of this is out of scope of this PR.

Checklist

Please ensure that the following checklist has been satisfied before submitting the PR:

The documentation has been updated, if necessary.
All code follows the
Scala,
Java,
All code has been tested:
- Unit tests have been written where possible.
- Ensure no regression in ListBenchmarks.mapOverList benchmark

GregoryTravis · 2023-06-12T15:43:44Z

I am concerned that this might result in hard-to-find bugs. If you write a bad method call that should fail, and it succeeds because of a conversion you weren't aware about, you'll get unexpected behavior that will be tricky to find, because it is made possible by a call to from that you did not explicitly make.

On the other hand, I am not familiar with the from conversions that exist in the code, so I don't know how much of a problem this will actually be in practice.

xvcgreg · 2023-06-26T13:38:15Z

@JaroslavTulach what is the status of this issue?

JaroslavTulach · 2023-06-28T05:40:33Z

@JaroslavTulach what is the status of this issue?

It is a draft.

…ntConversion

JaroslavTulach · 2023-07-05T06:56:08Z

Running benchmarks to get official results for the new ListBenchmarks. The results are as expected:

        <case>
            <label>org.enso.interpreter.bench.benchmarks.semantic.ListBenchmarks.mapAnyOverList</label>
            <scores>
                <score>147.31108088695652</score>
            </scores>
        </case>
        <case>
            <label>org.enso.interpreter.bench.benchmarks.semantic.ListBenchmarks.mapConvOverList</label>
            <scores>
                <score>84287.3115472</score>
            </scores>
        </case>
        <case>
            <label>org.enso.interpreter.bench.benchmarks.semantic.ListBenchmarks.mapIntegerOverList</label>
            <scores>
                <score>145.74264644395203</score>
            </scores>
        </case>
        <case>
            <label>org.enso.interpreter.bench.benchmarks.semantic.ListBenchmarks.mapMultiOverList</label>
            <scores>
                <score>144.37380159563025</score>
            </scores>
        </case>
        <case>
            <label>org.enso.interpreter.bench.benchmarks.semantic.ListBenchmarks.mapOverList</label>
            <scores>
                <score>141.27027240281691</score>
            </scores>
        </case>
        <case>
            <label>org.enso.interpreter.bench.benchmarks.semantic.ListBenchmarks.mapVOverList</label>
            <scores>
                <score>141.1073415324603</score>
            </scores>
        </case>

all relevant benchmarks need 140-150ms per operation, but mapConvOverList which needs more than a minute (84287ms).

JaroslavTulach · 2023-07-07T06:33:31Z

The case with conversion shall now be faster since ec5a3b3 - running benchmarks to verify. mapConvOverList is now as fast as regular runs without any conversions:

        <case>
            <label>org.enso.interpreter.bench.benchmarks.semantic.ListBenchmarks.mapConvOverList</label>
            <scores>
                <score>144.07712597041146</score>
            </scores>
        </case>

there seems to be a slowdown in plain

        <case>
            <label>org.enso.interpreter.bench.benchmarks.semantic.ListBenchmarks.mapOverList</label>
            <scores>
                <score>150.43897446282224</score>
            </scores>
        </case>

and it needs to be investigated.

…and uncached conversion test.

JaroslavTulach · 2023-07-09T09:18:11Z

Benchmarking most recent changes at benchmark run.... existing benchmarks seem to be fine...

wdanilo

This is absolutely AMAZING! It also means that we are closer and closer to the .to method, right? Also, would it be possible to notify GUI where such conversion was applied? We would then display that using a special connection style (e.g. dashed one).

@GregoryTravis your concerns are very valid. Every implicit thing has the downside of code being more complex to trace. That's why it's important to visually indicate all such places - both on the graph and in the text editor. Anyway, such auto-conversions will help make the graph clean and fast to work with and I believe they can become one of the most powerful features of Enso.

JaroslavTulach · 2023-07-10T05:34:48Z

@GregoryTravis your concerns are very valid.
Every implicit thing has the downside of code being more complex to trace.
That's why it's important to visually indicate all such places ... on the graph

+1

Also, would it be possible to notify GUI where such conversion was applied?

Yes, we are able to trace when a conversion happens using GraalVM Insight (a toy I love). Imagine following convert.enso program:

from Standard.Base import all

meaning (v : Text) = "Meaning of world " + v

main = meaning 42

Text.from (that : Number) = that.to_text

which will do the automatic conversion when passing number 42 to the meaning function as it requires Text. You can run it with this PR as:

enso$ ./built-distribution/enso-engine-0.0.0-dev-linux-amd64/enso-0.0.0-dev/bin/enso --run convert.enso 
'Meaning of world 42'

Let's now use the GraalVM's Insight capabilities by creating a tracing script to print out a message whenever a from function is invoked. Create trace.js:

insight.on('return', (ctx, frame) => {
    let that = frame.that;
    let to = ctx.returnValue(frame);
    print(`Converting via ${ctx.name} at ${ctx.source.name}:${ctx.line}`
       + ` from ${that}:${typeof that} to ${to}:${typeof to}`);
}, {
    rootNameFilter : '.*from',
    roots : true
});

and then execute with additional JAVA_OPTS parameters:

enso$ JAVA_OPTS=-Dpolyglot.insight=trace.js ./built-distribution/enso-engine-0.0.0-dev-linux-amd64/enso-0.0.0-dev/bin/enso --run convert.enso 
Converting via convert::Standard.Base.Data.Text.Text::from at convert.enso:7 from 42:number to 42:string
'Meaning of world 42'

and we can see that the from conversion has been observed. As GraalVM Insight offers similar capabilities as Enso instrumentation I am pretty sure we can trace automatic conversions at runtime...

We would then display that using a special connection style (e.g. dashed one).

...we just need to design a way to provide such information to the IDE and then display it. That will need some future work, CCing @4e6.

That's why it's important to visually indicate all such places - both on the graph and in the text editor.

I am sure we can do it in the graph - e.g. when the program is running - easily. Alas, I don't know how to do the same in the text editor - e.g. just by static analysis.

JaroslavTulach · 2023-07-10T05:52:17Z

It also means that we are closer and closer to the .to method, right?

This PR proves there is no overhead when applying the conversions - e.g. that the .to method can be implemented effectively. On the other hand, this PR also removes the need to use the .to conversion in certain situations. Rather than writing in the previous example:

meaning (42.to Text)

one can simply call meaning 42 and benefit from the automatic conversion of ascribed runtime arguments.

hubertp

nits

hubertp · 2023-07-11T08:12:00Z

...runtime/src/main/java/org/enso/interpreter/node/callable/argument/ReadArgumentCheckNode.java

+  }
+
+  static ReadArgumentCheckNode build(String argumentName, Type[] expectedTypes) {
+    if (expectedTypes == null || expectedTypes.length == 0) {


That's a bit unusual. IIRC for all nodes we assume that build will return the node, no exceptions?

hubertp · 2023-07-11T08:25:46Z

...runtime/src/main/java/org/enso/interpreter/node/callable/argument/ReadArgumentCheckNode.java

+  }
+
+  private static boolean isAllFitValue(Object v) {
+    return v instanceof DataflowError


speaking of DataflowError logic being distributed ;)

hubertp · 2023-07-11T08:27:46Z

...runtime/src/main/java/org/enso/interpreter/node/callable/argument/ReadArgumentCheckNode.java

+  }
+
+  ApplicationNode findConversionNode(Type from) {
+    var convAndType = findConversion(from);


from can be null I think, but I don't think we handle that nicely in findConversion, do we?

radeusgd · 2023-07-11T12:00:56Z

We would then display that using a special connection style (e.g. dashed one).

...we just need to design a way to provide such information to the IDE and then display it. That will need some future work

Just please note that in many cases we put parameters directly on a node, and this conversion may also happen directly there and there will be no connection to stylize.

i.e. if we have

y = my_operation 42

instead of

x = 42
y = my_operation x

there is no connection between 42 and my_operation to display in the first case as the parameter 42 is directly in the node.

Maybe we should consider to then add some indication for these cases as well? Possibly some dashed frame over the parameter, which when hovered says which conversion has been applied?

) Engine benchmark downloader tool [bench_download.py](https://github.com/enso-org/enso/blob/develop/tools/performance/engine-benchmarks/bench_download.py) can now plot multiple branches in the same charts. The tooltips on the charts, displayed when you hover over some data point, are now broken for some branches. I have no idea why, it might be a technical limitation of the Google charts library. Nevertheless, I have also extended the *selection info* section, that is displayed under every chart, where one can see all the important information, once you click on some data point. Options added: - `--branches` specifies list of branches for which all the benchmark data points will be in the plots. The default is `develop` only. - `--labels` that can limit the number of generated charts. This PR also **deprecates** the `--compare` option. There is no reason to keep that option around since we can now plot all the branches in the same charts. An example for plotting benchmarks for PR #7009 with ``` python bench_download.py -v --since 2023-07-01 --until 2023-07-11 --branches develop wip/jtulach/ArgumentConversion ``` is: ![image](https://github.com/enso-org/enso/assets/14013887/62010850-79d2-4c6c-92bc-9627bb4c6a0b) # Important Notes - Deprecate `--compare` option - Add `--labels` option - Add `--branches` option

JaroslavTulach added 2 commits June 10, 2023 08:12

Require automatic conversion of Integer to V

ce60c11

Automatically apply conversion when runtime argument check fails

cc3805e

JaroslavTulach requested a review from jdunkerley June 10, 2023 06:18

JaroslavTulach requested a review from 4e6 as a code owner June 10, 2023 06:18

JaroslavTulach self-assigned this Jun 10, 2023

JaroslavTulach requested review from hubertp and Akirathan as code owners June 10, 2023 06:18

JaroslavTulach marked this pull request as draft June 10, 2023 06:18

enso-bot bot mentioned this pull request Jun 12, 2023

@widgets for extension functions of Table may not be found #6955

Closed

JaroslavTulach added 4 commits July 5, 2023 07:29

Merge remote-tracking branch 'origin/develop' into wip/jtulach/Argume…

96c86de

…ntConversion

Adding benchmarks for various signature checks

edae8d2

Support for different zero. Checking direct conversion to type V

2069155

Benchmark showing that automatic argument conversions are currently slow

909da8f

JaroslavTulach added 2 commits July 7, 2023 07:09

Splitting ascribed argument read type check into its own node

f041478

Speeding up the case with conversion

ec5a3b3

enso-bot bot mentioned this pull request Jul 9, 2023

Dataflow error in a default of a type-ascribed argument results in a type error instead of being propagated #7137

Closed

JaroslavTulach added 3 commits July 9, 2023 08:39

findConversionNode can never be part of compilation

d044d6a

Rewriting on InvalidAssumptionException. Sharing code between cached …

b7eb3b5

…and uncached conversion test.

Fixing typo

b4fa6e8

Automatic from conversions note in changelog

35bbc9e

JaroslavTulach requested review from wdanilo and farmaazon July 9, 2023 09:42

JaroslavTulach marked this pull request as ready for review July 9, 2023 09:46

wdanilo approved these changes Jul 9, 2023

View reviewed changes

farmaazon approved these changes Jul 10, 2023

View reviewed changes

Akirathan approved these changes Jul 10, 2023

View reviewed changes

jdunkerley approved these changes Jul 10, 2023

View reviewed changes

JaroslavTulach merged commit 2aeef4d into develop Jul 11, 2023

JaroslavTulach deleted the wip/jtulach/ArgumentConversion branch July 11, 2023 06:23

hubertp reviewed Jul 11, 2023

View reviewed changes

Akirathan mentioned this pull request Jul 11, 2023

Engine benchmarks download tool is able to plot multiple branches #7257

Merged

5 tasks

JaroslavTulach mentioned this pull request Aug 29, 2023

Allow access to caches and instrumentation from Enso #7683

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Automatically apply `from` conversion when runtime argument check fails #7009

Automatically apply `from` conversion when runtime argument check fails #7009

JaroslavTulach commented Jun 10, 2023 •

edited

Loading

GregoryTravis commented Jun 12, 2023

xvcgreg commented Jun 26, 2023

JaroslavTulach commented Jun 28, 2023

JaroslavTulach commented Jul 5, 2023 •

edited

Loading

JaroslavTulach commented Jul 7, 2023 •

edited

Loading

JaroslavTulach commented Jul 9, 2023 •

edited

Loading

wdanilo left a comment •

edited

Loading

JaroslavTulach commented Jul 10, 2023

JaroslavTulach commented Jul 10, 2023

hubertp left a comment

hubertp Jul 11, 2023

hubertp Jul 11, 2023

hubertp Jul 11, 2023

radeusgd commented Jul 11, 2023

Automatically apply from conversion when runtime argument check fails #7009

Automatically apply from conversion when runtime argument check fails #7009

Conversation

JaroslavTulach commented Jun 10, 2023 • edited Loading

Pull Request Description

Details

Non-goal

Future Work

Checklist

GregoryTravis commented Jun 12, 2023

xvcgreg commented Jun 26, 2023

JaroslavTulach commented Jun 28, 2023

JaroslavTulach commented Jul 5, 2023 • edited Loading

JaroslavTulach commented Jul 7, 2023 • edited Loading

JaroslavTulach commented Jul 9, 2023 • edited Loading

wdanilo left a comment • edited Loading

Choose a reason for hiding this comment

JaroslavTulach commented Jul 10, 2023

JaroslavTulach commented Jul 10, 2023

hubertp left a comment

Choose a reason for hiding this comment

hubertp Jul 11, 2023

Choose a reason for hiding this comment

hubertp Jul 11, 2023

Choose a reason for hiding this comment

hubertp Jul 11, 2023

Choose a reason for hiding this comment

radeusgd commented Jul 11, 2023

Automatically apply `from` conversion when runtime argument check fails #7009

Automatically apply `from` conversion when runtime argument check fails #7009

JaroslavTulach commented Jun 10, 2023 •

edited

Loading

JaroslavTulach commented Jul 5, 2023 •

edited

Loading

JaroslavTulach commented Jul 7, 2023 •

edited

Loading

JaroslavTulach commented Jul 9, 2023 •

edited

Loading

wdanilo left a comment •

edited

Loading