Support multiple assignments #978

Smit-create · 2022-08-16T12:01:13Z

Fixes #928

certik · 2022-08-17T20:20:41Z

Thanks. Can you add a test for this as well please?

certik · 2022-08-19T08:24:05Z

Let me try to understand. This:

    a = b = c = 10

creates this ASR:

               (= 
                  (Var 2 c) (IntegerConstant 10 
                  (Integer 4 [])) 
                  (= 
                     (Var 2 b) (IntegerConstant 10 
                     (Integer 4 [])) 
                     (= 
                        (Var 2 a) (IntegerConstant 10 
                        (Integer 4 [])) ())))] ()

So this does c = 10 and overloads it recursively: c = 10 is overloaded with b = 10, which in turn is overloaded with a = 10. I think that will not work with the way overloaded works: overloaded lets you specify a different (user-defined) operation, typically a subroutine call. We should have caught this in verify, but our verify is not very strong yet.

This in turn is the reason for this change:

    void visit_Assignment(const ASR::Assignment_t &x) {
        if( x.m_overloaded ) {
            this->visit_stmt(*x.m_overloaded);
-           return ;
        }

But it will break LFortran, because there we must perform the overloaded operation instead of the original one. So I think the current approach will not work.
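To make the concern concrete, here is a toy Python model of a backend that honors overloaded. It is not the actual LFortran or LPython visitor: `Assignment`, `SubroutineCall`, `Backend`, and `emit` are hypothetical names used only for illustration. With the `return`, only the overloaded operation is emitted; without it, the original assignment is emitted as well.

```python
from dataclasses import dataclass
from typing import Optional, Union


@dataclass
class SubroutineCall:
    """Stand-in for a user-defined assignment routine (hypothetical)."""
    name: str


@dataclass
class Assignment:
    target: str
    value: int
    overloaded: Optional[Union["Assignment", SubroutineCall]] = None


class Backend:
    """Records, in order, the operations a backend would emit."""

    def __init__(self, return_after_overloaded):
        self.return_after_overloaded = return_after_overloaded
        self.emitted = []

    def visit(self, node):
        if isinstance(node, SubroutineCall):
            self.emitted.append(f"call {node.name}")
            return
        if node.overloaded is not None:
            self.visit(node.overloaded)
            if self.return_after_overloaded:
                return  # the overloaded operation replaces the assignment
        self.emitted.append(f"{node.target} = {node.value}")


def emit(node, return_after_overloaded):
    backend = Backend(return_after_overloaded)
    backend.visit(node)
    return backend.emitted


# Fortran-style overloaded assignment: the call must replace the built-in one.
user_defined = Assignment("x", 1, overloaded=SubroutineCall("my_assign"))
# The nesting from the ASR above: c = 10 "overloaded" by b = 10, then by a = 10.
chained = Assignment("c", 10, Assignment("b", 10, Assignment("a", 10)))

print(emit(user_defined, True))   # ['call my_assign']
print(emit(user_defined, False))  # ['call my_assign', 'x = 1']
print(emit(chained, True))        # ['a = 10']
print(emit(chained, False))       # ['a = 10', 'b = 10', 'c = 10']
```

In this model, keeping the `return` drops two of the three chained assignments, while removing it makes the user-defined case perform both the call and the built-in assignment. No single setting gives the right result for both.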

So how should this be implemented?

Well, what is a = b = c = 10?

Isn't that exactly equivalent to:

    a = 10
    b = 10
    c = 10

Or is there some semantic difference?

If it is equivalent, then when the AST->ASR pass encounters the AST node(s) for a = b = c = 10, it should generate three ASR statement nodes (appended to the body) of the form a = 10, with no overload.

There is a slight issue of appending more than one statement, but I think we have some mechanism for it, and if not, we need to create it.
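As a reference point for the equivalence question, here is a small sketch using CPython's standard `ast` module (CPython's AST, not LPython's AST or ASR). `split_chained_assign` is a hypothetical helper. In CPython the right-hand side of a chained assignment is evaluated once and then bound to each target, so expanding it into separate statements is only equivalent when re-evaluating the value is harmless, as it is for the constant 10.

```python
import ast
import copy


def split_chained_assign(stmt):
    """Expand `t1 = t2 = ... = value` into one single-target Assign per target.

    Only safe when evaluating `value` several times is harmless (e.g. a constant).
    """
    singles = []
    for target in stmt.targets:
        single = copy.deepcopy(stmt)            # keeps value and source location
        single.targets = [copy.deepcopy(target)]
        singles.append(single)
    return singles


tree = ast.parse("a = b = c = 10")
chained = tree.body[0]
num_targets = len(chained.targets)   # CPython stores one Assign with 3 targets

tree.body = split_chained_assign(chained)
expanded_source = ast.unparse(tree)  # requires Python 3.9+
print(expanded_source)

# With a mutable right-hand side the two forms differ in CPython:
x = y = []       # evaluated once, so x and y name the same list
p = []
q = []           # two separate lists
print(x is y, p is q)  # True False
```

Whether the aliasing case matters for LPython depends on its copy semantics, which this sketch does not model.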

Smit-create · 2022-08-19T08:30:37Z

Maybe we can follow the same logic but leave the overload alone for now, and add a new argument to the Assignment node, something like multiple_assignment:

 stmt
     = Allocate(alloc_arg* args, expr? stat, expr? errmsg, expr? source)
     | Assign(int label, identifier variable)
-    | Assignment(expr target, expr value, stmt? overloaded)
+    | Assignment(expr target, expr value, stmt? overloaded, stmt? multiple_assignment)
     | Associate(expr target, expr value)
     | Cycle()

Or, as another approach, we could insert all the individual assignment nodes and then visit them.

certik · 2022-08-19T09:36:24Z

That seems quite complicated, because every backend then has to support it. We are trying to keep the ASR as minimal as possible, but without losing any semantic information. It's not black and white, but this feature seems rarely used, so the workaround with multiple Assignments seems better than modifying ASR. We should only extend ASR when we need to represent the semantic operation, or when it simplifies the backends. In this case, we do not necessarily need to preserve this operation, and it makes the backends more complicated. So it does not seem worth doing it that way.

czgdp1807 · 2022-08-19T09:51:10Z

I think converting a = b = c = 10 into

    c = 10
    b = c
    a = b

should be doable. We should do this because it keeps the ASR simple. The semantics of a = b = c = 10 and of the three assignments above are no different for LPython (as we do deepcopy by default). If we support shallow copy in the future, c = 10 will still work as it is, and b = c and a = b will eventually make a and b point to c (which I think is the expected behaviour for shallow copy).
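The lowering described here can be sketched on CPython's `ast` nodes, which stand in for LPython's AST and ASR. `lower_chained_assign` is a hypothetical helper and, as an assumption of this sketch, it handles plain variable targets only.

```python
import ast


def lower_chained_assign(stmt):
    """Lower `t1 = ... = tn = value` to `tn = value; t(n-1) = tn; ...; t1 = t2`."""
    lowered = []
    value = stmt.value
    for target in reversed(stmt.targets):
        if not isinstance(target, ast.Name):
            raise NotImplementedError("sketch handles plain variable targets only")
        lowered.append(
            ast.Assign(
                targets=[ast.Name(id=target.id, ctx=ast.Store())],
                value=value,
                lineno=stmt.lineno,
                type_comment=None,
            )
        )
        # The next (outer) target reads from the variable just assigned.
        value = ast.Name(id=target.id, ctx=ast.Load())
    return lowered


module = ast.parse("a = b = c = 10")
module.body = lower_chained_assign(module.body[0])
lowered_source = ast.unparse(ast.fix_missing_locations(module))  # Python 3.9+
print(lowered_source)
# c = 10
# b = c
# a = b
```

Because the value expression appears only in the first emitted statement, it is evaluated once, which matches how a chained assignment behaves in CPython.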

Appending multiple statements to the body can be implemented in several ways. You can track the current body by making it part of the visitor's state and then modify it directly when you create multiple assignment statements, but that is not a very safe approach. The other way is to create a Vec<ASR::stmt_t*> stmts and use it at the places where body.push_back is called. Either way, it should be doable.

Smit-create · 2022-08-20T06:04:35Z

This is ready for review @czgdp1807 @certik

certik · 2022-08-20T06:39:27Z

src/lpython/semantics/python_ast_to_asr.cpp

                                    overloaded);
+            tmp_vec.push_back(tmp);


Rather than assigning to both tmp and tmp_vec, why not use the convention that if it is just one, it will be in tmp, and if tmp == nullptr, then it will be in tmp_vec?

Here is how I am thinking about it:

tmp is not null: we have exactly one statement, the most common case

tmp is null: the tmp_vec is used to communicate what is returned: tmp_vec.size() == 0 (nothing), size() == 2 or more (two or more statements).

We could in principle only use tmp_vec even for 1 statement (and remove tmp), but I think the tmp convention is used for expressions as well, and it seems like the above idea is better.

Yes. I agree.

In this case we should update other code that currently returns tmp=null to mean "none" to ensure it sets tmp_vec.size() == 0.

Yeah. Otherwise an incorrect set of statements might end up getting into ASR. We should always clear tmp_vec after pushing all of its elements to the body. That way we will know that it is always empty once used.
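The convention from this thread can be sketched with a toy Python model. The real code is C++ in python_ast_to_asr.cpp; here `tmp` and `tmp_vec` are the names from the discussion, while `BodyVisitor`, `visit_stmt`, and `transform_body` are hypothetical, and statements are plain strings rather than ASR nodes.

```python
class BodyVisitor:
    """Toy model of the tmp / tmp_vec convention discussed above."""

    def __init__(self):
        self.tmp = None      # exactly one resulting statement, the common case
        self.tmp_vec = []    # used only when tmp is None: zero, or two or more

    def visit_stmt(self, stmt):
        self.tmp = None
        if stmt.strip() == "pass":
            return           # "nothing": tmp is None and tmp_vec stays empty
        # Hypothetical lowering of "t1 = t2 = ... = value" style strings.
        *targets, value = [part.strip() for part in stmt.split("=")]
        if len(targets) == 1:
            self.tmp = f"{targets[0]} = {value}"
            return
        for target in reversed(targets):
            self.tmp_vec.append(f"{target} = {value}")
            value = target

    def transform_body(self, stmts):
        body = []
        for stmt in stmts:
            self.visit_stmt(stmt)
            if self.tmp is not None:
                body.append(self.tmp)
            else:
                body.extend(self.tmp_vec)
            # Always reset, so stale statements never leak into the next visit.
            self.tmp = None
            self.tmp_vec.clear()
        return body


body = BodyVisitor().transform_body(["x = 1", "pass", "a = b = c = 10"])
print(body)  # ['x = 1', 'c = 10', 'b = c', 'a = b']
```

Clearing `tmp_vec` in the one place that consumes it means every visit starts from an empty vector, so a visitor that returns "nothing" needs no extra bookkeeping.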

certik

Looks good to me. @czgdp1807 let me know if this looks good to you. See also my question above regarding the convention of tmp and tmp_vec.

czgdp1807 · 2022-08-20T11:10:35Z

integration_tests/expr_09.py

@@ -8,4 +8,27 @@ def main0():
    print(-i1 ^ -i2)
    assert -i1 ^ -i2 == 6

+
+def test_multiple_assign_1():
+    a: i32; b:i32; c:i32


Let's try with variables of different types as well. Something like,

    d: f64; e: f32; f: c64; g: i32
    c = d = e = g + 1.0

Also try with lists, tuples (of the same element types) variables everywhere in the multiple assignment statement.

certik · 2022-08-20T17:26:31Z

src/lpython/semantics/python_ast_to_asr.cpp

+                        tmp = make_DictInsert_t(al, x.base.base.loc, se, key, tmp_value);
+                        tmp_vec.push_back(tmp);


Something like this?

Suggested change

-   tmp = make_DictInsert_t(al, x.base.base.loc, se, key, tmp_value);
-   tmp_vec.push_back(tmp);
+   tmp = nullptr;
+   tmp_vec.push_back(make_DictInsert_t(al, x.base.base.loc, se, key, tmp_value));

Wait, or is this just one? In that case, just put it into tmp, and leave tmp_vec be.

No, we can have many of them. We have a continue statement below this.

certik

I think this is good to be merged, after adding the documentation comment (see my comment above) and polishing the git history.

certik · 2022-08-21T13:24:07Z

Awesome, thanks for implementing this @Smit-create !

Smit-create force-pushed the i-928 branch from 7507d3e to 90ec923 · August 19, 2022 07:32

Smit-create requested review from certik and czgdp1807 · August 19, 2022 07:33

Smit-create force-pushed the i-928 branch from 90ec923 to 2609811 · August 19, 2022 10:19

certik reviewed · Aug 20, 2022

certik approved these changes · Aug 20, 2022

czgdp1807 requested changes · Aug 20, 2022

Smit-create requested a review from czgdp1807 · August 20, 2022 15:02

Smit-create force-pushed the i-928 branch from 3c1feb3 to 7fe5318 · August 20, 2022 15:21

czgdp1807 approved these changes · Aug 20, 2022

certik reviewed · Aug 20, 2022

Smit-create force-pushed the i-928 branch from 7fe5318 to 696cf54 · August 21, 2022 05:36

certik reviewed · Aug 21, 2022

certik approved these changes · Aug 21, 2022

Smit-create added 3 commits · August 21, 2022 15:01

    83ba6b4  Support multiple assignments
    753a6f8  Add and update tests
    834c3a2  Fix build warnings

Smit-create force-pushed the i-928 branch from 696cf54 to 834c3a2 · August 21, 2022 09:32

Smit-create enabled auto-merge · August 21, 2022 09:40

Smit-create merged commit bac0986 into lcompilers:main · Aug 21, 2022

Smit-create deleted the i-928 branch · August 21, 2022 13:31
