Rewrite gas mechanisms in terms of explicit gas_charge nodes in the AST #887

vaivaswatha · 2020-09-16T06:22:49Z

The AST is transformed to include explicit gas_charge nodes. The evaluator just evaluates each gas_charge node and does not do any separate gas charging.

This enables the compiler to just generate code for gas_charge expressions, so that gas charging is uniform across both implementations (the interpreter and the compiler).
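As a rough illustration of the idea, here is a minimal sketch of an AST with an explicit gas charge node and an evaluator that charges gas only when it meets such a node. The constructor names `SizeOf` and `ValueOf` echo the commit titles in this PR, but every type and function below (`gas_charge`, `stmt`, `size_of`, `eval_charge`, `run`) is a hypothetical stand-in, not the actual definitions in src/base/Gas.ml or src/eval/Eval.ml.

```ocaml
(* Hypothetical mini-AST for gas charges; illustrative only. *)
type gas_charge =
  | StaticCost of int
  | SizeOf of string                  (* size of the value bound to a variable *)
  | ValueOf of string                 (* integer value bound to a variable *)
  | SumOf of gas_charge * gas_charge
  | ProdOf of gas_charge * gas_charge

type value = IntV of int | StrV of string

type stmt =
  | Bind of string * value            (* x = v *)
  | GasStmt of gas_charge             (* explicit gas charge node *)

exception Out_of_gas

(* Assumed size metric, chosen only for this sketch. *)
let size_of = function IntV _ -> 1 | StrV s -> String.length s

let rec eval_charge env = function
  | StaticCost n -> n
  | SizeOf x -> size_of (List.assoc x env)
  | ValueOf x ->
      (match List.assoc x env with
       | IntV n -> n
       | StrV _ -> invalid_arg "ValueOf: not an integer")
  | SumOf (a, b) -> eval_charge env a + eval_charge env b
  | ProdOf (a, b) -> eval_charge env a * eval_charge env b

(* The evaluator does no gas accounting of its own: gas is charged
   only when a GasStmt node is evaluated. Returns the remaining gas. *)
let run stmts gas =
  let step (env, gas) = function
    | Bind (x, v) -> ((x, v) :: env, gas)
    | GasStmt g ->
        let cost = eval_charge env g in
        if cost > gas then raise Out_of_gas else (env, gas - cost)
  in
  snd (List.fold_left step ([], gas) stmts)

let remaining =
  run [ Bind ("s", StrV "hello"); GasStmt (SumOf (StaticCost 1, SizeOf "s")) ] 10
(* remaining = 4: the charge is 1 + 5 = 6 out of a budget of 10 *)
```

A compiler backend could, under the same assumptions, translate each `GasStmt` into code that computes the same expression, which is what would keep the two implementations in agreement.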

There are minor gas charge differences arising out of this PR:

From commit 6e3d4a1
1. Places where we used String.length are now replaced with literal_cost of the string, which is computed differently.
2. Places where the length of "pp_literal l" was used are now replaced with literal_cost. This causes minor differences too.
3. For Builtin_alt_bn128_G1_mul, the computation of the logarithm of the argument now first adds one and then proceeds.
From commit 45b53be
1. The cost of send is now computed as the size of its "Scilla List of Scilla Messages", whereas earlier it was the size of an "OCaml List of Scilla Message".

To make gas charging uniform between the interpreter and the upcoming compiler / VM, the gas charging mechanism is moving towards explicit gas-charge nodes in the AST. The interpreter will charge gas, and the compiler will generate code to charge gas. This commit is the first step, where we define the new AST nodes. A later commit will introduce a pass to insert these nodes and rewrite Eval to charge based on them.


jjcnn

I have added a couple of suggestions and comments.

Other than that, I don't think it's ideal to have gas charge as a separate statement type, so that every second statement is a gas charge statement. I think the charging of gas should be tied more closely to the statement that incurs the gas charge.

My worry is that either compiler optimisations will be impossible (because gas charge statements prevent us from combining statements, e.g., through deforestation), or that gas charge statements are optimised out by accident (perhaps by the LLVM JIT compilation) because they don't affect the final result.

I don't really know how else to do it, though, but doing it this way makes my spine tingle in an unpleasant way...

src/base/Gas.ml

src/eval/Eval.ml

vaivaswatha · 2020-10-15T04:26:58Z

Thank you for the review, Jacob.

> I have added a couple of suggestions and comments.

I've addressed the individual comments, either by making the changes or responding to your comment.

Regarding your take on the overall change:

> Other than that I don't think it's ideal to have gas charge as a separate statement type, so that every second statement is a gas charge statement. I think the charging of gas should be tied more closely to the statement that incurs the gas charge.

This was conceptually what was happening earlier too. The evaluator would generate a gas charge descriptor after each statement, and then later that was evaluated to charge gas. We are just doing that statically now.

> My worry is that either compiler optimisations will be impossible (because gas charge statements prevent us from combining statements, e.g., through deforestation),

I agree that, to an extent, compiler optimizations may be impacted, but not made impossible. You can still, for example, combine statements. You'll have to be smarter about it. For example, do both computations first, and then both gas charges. If the gas charge comes before a computation, do both charges first, and then both computations (this is safe because the execution would run out of gas anyway if it was supposed to, only now without actually doing that computation).

> or that gas charge statements are optimised out by accident (perhaps by the LLVM JIT compilation) because they don't affect the final result.

No, this won't happen. Gas charge statements are (currently) compiled to be statements with a side effect (check and subtract from a global "gas_remaining" variable). Even if compiled in other ways (say a more "pure" way), the final remaining gas will be a part of the final result, and hence not optimized away.
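A minimal sketch of what "check and subtract from a global gas_remaining variable" could look like, written in OCaml rather than in the compiler's actual output. The `charge` function, the `Out_of_gas` exception and the starting budget of 100 are assumptions made for illustration.

```ocaml
exception Out_of_gas

(* Hypothetical global counter, mirroring the "gas_remaining"
   variable mentioned above. *)
let gas_remaining = ref 100

(* Check first, then subtract. The write to gas_remaining is an
   observable side effect, so the call is not dead code even though
   it returns unit. *)
let charge cost =
  if cost > !gas_remaining then raise Out_of_gas
  else gas_remaining := !gas_remaining - cost

let () =
  charge 30;
  charge 50
(* !gas_remaining is now 20; a further [charge 21] would raise Out_of_gas *)
```

The "pure" alternative mentioned above would thread the remaining gas through as a value and return it with the result, in which case it is part of the output and cannot be dropped either.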

Thanks again.

jjcnn · 2020-10-15T12:36:11Z

[snip]

> > My worry is that either compiler optimisations will be impossible (because gas charge statements prevent us from combining statements, e.g., through deforestation),

> I agree that, to an extent, compiler optimizations may be impacted, but not made impossible. You can still, for example, combine statements. You'll have to be smarter about it. For example, do both computations first, and then both gas charges. If the gas charge comes before a computation, do both charges first, and then both computations (this is safe because the execution would run out of gas anyway if it was supposed to, only now without actually doing that computation).

Except that with this change you don't know (for statements) whether a gas charge is connected to the previous or the next statement. We knew that before, so we could just optimise away without worrying about gas, and the gas charge would follow the optimised code.

Also, it's not just about whether we run out of gas or not. It's also about the amount of work the miner needs to do before we run out of gas. Say that your program is as follows (s for statement, g for gas charges):

s1;
g1;
s2;
g2; (* we run out of gas here *)
s3;
g3;

We now optimise into this:

s1;
s2;
s3;
g1;
g2; (* We run out of gas here *)
g3;

Now the miner has to execute s1, s2 and s3 before running out of gas, whereas before it ran out of gas after executing only s1 and s2. This is unfair to the miner.
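To make the difference concrete, here is a small sketch that replays both orderings against a gas budget and reports which statements ran before the out-of-gas point. The `op` type, the `executed` function, the cost of 2 per charge and the budget of 3 are all made up for the example; they are chosen only so that g2 is the charge that fails.

```ocaml
type op = S of string | G of int   (* statement name | gas cost *)

(* Run ops in order with a gas budget and return the statements that
   were executed before gas ran out (or all of them if it never did). *)
let executed budget ops =
  let rec go gas acc = function
    | [] -> List.rev acc
    | S name :: rest -> go gas (name :: acc) rest
    | G cost :: rest ->
        if cost > gas then List.rev acc else go (gas - cost) acc rest
  in
  go budget [] ops

let original = [ S "s1"; G 2; S "s2"; G 2; S "s3"; G 2 ]
let sunk     = [ S "s1"; S "s2"; S "s3"; G 2; G 2; G 2 ]

let work_original = executed 3 original   (* ["s1"; "s2"] *)
let work_sunk     = executed 3 sunk       (* ["s1"; "s2"; "s3"] *)
```

With the same budget, the reordered program performs one more statement's worth of work before failing.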

You may then say that this is not a legal optimisation because work that is "scheduled" to happen after an out-of-gas error must not be moved to before the out-of-gas error. But then I would counter that no optimisation involving multiple statements would be legal, because every gas charge statement may cause an out-of-gas error, so we can't move statements around, i.e., gas charge statements prevent optimisations at statement level.

vaivaswatha · 2020-10-15T12:45:52Z

[snip]


My point was that you don't optimize it the way you said, but as below, assuming none of the statements is a "fetch value" kind of statement (because in that case, the gas cost is known only after the statement executes).

g1;
g2; (* We run out of gas here *)
g3;
s1;
s2;
s3;

In this case, it is unfair to neither the miner nor the person executing the transition.

But in general, yes, optimizations need to be aware of gas, but I see no other way to ensure compatibility between gas charging in the interpreter and the VM.

jjcnn · 2020-10-15T12:55:33Z

[snip]
[snip]
> My point was that you don't optimize it the way you said, but as below, assuming none of the statements is a "fetch value" kind of statement (because in that case, the gas cost is known only after the statement executes).
>
> g1;
> g2; (* We run out of gas here *)
> g3;
> s1;
> s2;
> s3;
>
> In this case, it is unfair to neither the miner nor the person executing the transition.

But that optimisation is also illegal, because g1 and g2 depend on values computed by s1 and s2. So we can't move gi so that it happens before si, and we can't move si so that it happens before g(i-1). Therefore, we can't swap gas charges with non-gas statements, and therefore we won't be able to combine statements. The only exception is when the sequence is g1; s1; s2; g2, but then s2 is a Load statement (and s1 is not), which as far as I can tell does not allow any optimisation to happen.
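The dependency argument can be sketched as a simple well-formedness check: a gas charge may only read variables that an earlier statement has already defined. The `item` type and the `well_formed` function are hypothetical, and the model is deliberately crude (each statement defines exactly one variable, each charge reads a list of variables).

```ocaml
(* Hypothetical shapes: a statement defines one variable; a gas
   charge reads the sizes or values of some variables. *)
type item =
  | Stmt of string              (* variable the statement defines *)
  | Gas of string list          (* variables the charge depends on *)

(* A sequence is well-formed if every gas charge only reads
   variables defined by an earlier statement. *)
let well_formed items =
  let rec go defined = function
    | [] -> true
    | Stmt x :: rest -> go (x :: defined) rest
    | Gas vars :: rest ->
        List.for_all (fun v -> List.mem v defined) vars && go defined rest
  in
  go [] items

let interleaved = [ Stmt "x1"; Gas [ "x1" ]; Stmt "x2"; Gas [ "x2" ] ]
let hoisted     = [ Gas [ "x1" ]; Gas [ "x2" ]; Stmt "x1"; Stmt "x2" ]

let ok_interleaved = well_formed interleaved   (* true *)
let ok_hoisted     = well_formed hoisted       (* false *)
```

Under this model, hoisting all charges to the front is rejected as soon as a charge depends on a value computed by its statement. Sinking all charges to the end would pass this particular check, but it runs into the miner-fairness objection raised earlier in the thread.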

> But in general, yes, optimizations need to be aware of gas, but I see no other way to ensure compatibility between gas charging in the interpreter and the VM.

It needs to be very aware of gas, to the point where no optimisations seem to be possible.

jjcnn

There is now only a discussion of the principle of gas charge nodes in relation to optimisations left as an open question, so we can merge this, and postpone the discussion until we actually want to do some optimisations.

vaivaswatha added 9 commits September 8, 2020 16:02

Dynamic gas cost to be a polynomial in terms of variables' sizes

0ba6a4c

Separate SizeOf a variable and ValueOf of a variable

fe48f93

Use a mini-AST instead of polynomials

cf4aa24

Restore polynomials

d6e5f6a

Evaluate gas_charge ast in Eval

060f355

make fmt

19ae8f7

Rewrite Eval in terms of charging gas based on gas_charge AST nodes

45b53be

The gas charge differences in certain contracts are verified to be due to computing the cost of `send` as the size of its "Scilla List of Scilla Messages" (now) vs. "OCaml List of Scilla Message" earlier.

vaivaswatha marked this pull request as ready for review September 24, 2020 13:22

vaivaswatha requested review from anton-trunov and jjcnn as code owners September 24, 2020 13:22

vaivaswatha added 2 commits September 24, 2020 18:54

make fmt

d62e9c2

Minor fix in comment

42adf3b

vaivaswatha mentioned this pull request Oct 6, 2020

Runtime gas accounting Zilliqa/scilla-compiler#39

Closed

vaivaswatha and others added 5 commits October 6, 2020 17:25

Merge branch 'master' into ast_gas

7d5d5fe

Add pretty printer for gas_charge

2adf63f

Merge branch 'ast_gas' of github.com:Zilliqa/scilla into ast_gas

ce846e3

Fix bug in GasCharge: replace_variable_name

fd69d84

Merge branch 'master' into ast_gas

3308655

jjcnn suggested changes Oct 14, 2020

Couple fixes based on review comments

dec6211

raise error if gas_charge already in AST

50097f6

jjcnn approved these changes Oct 15, 2020

make fmt

3b844e7

vaivaswatha merged commit e4e1a0c into master Oct 16, 2020

vaivaswatha deleted the ast_gas branch October 16, 2020 03:19

vaivaswatha mentioned this pull request Oct 16, 2020

Fix gas charge for throw statements and print gas remaining in eval-runner #895

Merged
