Executed bytecode source for opcode lookup #73

han0110 · 2021-12-04T07:52:42Z

In EVM there are 3 different sources for executed bytecode:

When contract interaction, it's from contract's bytecode
When contract creation in root call, it's from transaction's calldata
When contract creation in internal call, it's from caller's memory

To avoid too much complexity on EVM circuit, we try to remove 3. by explicitly copying the caller's memory to bytecode_table, since for CREATE2 we already need to hash the initcode, so we abuse this for also CREATE.

Then the sources now are either 1. or 2., but there is another thing EVM circuit desires for:

For executed bytecode, EVM circuit expect executed bytecode to be analyzed and annotated if it's is_code on every byte, for verifying JUMP* in a single-step way, otherwise every time when it comes to JUMP*, it needs to look back to check if it's is_code or not, and this could be multi-steps.

So there are at least 2 options here to support creation transaction:

EVM circuit lookup sources 1. and 2. by condition, and implement is_code annotation on both bytecode circuit and tx circuit.

This seems reasonable but tx circuit has not been explored too much yet. Although it brings some implementation redundancy, the is_code annotation should not be too costly to do.
EVM circuit lookup sources 1. always, and explicitly copying transaction's calldata to bytecode_table when it's a creation one.

This seems to bring more loading to bytecode_table at first glance, but if a DOS attacker wants to blow up the bytecode_table, it would rather use EXTCODECOPY to read different huge contracts, which only costs a constant 2600 and generates 0x6000/2600 ≈ 9.45 bytes per gas in worst case. So it seems also a reasonable approach to avoid implementation redundancy and make the executed bytecode single source for EVM circuit.

Any feedbacks are appreciated.

The text was updated successfully, but these errors were encountered:

roynalnaruto · 2022-05-09T05:04:52Z

I am currently addressing the review comments on #191, and as @ed255 pointed out, it is important to finalise the definition of StepState.code_source in the specs.

@han0110 My opinion on the above issue is to use the bytecode table as the single source even for contract creation, specifically, this:

EVM circuit lookup sources 1. always, and explicitly copying transaction's calldata to bytecode_table when it's a creation one.

This will also reduce implementation complexity and any chances of possibly missing the is_create cases for any *CODE* related opcodes.

From #191 's perspective, I believe I need to populate the bytecode table for the above mentioned cases and document those changes appropriately. Would be nice to know what others think about these changes and whether or not they should be made as a separate PR. Thanks!

miha-stopar · 2022-05-09T09:08:17Z

Replying as an assigned reviewer for #191. I am not very familiar with the details, but 2. option seems cleaner to me too.

ed255 · 2022-05-09T10:05:37Z

I also prefer option 2 (single source of bytecode: the bytecode table) because it unifies the way bytecode is accessed (and I think this simplifies the design). Specially considering that we will have a copy circuit where we can offload copying memory to bytecode table and tx.call_data to bytecode table.

han0110 · 2022-05-13T07:44:53Z

As discussed in this week's call, let's implement option 2 for simplicity of EVM circuit. But for the Copy circuit (#194), it then needs to handle the copy from tx calldata in tx_table to bytecode_table in the future (or we can also implement this in Tx circuit directly), and such copy will be triggered by EVM circuit in BeginTx if it has is_create == True.

The rest todos to resolve this issue:

EVM circuit
- Rename code_source to code_hash
- (maybe) Remove is_create and is_root from tracking state and lookup them when necessary
- In creation tx, lookup Copy circuit to make sure the init code is copied into bytecode_table and get the code_hash
Copy circuit (or Tx circuit)
- Implement copy from tx calldata in tx_table to bytecode_table

ed255 · 2023-05-12T11:18:58Z

I believe we can consider this issue resolved as we already have CREATE in the specs #355
@han0110 since you reviewed the CREATE spec PR, can you confirm if it aligns with this discussion #73 (comment) ? If so, we can then close this issue :)

han0110 · 2023-05-15T02:41:14Z

For creation transaction it's not yet implemented, but the concept is similar, so yeah we can close this issue.

han0110 mentioned this issue Dec 4, 2021

Make gadgets to handle an execution state instead of an opcode per step privacy-scaling-explorations/zkevm-circuits#196

Merged

ChihChengLiang added the help wanted Extra attention is needed label Jan 6, 2022

han0110 mentioned this issue Mar 21, 2022

CODECOPY opcode #148

Merged

han0110 mentioned this issue Apr 11, 2022

[Fix] Constrain code_source to be 0 for root and create case privacy-scaling-explorations/zkevm-circuits#451

Closed

ed255 mentioned this issue May 2, 2022

opcode CODESIZE #191

Merged

This was referenced May 23, 2022

code_source rename to code_hash #205

Merged

Rename code_source to code_hash privacy-scaling-explorations/zkevm-circuits#531

Merged

han0110 mentioned this issue Mar 2, 2023

Add spec for CREATE(2) #355

Merged

han0110 closed this as completed May 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Executed bytecode source for opcode lookup #73

Executed bytecode source for opcode lookup #73

han0110 commented Dec 4, 2021 •

edited

Loading

roynalnaruto commented May 9, 2022 •

edited

Loading

miha-stopar commented May 9, 2022

ed255 commented May 9, 2022

han0110 commented May 13, 2022 •

edited by ed255

Loading

ed255 commented May 12, 2023

han0110 commented May 15, 2023

Executed bytecode source for opcode lookup #73

Executed bytecode source for opcode lookup #73

Comments

han0110 commented Dec 4, 2021 • edited Loading

roynalnaruto commented May 9, 2022 • edited Loading

miha-stopar commented May 9, 2022

ed255 commented May 9, 2022

han0110 commented May 13, 2022 • edited by ed255 Loading

ed255 commented May 12, 2023

han0110 commented May 15, 2023

han0110 commented Dec 4, 2021 •

edited

Loading

roynalnaruto commented May 9, 2022 •

edited

Loading

han0110 commented May 13, 2022 •

edited by ed255

Loading