Move : [transactional-tests] More migrated tests - (128) #10675

rutkaracn · 2022-12-29T11:34:24Z

[transactional-tests] More migrated tests #9042

Motivation

Migrated linker_tests/, method_decorators/, module_member_types/, modules/, mutate_tests/, and mutation/
Most tests were bytecode verifier tests, with a few VM tests
Nearly done migrating bytecode verifier/VM related IR tests to transactional tests.

Test Plan

CI/CD testcases covered.

[move-prover] minor improvements to the verification analysis pass #9053

Motivation

Two improvements are made in the new verification analysis pass:

[move-prover] only verify functions that directly modifies a target invariant the prior behavior is to verify functions that directly accesses a target invariant, which unnecessarily leaves many functions to be marked as verified.
NOTE: only memory modifications cause global invariants to be asserted; memory accesses without modification leads to assumption of global invariants only, but not assertions.
[move-prover] improve error reporting on invariant suspension pramgas. The prior errors are too verbose.

Test Plan

CI/CD
This new pass is not turned on as of now. It will be made as default in a later PR.

[move-prover] algorithm for progressive instantiation #9056

Motivation

This algorithm deals with finding a complete set of instantiation combinations for all type parameters when unifying two types.

// problem definition
The algorithm is encapsulated in struct TypeInstantiationDerivation and is not conceptually hard to understand.

// algorithm description
The algorithm works by finding all instantiations for X0 first, and then progress to X1, X2, ..., until finishing Xn.

// other notes

The implementation has a bit of fine-tuning rooted by the fact that sometimes we want to treat a type parameter as a variable (i.e., participate in type unification) while in other cases, we want to treat a type parameter as a concrete type (i.e., do not participate in type unification).
We also have a fine-tuning on whether we treat a type parameter that does not have any valid instantiations as an error or remains as a concrete type parameter. This is rooted by the differentation of type parameters in function vs type parameters in a global invariant. Essentially, all type parameters in a global invariant must be instantiated in order for the invariant to be instrumented. But not all function type paramters need to be instantiated.
This is not the most efficient algorithm, especially when we have a large number of type parameters. But a vast majority of Move code we have seen so far have at most one type parameter, so in this commit, we trade-off efficiency with simplicity.

Test Plan

CI/CD testcases were covered.

[move-prover] dump bytecode and result in output dir #9052

Motivation

Previous dumping location is at the first Move source location, which may pollute the source directories. Furthermore, if we pass a directory as the first source location, the process will panic.

This commit changes the default output location for --dump-bytecode to the parent directory of the output.bpl file, and format the bytecode dumps with bytecode_{step_number}_{step_name}.bytecode.

Test Plan

CI
cargo run -p move-prover -- <dir-name>/<file-name>.move --dump-bytecode. Note that the bytecode dumps are generated at the current work directory instead of under <dir-name>.

[move prover] Arithmetic mutations #8980

Motivation

This change updates two components of the mutation tester. First, it adds the sub-add, mul-div, and div-mul mutation operators. Second, it provides a fix to needing to provide addresses when working on the Diem framework.

Test Plan

CI/CD testcases were covered.

[move-prover] an analysis pass for the global invariants #9072

Motivation

This pass collects information on how to inject global invariants into the bytecode of a function.
The end result of this analysis is summarized in two structs:

/// A named struct for holding the information on how an invariant is
/// relevant to a bytecode location.
struct PerBytecodeRelevance {
    /// for each `inst_fun` (instantiation of function type parameters) in the key set, the
    /// associated value is a set of `inst_inv` (instantiation of invariant type parameters) that
    /// are applicable to the concrete function instance `F<inst_fun>`.
    insts: BTreeMap<Vec<Type>, BTreeSet<Vec<Type>>>,
}

/// A named struct for holding the information on how invariants are relevant to a function.
struct PerFunctionRelevance {
    /// Invariants that needs to be assumed at function entrypoint
    /// - Key: global invariants that needs to be assumed before the first instruction,
    /// - Value: the instantiation information per each related invariant.
    entrypoint_assumptions: BTreeMap<GlobalId, PerBytecodeRelevance>,

    /// For each bytecode at given code offset, the associated value is a map of
    /// - Key: global invariants that needs to be asserted after the bytecode instruction and
    /// - Value: the instantiation information per each related invariant.
    per_bytecode_assertions: BTreeMap<CodeOffset, BTreeMap<GlobalId, PerBytecodeRelevance>>,
}

A note about PerBytecodeRelevance : in fact, in this phase, we don't intend to instantiation the function nor do we want to collect information on how this function (or this bytecode) needs to be instantiated. All we care is how the invariant should be instantiated in order to be instrumented at this code point, with a generic function (generic bytecode).

But unfortunately, based on how the type unification logic is written now, this two-step instantiation is needed in order to find all possible instantiations of the invariant. I won't deny that there might be a way to collect invariant instantiation combinations without instantiating the function type parameters, but I haven't iron out one so far.

Test Plan

CI/CD testcases were covered.

Add support for CRSNs #8528

Motivation

Bottom Commit

Implements CRSNs and the CRSN logic according to DIP-168. These changes have been approved in #8403.

Middle Commit

Adds support for CRSNs to mempool. In particular, updates needed to be performed in order to support:

Processing of transactions in a non-blocking manner
Eviction of transactions based on the LHS of the sliding nonce of the account, and not on the committed transactions sequence nonce (since this method does not work in the CRSN case).

I'm not that familiar with mempool, so there might be something I've forgotten that I need to change -- please let me know if that's the case. Also feel free to add reviewers if I've forgotten to add someone.

The general methodology for implementation here is:

When a transaction hits mempool we determine if it is a CRSN or seqno transaction
Validation will return both the LHS of the CRSN window and k
Transactions will be removed from mempool if the transaction's seqnonce < the current LHS when a new transaction is added
A transaction will be accepted to mempool if the transaction's seqnonce >= LHS
A transaction will be considered for processing as long as it is >= LHS and < LHS + k
On commit of a transaction, other transactions may be evicted from mempool based off of the LHS of the account at the time the transaction entered mempool.
This adds a number of tests to verify the expected behavior for transactions with CRSNs.

Top Commit

Adds support for CRSNs to the prologue/epilogue.

Test Plan

CI/CD tetscases were covered.

[move] type layout generation from modules #9073

Motivation

This is the clone of #8968 just because I don't know how to commandar that PR.

Add code in the move-binary-format crate to create a MoveTypeLayout from a CompiledModule
In the cli, add new generate struct-layout command that leverages this functionality to dump layouts in YAML format
The eventual goal is to have the CLI spit out YAML that can be used directly by serde-generate to create typed struct bindings in any language supported by serde-generate
Added the error propagation comparing to ([move] type layout generation from modules #8968).

Test Plan

CI/CD testcases were covered.

[diem-framework] Port DiemTimestamp to unit tests #9079

Motivation

Ports the tests for DiemTimestamp to use unit tests. Adds additional tests for full coverage of the module.

TestPlan

CI/CD testcases were covered.

[move-prover] Rewrite the access control spec as two state invariants #9025

Motivation

This PR defines is_txn_signer, and replaced is_signer with it because the previous implementation of is_signer didn't fit for purpose (see the issue [Bug] issue with is_signer #9018).

The existing access control spec uses the schema application which is verbose. To simplify the access control spec and make it easier to read, this PR rewrites the existing access control spec as two state invariants with the new spec construct is_txn_signer_addr (and is_txn_signer).

This PR is mostly about cleaning-up the spec for the role-based access control in the Diem Framework. The next step is to look into the capability-based one (e.g., MintCapability, ...)

Test Plan

cargo test

[move-prover] an instrumentation pass for the global invariants #9077

Motivation

An instrumentation pass that consumes the information produced by the global invariant analysis pass and instrument the global invariants into the function.

The instrumenter supports two mode of operations, depending on whether the prover backend supports monomorphization or not:

With the option --boogie-poly, the instrumenter will instrument instantiated invariants in the generic function (and the generic instance is the only function instance).
Without the --boogie-poly option, the instrumenter will instrument instantiated invariants per each instantiation of a generic function (this is the traditional workflow). And this means that a function will have multiple instances for verification.

This is not exactly the plan we had before (and does not clearly adhere to the paper). The original plan is to go for option 1 first and defer the instantiation of functions to the mono pass. Therefore, the option 2 here is essentially a combination of

instrumentation,
monomorphization, and
optimization (eliminating redundant expressions)

But option 2 does not completely solve the "type-dependent" code problem because the (move_to<T>, move_to<u64>) case still requires a second step of function instantiation and still requires the mono pass to perform such instantiation.

The main reason why we still have option 2 (and not only that, we made option 2 as default as of now) is three-fold:
1.I am uncertain of Boogie's monomorphization implementation matches the complexity of what we have done here.
2.I want to get at least the whole Diem Framework verifying again to test out the whole transformation pipeline, in order to boost our confidence.
3.Solving the move_to<T>; move_to<u64>; problem requires more than instantiation; we need spec language support as well to express the fact that the function will surely abort if T == u64 and may not abort otherwise.

With this final piece, we complete the new invariant instrumentation
pipeline (modulus the misaligned implementation plan mentioned above)
and the next PR will switch the pipeline into the new one and fix the
specs in the Diem Framework.

Test Plan

CI/CD tetscases were covered.

[transactional-tests] Migrate remaining VM/Bytecode verifier IR tests #9078

Motivation

migrated remaining tests
tests left over are either for the DF or the IR compiler itself
Check for tokens after module or script in IR. Caused silent test failures

Test Plan

CI/CD testcases were covered.

ankitkacn · 2022-12-30T03:17:50Z

/canary

- Check for tokens after module or script in IR. Caused silent test failures Closes: #9078 Closes: #10675

bors-diem · 2022-12-30T04:18:50Z

💔 Test Failed - ci-test

ankitkacn · 2023-01-04T04:33:11Z

/canary

- Check for tokens after module or script in IR. Caused silent test failures Closes: #9078 Closes: #10675

bors-diem · 2023-01-04T05:59:29Z

💥 Tests timed-out

rutkaracn · 2023-01-04T06:18:28Z

/canary

- Check for tokens after module or script in IR. Caused silent test failures Closes: #9078 Closes: #10675

bors-diem · 2023-01-04T07:26:27Z

☀️ Canary successful

ankitkacn · 2023-01-04T07:27:25Z

/land

…le_member_types/, modules/, mutate_tests/, and mutation/ - Migrated linker_tests/, method_decorators/, module_member_types/, modules/, mutate_tests/, and mutation/ - Most tests were bytecode verifier tests, with a few VM tests Closes: diem#9042

…nvariant The prior behavior is to verify functions that directly accesses a target invariant, which unnecessarily leaves many functions to be marked as verified. NOTE: only memory modifications cause global invariants to be asserted; memory accesses without modification leads to assumption of global invariants only, but not assertions.

To make it less verbose and more readable. Closes: diem#9053

This algorithm deals with finding a complete set of instantiation combinations for all type parameters when unifying two types. // problem definition The algorithm is encapsulated in `struct TypeInstantiationDerivation` and is not conceptually hard to understand: Suppose we aim to unify `T1 [X0, X1, ..., Xm]` vs `T2 [Y0, Y1, ..., Yn]`, where `T1` and `T2` are types while `X`s and `Y`s are type parameters that show up in `T1` and `T2`, respectively. We want to find all instantiations to `<X0, X1, ..., Xm>` such that for each instantiation `(x0, x1, ..., xm)`, there exists a valid instantiation `(y0, y1, ..., yn)` which makes `T1` and `T2` equivelant, i.e., `T1<x0, x1, ..., xm> == T2<y0, y1, ..., yn>`. We put all these instantiation in a set denoted as `|(x0, x1, ..., xm)|` and this algorithm is about finding this set of instantiations. // algorithm description The algorithm works by finding all instantiations for `X0` first, and then progress to `X1`, `X2`, ..., until finishing `Xn`. - unify `T1 [X0, X1, ..., Xm]` vs `T2 [Y0, Y1, ..., Yn]`, get all possible substitutions for `X0`, denoted as `|x0|` - for each `x0 in |x0|`: - refine `T1` with `x0` - unify `T1 [X0 := x0, X1, ..., Xm]` vs `T2 [Y0, Y1, ..., Yn]`, get all possible substitutions for `X1`, denoted as `|x1|` - for each `x1 in |x1|`: - refine `T1` with `x1` - unify `T1 [X0 := x0, X1 := x1, ..., Xm]` vs `T2 [Y0, Y1, ..., Yn]`, get all possible substitutions for `X2`, denoted as `|x2|` - for each `x2` in `|x2|`: - ...... The process continues until we reach the end of `Xn`. After which, the algorithm should have collected all the legal instantiation combinations for type parameters `<X0, X1, ..., Xm>`. // other notes - The implementation has a bit of fine-tuning rooted by the fact that sometimes we want to treat a type parameter as a variable (i.e., participate in type unification) while in other cases, we want to treat a type parameter as a concrete type (i.e., do not participate in type unification). - We also have a fine-tuning on whether we treat a type parameter that does not have any valid instantiations as an error or remains as a concrete type parameter. This is rooted by the differentation of type parameters in function vs type parameters in a global invariant. Essentially, all type parameters in a global invariant must be instantiated in order for the invariant to be instrumented. But not all function type paramters need to be instantiated. - This is not the most efficient algorithm, especially when we have a large number of type parameters. But a vast majority of Move code we have seen so far have at most one type parameter, so in this commit, we trade-off efficiency with simplicity. Closes: diem#9056

Previous dumping location is at the first Move source location, which may pollute the source directories. Furthermore, if we pass a directory as the first source location, the process will panic. This commit changes the default output location for `--dump-bytecode` to the parent directory of the `output.bpl` file, and format ihe bytecode dumps with `bytecode_{step_number}_{step_name}.bytecode`. Closes: diem#9052

Closes: diem#8980

This pass collects information on how to inject global invariants into the bytecode of a function. The end result of this analysis is summarized in two structs: ```rust /// A named struct for holding the information on how an invariant is /// relevant to a bytecode location. struct PerBytecodeRelevance { /// for each `inst_fun` (instantiation of function type parameters) in the key set, the /// associated value is a set of `inst_inv` (instantiation of invariant type parameters) that /// are applicable to the concrete function instance `F<inst_fun>`. insts: BTreeMap<Vec<Type>, BTreeSet<Vec<Type>>>, } /// A named struct for holding the information on how invariants are relevant to a function. struct PerFunctionRelevance { /// Invariants that needs to be assumed at function entrypoint /// - Key: global invariants that needs to be assumed before the first instruction, /// - Value: the instantiation information per each related invariant. entrypoint_assumptions: BTreeMap<GlobalId, PerBytecodeRelevance>, /// For each bytecode at given code offset, the associated value is a map of /// - Key: global invariants that needs to be asserted after the bytecode instruction and /// - Value: the instantiation information per each related invariant. per_bytecode_assertions: BTreeMap<CodeOffset, BTreeMap<GlobalId, PerBytecodeRelevance>>, } ``` A note about `PerBytecodeRelevance`: in fact, in this phase, we don't intend to instantiation the function nor do we want to collect information on how this function (or this bytecode) needs to be instantiated. All we care is how the invariant should be instantiated in order to be instrumented at this code point, with a generic unction and generic code. But unfortunately, based on how the type unification logic is written now, this two-step instantiation is needed in order to find all possible instantiations of the invariant. I won't deny that there might be a way to collect invariant instantiation combinations without instantiating the function type parameters, but I haven't iron out one so far. Closes: diem#9072

Closes: diem#8528

- Add code in the move-binary-format crate to create a MoveTypeLayout from a CompiledModule - In the cli, add new `generate struct-layout` command that leverages this functionality to dump layouts in YAML format - The eventual goal is to have the CLI spit out YAML that can be used directly by serde-generate to create typed struct bindings in any language supported by serde-generate Closes: diem#9073

Closes: diem#9079

The existing access control spec uses the schema application which is verbose. To simplify the access control spec and make it easier to read, this PR rewrites the existing access control spec as two state invariants with the new spec construct `is_signer`. This PR is mostly about cleaning-up the spec for the role-based access control in the Diem Framework. The next step is to look into the capability-based one (e.g., MintCapability, ...) This PR defines `is_txn_signer`, and replaced `is_signer` with it because the previous implementation of `is_signer` didn't fit for purpose. Closes: diem#9025

An instrumentation pass that consumes the information produced by the global invariant analysis pass and instrument the global invariants into the function. The instrumenter supports two mode of operations, depending on whether the prover backend supports monomorphization or not: 1) With the option `--boogie-poly`, the instrumenter will instrument *instantiated* invariants in the *generic* function (and the generic instance is the only function instance). 2) Without the `--boogie-poly` option, the instrumenter will instrument *instantiated* invariants *per each instantiation* of a generic function (this is the traditional workflow). And this means that a function will have multiple instances for verification. This is not exactly the plan we had before (and does not clearly adhere to the paper). The original plan is to go for option 1 first and defer the instantiation of functions to the mono pass. Therefore, the option 2 here is essentially a combination of 1) instrumentation, 2) monomorphization, and 3) optimization (eliminating redundant expressions) But option 2 does not completely solve the "type-dependent" code problem because the (`move_to<T>`, `move_to<u64>`) case still requires a second step of function instantiation and still requires the mono pass to perform such instantiation. The main reason why we still have option 2 (and not only that, we made option 2 as default as of now) is three-fold: 1) I am uncertain of Boogie's monomorphization implementation matches the complexity of what we have done here. 2) I want to get at least the whole Diem Framework verifying again to test out the whole transformation pipeline, in order to boost our confidence. 3) Solving the `move_to<T>; move_to<u64>;` problem requires more than instantiation; we need spec language support as well to express the fact that the function will surely abort if `T == u64` and may not abort otherwise. With this final piece, we complete the new invariant instrumentation pipeline (modulus the misaligned implementation plan mentioned above) and the next PR will switch the pipeline into the new one and fix the specs in the Diem Framework. Closes: diem#9077

- migrated remaining tests - tests left over are either for the DF or the IR compiler itself

- Check for tokens after module or script in IR. Caused silent test failures Closes: diem#9078 Closes: diem#10675

bors-diem added this to In Review in bors Dec 29, 2022

rutkaracn requested review from dhaneshacn and KoushikGavini December 29, 2022 11:34

bors-diem moved this from In Review to Canary in bors Dec 30, 2022

bors-diem pushed a commit that referenced this pull request Dec 30, 2022

[Move IR] Check for tokens after module/script

c463bf0

- Check for tokens after module or script in IR. Caused silent test failures Closes: #9078 Closes: #10675

bors-diem moved this from Canary to In Review in bors Dec 30, 2022

bors-diem moved this from In Review to Canary in bors Jan 4, 2023

bors-diem pushed a commit that referenced this pull request Jan 4, 2023

[Move IR] Check for tokens after module/script

48c065f

- Check for tokens after module or script in IR. Caused silent test failures Closes: #9078 Closes: #10675

dhaneshacn approved these changes Jan 4, 2023

View reviewed changes

bors-diem moved this from Canary to In Review in bors Jan 4, 2023

rutkaracn force-pushed the move branch from f254df7 to b48b5c9 Compare January 4, 2023 06:17

bors-diem moved this from In Review to Canary in bors Jan 4, 2023

bors-diem pushed a commit that referenced this pull request Jan 4, 2023

[Move IR] Check for tokens after module/script

fbdd7b1

- Check for tokens after module or script in IR. Caused silent test failures Closes: #9078 Closes: #10675

bors-diem moved this from Canary to In Review in bors Jan 4, 2023

bors-diem moved this from In Review to Queued in bors Jan 4, 2023

Todd Nowacki and others added 8 commits January 4, 2023 07:28

[move-prover] improve error reporting on invariant suspension pramgas

7027b12

To make it less verbose and more readable. Closes: diem#9053

Working mutation framework

1f7855b

Closes: diem#8980

[diem-framework] Implement CRSNs in Move

621a58f

Closes: diem#8528

sblackshear and others added 6 commits January 4, 2023 07:28

[diem-framework] Port DiemTimestamp to unit tests

a508d3f

Closes: diem#9079

[transactional-tests] Migrate remaining VM/Bytecode verifier IR tests

dba5a0c

- migrated remaining tests - tests left over are either for the DF or the IR compiler itself

[Move IR] Check for tokens after module/script

9edf8d9

- Check for tokens after module or script in IR. Caused silent test failures Closes: diem#9078 Closes: diem#10675

bors-diem moved this from Queued to Testing in bors Jan 4, 2023

bors-diem force-pushed the move branch from b48b5c9 to 9edf8d9 Compare January 4, 2023 08:15

bors-diem removed this from Testing in bors Jan 4, 2023

bors-diem merged commit 9edf8d9 into diem:latest Jan 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move : [transactional-tests] More migrated tests - (128) #10675

Move : [transactional-tests] More migrated tests - (128) #10675

rutkaracn commented Dec 29, 2022 •

edited

ankitkacn commented Dec 30, 2022

bors-diem commented Dec 30, 2022

ankitkacn commented Jan 4, 2023

bors-diem commented Jan 4, 2023

rutkaracn commented Jan 4, 2023

bors-diem commented Jan 4, 2023

ankitkacn commented Jan 4, 2023

Move : [transactional-tests] More migrated tests - (128) #10675

Move : [transactional-tests] More migrated tests - (128) #10675

Conversation

rutkaracn commented Dec 29, 2022 • edited

[transactional-tests] More migrated tests #9042

Motivation

Test Plan

[move-prover] minor improvements to the verification analysis pass #9053

Motivation

Test Plan

[move-prover] algorithm for progressive instantiation #9056

Motivation

Test Plan

[move-prover] dump bytecode and result in output dir #9052

Motivation

Test Plan

[move prover] Arithmetic mutations #8980

Motivation

Test Plan

[move-prover] an analysis pass for the global invariants #9072

Motivation

Test Plan

Add support for CRSNs #8528

Motivation

Bottom Commit

Middle Commit

Top Commit

Test Plan

[move] type layout generation from modules #9073

Motivation

Test Plan

[diem-framework] Port DiemTimestamp to unit tests #9079

Motivation

TestPlan

[move-prover] Rewrite the access control spec as two state invariants #9025

Motivation

Test Plan

[move-prover] an instrumentation pass for the global invariants #9077

Motivation

Test Plan

[transactional-tests] Migrate remaining VM/Bytecode verifier IR tests #9078

Motivation

Test Plan

ankitkacn commented Dec 30, 2022

bors-diem commented Dec 30, 2022

ankitkacn commented Jan 4, 2023

bors-diem commented Jan 4, 2023

rutkaracn commented Jan 4, 2023

bors-diem commented Jan 4, 2023

ankitkacn commented Jan 4, 2023

rutkaracn commented Dec 29, 2022 •

edited