[bytecode-verifier] Implement phantom params in bytecode #8599

nilehmann · 2021-06-22T18:22:14Z

This PR implements phantom type parameters in the bytecode. A type argument instantiating a phantom type parameter is not considered when deriving the abilities for the struct instantiation. For this to be sound, in a struct declaration a phantom type parameter can only appear in phantom position (or not appear at all), i.e., as an argument to a phantom type parameter.

Motivation

Avoid spurious ability annotations for marker types like XUS.

Have you read the Contributing Guidelines on pull requests?

Yes

Test Plan

Tests will be coming in the next PR.

diem-cla-bot · 2021-06-22T18:22:17Z

Thank you for your submission. We require all contributors to
sign the CLA before we can accept your contribution.

Have you signed the CLA already, but your status is still pending? Recheck CLA

vgao1996 · 2021-06-22T20:02:34Z

language/move-vm/types/src/loaded_data/runtime_types.rs

@@ -14,6 +14,7 @@ pub struct StructType {
    pub fields: Vec<Type>,
    pub abilities: AbilitySet,
    pub type_parameters: Vec<AbilitySet>,
+    pub type_param_kinds: Vec<TypeParamKind>,


I think we should have a discussion about whether to prefer AoS (Array of Structs) or SoA (Struct of Arrays) here. SoA is what the PR currently implements. Following is an example of AoS.

struct TypeParameter { is_phantom: bool, abilities: AbilitySet, } struct StructType { .. pub type_parameters: Vec<TypeParameter>, .. }

AoS seems more intuitive and unlike SoA, we don't have to check that the two arrays have the same length. However one advantage SoA has is that we will continue to be able to pass the old list of ability sets to functions that expect &[AbilitySet]. Although with AoS, the issue can be fixed by defining a getter trait.

trait TypeParamGetAbilitySet { fn ability_set() -> &AbilitySet; } fn consumer(ty_params: &[AbilitySet]); // changes to fn consumer(ty_params: &[impl TypeParamGetAbilitySet]);

@tnowacki thoughts?

We briefly discussed SoA with @tnowacki that's why I ended up implementing it. But after implementing it I think I lean towards AoS. Maintaining the invariant that both array are the same length can get very nasty and it's not very clear where to add the checks (I didn't find any other similar well-formed checks). The getter trait sounds like a good middle ground, didn't think of that.

You can do what you want here. It might make sense here to do Vec<TypeParameterType> or something.

But in the file format I think it is important to keep them separate. See that discussion in the serializer.

vgao1996 · 2021-06-22T20:08:49Z

language/move-binary-format/src/serializer.rs

+    serialize_ability_sets(binary, &struct_handle.type_parameters)?;
+    serialize_type_param_kinds(binary, &struct_handle.type_param_kinds)
+}
+
+fn serialize_type_param_kinds(binary: &mut BinaryData, kinds: &[TypeParamKind]) -> Result<()> {
+    for kind in kinds {
+        serialize_type_param_kind(binary, *kind)?;
+    }
+    Ok(())
+}
+
+fn serialize_type_param_kind(binary: &mut BinaryData, kind: TypeParamKind) -> Result<()> {
+    match kind {
+        TypeParamKind::Phantom => binary.push(SerializedTypeParamKindFlag::PHANTOM as u8),
+        TypeParamKind::NonPhantom => binary.push(SerializedTypeParamKindFlag::NON_PHANTOM as u8),
+    }


Similarly, I think a discussion about AoS (Array of Structs) vs SoA (Struct of Arrays) should happen here.

Do we want

[length] [abi 1] [abi 2] [abi 3] [phantom 1] [phantom 2] [phantom 3]

or

[length] [abi 1] [phantom 1] [abi 2] [phantom 2] [abi 3] [phantom 3]

?

Additionally

In the former case, do we want to compress the phantom bits into a bit vector?

In the latter case, do we want to compress the phantom bit and the ability set into the same bit vector?

The problem with "[length] [abi 1] [phantom 1] [abi 2] [phantom 2] [abi 3] [phantom 3]" is we don't want to make this a breaking change.

So that gives you:

If it is not present, it is assumed that all the type parameters are non-phantom

if it is present, you specify the status/usage for each one

This way you don't need to make a breaking change... but then again you are going to have to gate this based on the version of the file format, so maybe the breaking change isn't a big deal and doing the AoS solution would work

Yeah the mroe I think about it since you have to gate based on the file format version, we might as well just do the cleaner thing and do [length] [abi 1] [phantom 1] [abi 2] [phantom 2] [abi 3] [phantom 3]

Yeah I originally also wanted compare the schemas in terms of binary compatibility, but then came into the same conclusion that any addition to the file format requires a new version number, making it meaningless to talk about backward compatibility, unless we come up with some notation that would allow us to say something like "version 5 is compatible with (can be read as) version 6" or equivalently introduce some sort of ".1" semantics, which I don't think we'll do.

Thinking about binary compatibility might help make it easier for us to implement ser/de though, as minimizing the difference between the old and new format is one of the goals. Still, I don't think this is a major concern and I'd prefer that we simply take the new version as a chance to evolve the file format.

bors-libra · 2021-06-22T20:11:32Z

❗ Invalid command

Bors help and documentation

Bors is a Github App used to manage merging of PRs in order to ensure that CI is always green. It does so by maintaining a Merge Queue. Once a PR reaches the head of the Merge Queue it is rebased on top of the latest version of the PR's base-branch (generally master) and then triggers CI. If CI comes back green the PR is then merged into the base-branch. Regardless of the outcome, the next PR is the queue is then processed.

General

This project's Merge Queue can be found here.
This project requires PRs to be Reviewed and Approved before they can be queued for merging.
Before PR's can be merged they must be configured with "Allow edits from maintainers" enabled. This is needed for Bors to be able to update PR's in-place so that Github can properly recognize and mark them as "merged" once they've been merged into the upstream branch.

Commands

Bors actions can be triggered by posting a comment which includes a line of the form /<action>.

Command	Action	Description
Land	`land`, `merge`	attempt to land or merge a PR
Canary	`canary`, `try`	canary a PR by performing all checks without merging
Cancel	`cancel`, `stop`	stop an in-progress land
Cherry Pick	`cherry-pick <target>`	cherry-pick a PR into `<target>` branch
Priority	`priority`	set the priority level for a PR (`high`, `normal`, `low`)
Help	`help`, `h`	show this help message

Options

Options for Pull Requests are configured through the application of labels.

Option	Description
	Indicates that the PR is high-priority. When queued the PR will be placed at the head of the merge queue.
	Indicates that the PR is low-priority. When queued the PR will be placed at the back of the merge queue.
	Before merging the PR will be squashed down to a single commit, only retaining the commit message of the first commit in the PR.

language/move-binary-format/src/file_format.rs

tnowacki · 2021-06-22T23:19:58Z

language/move-binary-format/src/serializer.rs

+    serialize_ability_sets(binary, &struct_handle.type_parameters)?;
+    serialize_type_param_kinds(binary, &struct_handle.type_param_kinds)
+}
+
+fn serialize_type_param_kinds(binary: &mut BinaryData, kinds: &[TypeParamKind]) -> Result<()> {
+    for kind in kinds {
+        serialize_type_param_kind(binary, *kind)?;
+    }
+    Ok(())
+}
+
+fn serialize_type_param_kind(binary: &mut BinaryData, kind: TypeParamKind) -> Result<()> {
+    match kind {
+        TypeParamKind::Phantom => binary.push(SerializedTypeParamKindFlag::PHANTOM as u8),
+        TypeParamKind::NonPhantom => binary.push(SerializedTypeParamKindFlag::NON_PHANTOM as u8),
+    }


The problem with "[length] [abi 1] [phantom 1] [abi 2] [phantom 2] [abi 3] [phantom 3]" is we don't want to make this a breaking change.

So that gives you:

If it is not present, it is assumed that all the type parameters are non-phantom

if it is present, you specify the status/usage for each one

This way you don't need to make a breaking change... but then again you are going to have to gate this based on the version of the file format, so maybe the breaking change isn't a big deal and doing the AoS solution would work

language/move-core/types/src/vm_status.rs

tnowacki · 2021-06-22T23:23:16Z

language/move-vm/types/src/loaded_data/runtime_types.rs

@@ -14,6 +14,7 @@ pub struct StructType {
    pub fields: Vec<Type>,
    pub abilities: AbilitySet,
    pub type_parameters: Vec<AbilitySet>,
+    pub type_param_kinds: Vec<TypeParamKind>,


You can do what you want here. It might make sense here to do Vec<TypeParameterType> or something.

But in the file format I think it is important to keep them separate. See that discussion in the serializer.

nilehmann · 2021-06-22T23:51:43Z

@tnowacki If I read #8599 (comment) correctly then you agree on going AoS all the way and in file_format have

struct TypeParameter {
  constraint: AbilitySet,
  is_phantom: bool,
}
struct StructHandle {
   ...
   type_parameters: Vec<TypeParameter>
   ...
}

and serialize it like:

[length] [abi 1] [phantom 1] ... [abi n] [phantom n]

?

If so, the only thing remaining is whether to serialize abi and phantom in the same bit vector, but I can implement the above and come to this later.

tnowacki · 2021-06-23T16:26:35Z

@nilehmann This struct TypeParameter plan looks great!

The fileformat is not hyper-optimized already so I would just serialize each field of phantom and abilities separately. We can always piggyback on the phantom u8 for other flags in the future if needed

And just looking ahead, definitely don't use TypeParameter struct for fn polymorphic_abilities. I know it has the exactly correct signature, but in meaning it is sadly very different (as the arguments to polymorphic_abilities are the abilities for the type argument/actuals and the is_phantom is a descriptor for the declaration

nilehmann · 2021-06-23T19:32:36Z

@tnowacki @vgao1996 I updated the PR as discussed.

tnowacki

This is looking really great! This AoS solution is looking a lot mroe slick. I have mostly focused on the vm bits and will take a deeper look at the other parts later. But some more minor things to change up at least at the moment :)

tnowacki · 2021-06-24T02:59:07Z

language/move-binary-format/src/file_format.rs

+/// Struct type parameters can be declared as phantom while
+/// function type parameters cannot. This getter trait
+/// is used to work over both of them uniformily.
+pub trait TypeParamGetConstraints {
+    fn constraints(&self) -> &AbilitySet;
+}
+
+impl TypeParamGetConstraints for TypeParameter {
+    fn constraints(&self) -> &AbilitySet {
+        &self.constraints
+    }
+}
+
+impl TypeParamGetConstraints for AbilitySet {
+    fn constraints(&self) -> &AbilitySet {
+        self
+    }


Maybe this is nice in some spots? but to me this feels really unnecessary, especially since the field is public.

I think using impl IntoIterator in a few spots should clean this up for you

The main motivation for this is to use it in BinaryIndexedView::abilities where we need the indexing. So yeah, we can get away with impl IntoIterator in many places, but I don't know of a better solution for this case.

IntoIterator should work for that case too

Sorry, I don't see how IntoIterator could work in this case as we have to access arbitrary indices.

TypeParameter(idx) => *constraints[*idx as usize].constraints(),

Yeah I would say the trait is for cases where you need random access, e.g. checking type arguments against type parameters. In such cases, collecting into a vector could result in a regression in terms time complexity.

@nilehmann could you confirm if such cases actually exist in our codebase?

@vgao1996 BinaryIndexedView::abilities is the only place where random access is used. If we keep that as a &[AbilitySet] I think the only place that needs an extra collect is here

That random access is also duplicated for test generation here.

There are also other places that access the length, but I think those are not too problematic.

language/move-binary-format/src/serializer.rs

tnowacki · 2021-06-24T03:03:59Z

language/move-binary-format/src/proptest_types/functions.rs

-    fn get_signature_from_type_params(&mut self, abilities: &[AbilitySet]) -> Signature {
+    fn get_signature_from_type_params(
+        &mut self,
+        abilities: &[impl TypeParamGetConstraints],


one thing you might want to do instead is impl IntoIterator<Item = &AbilitySet> that way you won't need this other trait

language/move-binary-format/src/file_format.rs

tnowacki · 2021-06-24T03:08:05Z

language/move-lang/src/interface_generator.rs

+        .map(|(idx, ty_param)| {
+            format!(
+                "{}{}{}",
+                if ty_param.is_phantom { "phantom " } else { "" },


this is probably fine. but would also be fine with doing an assert!(ty_param.is_phantom()) or something for now. Up to you though since you will likely be doing both parts :)

tnowacki · 2021-06-24T03:09:01Z

language/move-vm/runtime/src/loader.rs

-    fn verify_ty_args(&self, constraints: &[AbilitySet], ty_args: &[Type]) -> PartialVMResult<()> {
+    fn verify_ty_args(
+        &self,
+        constraints: &[impl TypeParamGetConstraints],


same as the other spot impl IntoIterator<Item = &AbilitySet> will fix this. Also maybe just impl IntoIterator<Item = AbilitySet> since Ability set is just a u8

This function in particular accesses the length so the implementation would also need to change. I think the trait is cleaner.

Then we can duplicate the function :) Or just copy them out into another vec. Both of these seem better to me than the trait

So one thing we could do is leave everything as it was with &[AbilitySet] everywhere and add an extra collect at the call site when calling from a "struct context". I guess you'd be trading extra allocations for the complexity of adding the trait. I particularly like more the trait as I think it's the right abstraction, it captures what's common about function type parameters and struct type parameters, but I'm fine either way.

You could also have two different entry points. and then have a shared _impl that takes a size and the iterator

But I also don't think the extra allocations will be a big deal, like at all. In practice this vector will have like 5 elements at most.

tnowacki · 2021-06-24T03:10:17Z

language/move-vm/runtime/src/loader.rs

            )),
            Type::Struct(idx) => Ok(self.module_cache.read().struct_at(*idx).abilities),
            Type::StructInstantiation(idx, type_args) => {
-                let declared_abilities = self.module_cache.read().struct_at(*idx).abilities;
+                let struct_type = self.module_cache.read().struct_at(*idx);
                let type_argument_abilities = type_args


wild, didn't notice before. Or I did and looked it up then. But do we check some where that the correct number of type arguments were passed in?

None of the code I touched seems to check it, but there may be a check somewhere else? I just went on and zip the args and params.

language/move-binary-format/src/deserializer.rs

nilehmann · 2021-06-24T23:53:26Z

@tnowacki @vgao1996 I added a commit without the trait to see how it looks. It avoids extra allocations with IntoIterator whenever possible and ExactSizeIterator when the length is required. The only extra allocations required because random access is needeed are here and here.

tnowacki · 2021-06-25T18:25:38Z

The only extra allocations required because random access is needeed are here and here.

Definitely love the Iterator traits! I do find it funny how much Rust makes you ("you" being any Rust programmer) feel bad for extra allocations, when most of the time you probably won't see a big performance difference :P A very amusing fallout from some choices around clone and the like

runtian-zhou · 2021-07-02T03:11:25Z

@runtian-zhou

Uh unfortunately there doesn't seem to be an easy way to bypass the test. If we would like to land it now we can strip off the diem framework recompilation so that it won't trigger the error?

That would be great. Is it easy to instruct @nilehmann to do the necessary change?

Basically revert all changes under diem-releases/artifacts folder and check if the failed test would pass or not. And create a new PR that includes those changes so that we will have recompiled DiemFramework in the main branch

tnowacki · 2021-07-02T03:12:56Z

Basically revert all changes under diem-releases/artifacts folder and check if the failed test would pass or not. And create a new PR that includes those changes so that we will have recompiled DiemFramework in the main branch

I don't think you can do that. There is another test that stops you from landing anything unless artifacts is up to date. At least that is what @tzakian told me.

nilehmann · 2021-07-02T04:25:25Z

@tnowacki yeah there's a test that checks the working directory doesn't have unstaged changes. That's how I realized artifact even existed in the first place.

tzakian · 2021-07-06T17:24:41Z

I think Runtian's suggestion is the right course of action if we want to get this in soon. You can turn off the test to make sure the files are up-to-date by putting a #[ignore] on top of this line https://github.com/diem/diem/blob/main/language/diem-framework/tests/generated_files.rs#L23

runtian-zhou · 2021-07-06T20:28:53Z

Double checked the hypothesis why LBT failed using following command:

./scripts/cti --tag land_2c154089 --cluster-test-tag land_2c154089 -E BATCH_SIZE=15 -E UPDATE_TO_TAG=land_230f6395 --report report.json --suite land_blocking_compat

This command start a cluster test runner from main branch and update nodes to main+PR image.

The test was exxecuted successfully with following output:

2021-07-06T19:58:57.678737Z [main] INFO testsuite/cluster-test/src/aws.rs:28 Scaling to desired_capacity : 0, buffer: 0, asg_name: ct-1-k8s-testnet-validators
Test runner setup time spent 238 secs
Compatibility test results for land_2c154089 ==> land_230f6395 (PR)
1. All instances running land_2c154089, generating some traffic on network
2. First full node land_2c154089 ==> land_230f6395, to validate new full node to old validator node traffic
3. First Validator node land_2c154089 ==> land_230f6395, to validate storage compatibility
4. First batch validators (14) land_2c154089 ==> land_230f6395, to test consensus and traffic between old full nodes and new validator node
5. First batch full nodes (14) land_2c154089 ==> land_230f6395
6. Second batch validators (15) land_2c154089 ==> land_230f6395, to upgrade rest of the validators
7. Second batch of full nodes (15) land_2c154089 ==> land_230f6395, to finish the network upgrade, time spent 673 secs
all up : 963 TPS, 4711 ms latency, 5350 ms p99 latency, no expired txns, time spent 250 secs

This proved that the PR has no compatibility issue with the older file format. With this we can proceed landing this code with cluster test disabled @sausagee .

runtian-zhou · 2021-07-06T20:31:24Z

This PR will change the file format so the next Diem Framework release will need to be dependent on the Diem Core release.

sausagee · 2021-07-06T20:55:57Z

/land priority=high

bors-libra · 2021-07-06T20:55:59Z

@sausagee ❗ This PR is still missing approvals, unable to queue for landing

runtian-zhou

Let's do this!

runtian-zhou · 2021-07-06T20:57:26Z

/land priority=high

Closes: diem#8599

bors-libra added this to In Review in bors Jun 22, 2021

diem-cla-bot bot added the cla-unsigned label Jun 22, 2021

nilehmann requested review from tnowacki, sblackshear, shazqadeer and vgao1996 June 22, 2021 18:22

nilehmann force-pushed the phantom-type-params branch from ec5b275 to 02f367f Compare June 22, 2021 18:31

vgao1996 reviewed Jun 22, 2021

View reviewed changes

tnowacki reviewed Jun 22, 2021

View reviewed changes

diem-cla-bot bot added cla-signed and removed cla-unsigned labels Jun 23, 2021

nilehmann force-pushed the phantom-type-params branch 3 times, most recently from 912fd99 to 24cb27d Compare June 23, 2021 19:29

nilehmann force-pushed the phantom-type-params branch from 24cb27d to 4d68793 Compare June 24, 2021 00:43

nilehmann requested review from tnowacki and vgao1996 June 24, 2021 00:47

nilehmann force-pushed the phantom-type-params branch from 4d68793 to 597f41c Compare June 24, 2021 02:50

tnowacki reviewed Jun 24, 2021

View reviewed changes

nilehmann force-pushed the phantom-type-params branch 3 times, most recently from c2c67cc to de64021 Compare June 24, 2021 23:47

nilehmann requested a review from tnowacki June 25, 2021 16:27

nilehmann dismissed tnowacki’s stale review via a6887ba July 6, 2021 17:45

nilehmann force-pushed the phantom-type-params branch 3 times, most recently from 60f2cd1 to add6060 Compare July 6, 2021 17:48

nilehmann mentioned this pull request Jul 6, 2021

[diem-framework] recompile with phantom params #8687

Closed

runtian-zhou removed the breaking Any changes that will require restarting the testnet label Jul 6, 2021

nilehmann force-pushed the phantom-type-params branch from add6060 to 1660f39 Compare July 6, 2021 20:41

bors-libra added the bors-high-priority label Jul 6, 2021

runtian-zhou previously approved these changes Jul 6, 2021

View reviewed changes

bors-libra moved this from In Review to Queued in bors Jul 6, 2021

[bytecode] Implement phantom type parameters

5c45fd2

Closes: diem#8599

bors-libra moved this from Queued to Testing in bors Jul 6, 2021

bors-libra dismissed runtian-zhou’s stale review via 5c45fd2 July 6, 2021 21:53

bors-libra force-pushed the phantom-type-params branch from 1660f39 to 5c45fd2 Compare July 6, 2021 21:53

bors-libra removed this from Testing in bors Jul 6, 2021

bors-libra closed this in 5c45fd2 Jul 6, 2021

bors-libra merged commit 5c45fd2 into diem:main Jul 6, 2021

bors-libra temporarily deployed to Docker July 6, 2021 21:53 Inactive

bors-libra temporarily deployed to Sccache July 6, 2021 21:53 Inactive

nilehmann mentioned this pull request Jul 7, 2021

[move-lang] Implement phantom parameters #8694

Merged

[bytecode-verifier] Implement phantom params in bytecode #8599

[bytecode-verifier] Implement phantom params in bytecode #8599

Conversation

nilehmann commented Jun 22, 2021 • edited

Motivation

Have you read the Contributing Guidelines on pull requests?

Test Plan

diem-cla-bot bot commented Jun 22, 2021

Choose a reason for hiding this comment

nilehmann Jun 22, 2021 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bors-libra commented Jun 22, 2021

General

Commands

Options

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nilehmann commented Jun 22, 2021

tnowacki commented Jun 23, 2021

nilehmann commented Jun 23, 2021

tnowacki left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nilehmann Jun 24, 2021 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

vgao1996 Jun 24, 2021 • edited

Choose a reason for hiding this comment

nilehmann Jun 24, 2021 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nilehmann Jun 24, 2021 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nilehmann commented Jun 24, 2021 • edited

tnowacki commented Jun 25, 2021 • edited

runtian-zhou commented Jul 2, 2021

tnowacki commented Jul 2, 2021

nilehmann commented Jul 2, 2021

tzakian commented Jul 6, 2021

runtian-zhou commented Jul 6, 2021

runtian-zhou commented Jul 6, 2021

sausagee commented Jul 6, 2021

bors-libra commented Jul 6, 2021

runtian-zhou left a comment

Choose a reason for hiding this comment

runtian-zhou commented Jul 6, 2021

nilehmann commented Jun 22, 2021 •

edited

nilehmann Jun 22, 2021 •

edited

nilehmann Jun 24, 2021 •

edited

vgao1996 Jun 24, 2021 •

edited

nilehmann Jun 24, 2021 •

edited

nilehmann Jun 24, 2021 •

edited

nilehmann commented Jun 24, 2021 •

edited

tnowacki commented Jun 25, 2021 •

edited