Runtime vector refactoring #261

brson · 2023-07-29T22:59:04Z

Work towards #249

These patches are primarily about moving all the vector types and functions into the vector module, and converting all the free functions into methods on those types.

Many existing functions take a pair of MoveType and MoveUntypedVector, pair them and perform a typed operation. This is always unsafe because it requires the two to agree. After these patches, this pairing is always performed in the unsafe constructors of e.g. TypedMoveBorrowedRustVec, which allows some of the subsequent method calls to be declared safe.
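To make the pairing concrete, here is a minimal sketch of the pattern, not the actual move-native code. `ElemType`, `UntypedVec`, `TypedVec` and `demo_len` are hypothetical, simplified stand-ins for `MoveType`, `MoveUntypedVector` and `TypedMoveBorrowedRustVec`. The only unsafe step is the constructor, where the caller promises that the type and the storage agree; the methods that follow can then be safe.

```rust
/// Hypothetical, simplified stand-in for the runtime's type descriptor.
#[derive(Clone, Copy)]
pub enum ElemType {
    U8,
    U64,
}

/// Hypothetical untyped vector: raw storage that does not know its element type.
pub struct UntypedVec {
    pub ptr: *const u8,
    pub len: usize, // number of elements, not bytes
}

/// Typed, borrowed view produced by pairing a type with an untyped vector.
pub enum TypedVec<'a> {
    U8(&'a [u8]),
    U64(&'a [u64]),
}

impl<'a> TypedVec<'a> {
    /// # Safety
    ///
    /// `ty` must describe the elements actually stored in `v`, and `v.ptr`
    /// must be valid and suitably aligned for `v.len` elements of that type.
    pub unsafe fn new(ty: ElemType, v: &'a UntypedVec) -> Self {
        match ty {
            ElemType::U8 => {
                TypedVec::U8(unsafe { std::slice::from_raw_parts(v.ptr, v.len) })
            }
            ElemType::U64 => TypedVec::U64(unsafe {
                std::slice::from_raw_parts(v.ptr.cast::<u64>(), v.len)
            }),
        }
    }

    /// Safe: the constructor already established that type and storage agree.
    pub fn len(&self) -> usize {
        match self {
            TypedVec::U8(s) => s.len(),
            TypedVec::U64(s) => s.len(),
        }
    }
}

/// Builds a typed view over a `Vec<u64>` and queries it through a safe method.
pub fn demo_len() -> usize {
    let data: Vec<u64> = vec![10, 20, 30];
    let untyped = UntypedVec {
        ptr: data.as_ptr().cast::<u8>(),
        len: data.len(),
    };
    // The single unsafe step: we assert that `ElemType::U64` matches `data`.
    let typed = unsafe { TypedVec::new(ElemType::U64, &untyped) };
    typed.len()
}
```

With this shape, the safety argument lives in one place (the constructor's contract) instead of being repeated at every free function that takes the type and the vector separately.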

During this process I discovered that slight refactorings could drastically change the number of executed rbpf instructions, sometimes exhausting the CPU budget, in the bitvector test specifically. This is certainly due to the whims of the LLVM optimizer. So I tracked the perf changes between commits, as indicated in some of the commit messages. I did not attempt to maintain perfect parity with existing instruction counts, just to keep them from ballooning too much.

I think the optimizer has particular difficulty seeing all the way through from the constructor to the destructor of the big TypedMoveBorrowedRustVec enum - when it does it can avoid switching multiple times on it. But that's just speculation. I haven't looked closely. I did put #[inline(always)] on the new function of this type as I saw huge decreases in instruction counts doing so - usually inline(always) is not recommended as the llvm inliner tends to make better decisions than people do, but in this case it looked like too big a win to pass up.
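As a rough illustration of the "switching multiple times" concern, the sketch below uses hypothetical names (`Tag`, `View`, `wide_len`) and a much smaller enum than the real one. The constructor switches on a tag to build the enum, and the method switches again on the resulting variant. When the constructor is inlined into a caller that knows the tag, the optimizer has a chance to fold both switches into straight-line code; whether it actually does so is up to LLVM and is not guaranteed.

```rust
/// Hypothetical element-width tag, standing in for a runtime type descriptor.
#[derive(Clone, Copy)]
pub enum Tag {
    Narrow,
    Wide,
}

/// Hypothetical small version of a typed-view enum over raw bytes.
pub enum View<'a> {
    Narrow(&'a [u8]), // one byte per element
    Wide(&'a [u8]),   // eight bytes per element
}

impl<'a> View<'a> {
    /// First switch: choose the variant from the tag.
    /// `inline(always)` asks the compiler to inline this into every caller.
    #[inline(always)]
    pub fn new(tag: Tag, bytes: &'a [u8]) -> Self {
        match tag {
            Tag::Narrow => View::Narrow(bytes),
            Tag::Wide => View::Wide(bytes),
        }
    }

    /// Second switch: dispatch on the variant chosen above.
    pub fn len(&self) -> usize {
        match self {
            View::Narrow(b) => b.len(),
            View::Wide(b) => b.len() / 8,
        }
    }
}

/// The tag is a constant here, so after inlining both matches are candidates
/// for constant folding.
pub fn wide_len(bytes: &[u8]) -> usize {
    View::new(Tag::Wide, bytes).len()
}
```

Measuring instruction counts before and after adding the attribute, as described above, is the only reliable way to tell whether it helps in a given build.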

I also added docs to the vector module.

This patch also replaces all uses of &mut AnyValue with *mut AnyValue as the former is not sound: safely writing to a &mut AnyValue could result in the actual type having an invalid representation, so writing to an AnyValue must be unsafe (though no code actually does this). I've updated the docs to indicate this.
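A small sketch of why the raw pointer is the safer signature. The `AnyValue` below is a hypothetical one-byte placeholder, not the real definition, and `write_bool` and `demo_write` are illustrative names. If a function handed out `&mut AnyValue`, safe code could assign any `AnyValue` through it, even when the memory really holds a `bool`, for which only the bit patterns 0 and 1 are valid. With `*mut AnyValue`, every write has to go through `unsafe` and state what type is really behind the pointer.

```rust
/// Hypothetical opaque placeholder for "some Move value of unknown type".
#[allow(dead_code)]
#[repr(transparent)]
pub struct AnyValue(u8);

/// Writes a `bool` through an untyped pointer.
///
/// # Safety
///
/// `v` must point to a valid, writable `bool`.
pub unsafe fn write_bool(v: *mut AnyValue, b: bool) {
    unsafe { v.cast::<bool>().write(b) }
}

/// Flips a local `bool` through its untyped pointer and returns the result.
pub fn demo_write() -> bool {
    let mut flag = false;
    let p = &mut flag as *mut bool as *mut AnyValue;
    // Had `p` been a `&mut AnyValue`, safe code could have written
    // `*p = AnyValue(2)`, leaving `flag` with an invalid bit pattern.
    unsafe { write_bool(p, true) };
    flag
}
```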

ksolana

Assuming there were no functionality changes

nvjle · 2023-08-01T15:26:31Z

LGTM.

Some asides:
Regarding bit_vector mentioned above. I also ran into the problem of exceeding computational budget when originally implementing generics and getting the bit_vector stdlib tests working. While LLVM can do a better job, it should be noted that the bit_vector module is absurdly, horribly inefficient-- and there is nothing LLVM can do about this. I cannot imagine a real program using this module as it exists today. Clearly it was written as a first effort for baseline functionality, and not as a serious, production facility. Even ignoring the inefficient way the Move code itself was written, all of the functions should be natives for such fine-grained bit diddling (which can also control computation costs). The other inefficiency is that it also relies on std::vector, which just is not very efficient to begin with (in the speed-of-light sense) because of the layering needed in move-native (i.e., everything you see in this patch, reinterpreting Move vectors as Rust, transfers to and fro, etc).

Regarding safety-- in the general sense-- we still are not doing anything about invalid inputs (either from an adversary or a compiler defect). All these routines will still gladly interpret, say, invalid descriptors. I mentioned in the past that we need some way to validate the various descriptors. Even something as simple as storing a "magic number" with a descriptor will at least help guard against compiler problems (but not adversaries). For example, I recently fixed a defect where we did not dereference a pointer to a runtime descriptor of some sort. By chance, the tests still happened to pass-- just by bad luck. When I happened to be adding more functionality and tests, I was getting strange behavior and crashes in the runtime. This was difficult to track down, but would have been simple with a magic number check. But the more concerning problem is how to make sure adversaries can't send in bad data.
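For reference, a magic number check along these lines could look like the sketch below. Everything in it is an assumption for illustration: the `Descriptor` layout, the `DESCRIPTOR_MAGIC` value and the `elem_size` helper are hypothetical and do not correspond to the real runtime descriptors. As noted above, this only catches accidental misuse such as a missing dereference; an adversary can simply include the expected constant.

```rust
/// Hypothetical marker value: the ASCII bytes of "MOVE".
pub const DESCRIPTOR_MAGIC: u32 = 0x4D4F_5645;

/// Hypothetical runtime descriptor carrying a magic number as its first field.
#[repr(C)]
pub struct Descriptor {
    pub magic: u32,
    pub size: u64,
}

impl Descriptor {
    /// Constructs a descriptor with the expected magic number filled in.
    pub fn new(size: u64) -> Self {
        Descriptor {
            magic: DESCRIPTOR_MAGIC,
            size,
        }
    }

    /// Rejects anything that does not start with the expected marker.
    pub fn validate(&self) -> Result<(), &'static str> {
        if self.magic == DESCRIPTOR_MAGIC {
            Ok(())
        } else {
            Err("bad descriptor magic")
        }
    }
}

/// Example runtime entry point: check the marker before trusting any field.
pub fn elem_size(d: &Descriptor) -> Result<u64, &'static str> {
    d.validate()?;
    Ok(d.size)
}
```

A check like this turns a silent misinterpretation into an immediate, identifiable failure at the first runtime call that receives the bad descriptor.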

brson added 30 commits July 29, 2023 22:46

Move MoveBorrowedRustVec into vector mod

673a348

Move TypedMoveBorrowedRustVec into vector mod

db6b311

Move MoveBorrowedRustVecOfStruct into vector mod

e202ffe

Update AnyValue docs

25eb55e

Convert &mut AnyValue to *mut AnyValue

e0a0764

Add some fixme to vector copy ops

46ce701

Always inline TypedMoveBorrowedRustVec ctor

cea9e46

Convert TypedMoveBorrowedRustVec::len to method

7d41bf4

Convert TypedMoveBorrowedRustVec::borrow to method

d829a14

Convert TypedMoveBorrowedRustVecMut::push_back to method - perf loss

16f290e

Convert TypedMoveBorrowedRustVecMut::borrow_mut to method - perf loss

cdc11a6

Convert TypedMoveBorrowedRustVecMut::pop_back to method - perf loss

534eeae

Convert TypedBorrowedRustVecMut::swap to method - perf loss

206728c

Convert TypedBorrowedRustVecMut::pop_back_discard to method

e327f56

Add TypedMoveBorrowedRustVecMut::len

dc8777a

Use method calls in vector::copy - perf loss

5715ef6

Convert TypedBorrowedRustVecMut::copy_from to method

1ea4398

vector::empty is unsafe

215b9de

Use method calls in vector::cmp_eq - perf win!

8ac9411

Use TypedMoveBorrowedRustVec in cmp_eq

6a48e3a

Remove stale todos

c25a090

Convert TypedBorrowedRustVec::cmp_eq to method - perf same

6ec55c6

Convert MoveUntypedVector::{empty,destroy_empty} to methods

035b746

move-native tests are unsafe

cb3511e

Remove vector free functions

e5eeac7

Reorder vector items

e2a39c3

Add docs to vector module

8ddbf68

Convert MoveByteVector::as_rust_vec to method

60fe35a

Move more vector conversion functions to methods

10f778d

Remove vector free functions

0ade3b7

brson added 4 commits July 29, 2023 23:21

Reorganize vector items

ee59b95

Move Debug impl into vector module

41f5ca1

clippy

3b3a2a0

fmt

7a8f372

brson force-pushed the rt-safer-2 branch from 78b9c8e to 7a8f372 Compare July 29, 2023 23:21

ksolana requested review from dmakarov, jcivlin and nvjle August 1, 2023 14:39

ksolana approved these changes Aug 1, 2023

dmakarov approved these changes Aug 1, 2023

nvjle approved these changes Aug 1, 2023

brson merged commit ef376eb into anza-xyz:llvm-sys Aug 10, 2023
4 checks passed
